What question did this study set out to answer?

This research examines how users engage with AI-generated guidance, focusing on trust and perceived risk in high-stakes domains.

June 7, 2026 | Open Access

A Human-Centered Evaluation of AI-Generated Guidance: Integrated Statistical and Machine Learning Analysis with a Risk Framework for High-Stakes Domains

Key Points

Survey data collected from 572 participants in Saudi Arabia.
Quantitative analysis of trust, privacy concerns, and credibility using statistical methods.
Qualitative analysis of open-ended responses using embedding-based clustering (BERT, UMAP, and HDBSCAN), followed by expert interpretation.
Moderate usage of AI systems with low trust levels reported (mean trust score 3/10).
Users exhibit strong concerns about reliability and source credibility.
Preference for expert validation of AI outputs in complex scenarios, indicating a cautious engagement with AI.

Abstract

The increasing use of large language models (LLMs) in domains requiring interpretation and judgment has raised critical questions about trust, reliability, and accountability, particularly in contexts where decisions carry significant consequences. While prior work has focused primarily on improving system performance, limited attention has been given to how users evaluate and interact with AI-generated guidance in real-world, high-stakes settings. This paper addresses this gap through a large-scale empirical investigation of public perceptions of AI-generated religious guidance in Saudi Arabia. The analysis is based on survey data collected from 572 participants and combines quantitative statistical methods with a machine learning-based pipeline for analyzing open-ended responses. The quantitative component examines patterns in trust, perceived risk, privacy concerns, credibility, and user practices, while the qualitative component employs embedding-based clustering using Bidirectional Encoder Representations from Transformers (BERT), Uniform Manifold Approximation and Projection (UMAP), and Hierarchical Density-Based Spatial Clustering of Applications with Noise (HDBSCAN), followed by expert interpretation to derive structured parameters. The results indicate a cautious and conditional engagement with AI systems, characterized by moderate usage, low levels of trust, and strong concerns regarding reliability and source credibility. Users frequently verify AI-generated outputs and demonstrate a preference for human expert validation, particularly in complex or sensitive cases. Building on these insights, the study introduces a layered taxonomy of perceived risks spanning epistemic, reasoning, interactional, and institutional dimensions, providing a structured analytical framework for understanding how technical limitations translate into broader behavioural and governance challenges.
These results highlight the importance of aligning AI system design with user expectations, emphasizing transparency, verifiability, and human oversight. The proposed taxonomy and analytical framework provide a foundation for future research and contribute to the development of governance approaches for AI systems deployed in high-stakes interpretive domains.

Cite This Study

Al-Turki et al. studied this question.

synapsesocial.com/papers/6a250c957def13d035e1cd27 https://doi.org/10.14569/ijacsa.2026.0170591
