What question did this study set out to answer?

This study evaluates how effectively ChatGPT-4o provides patient education on urinary incontinence using key metrics.

June 10, 2026 | Open Access

Assessing the understandability, actionability, reliability, and readability of ChatGPT-4o in providing patient education on urinary incontinence

Key Points

This study evaluates how effectively ChatGPT-4o provides patient education on urinary incontinence using key metrics.
Thirteen patient-focused questions based on AUA/SUFU and EAU guidelines were posed to ChatGPT-4o in Turkish.
Three blinded experts evaluated the responses using the Patient Education Materials Assessment Tool (PEMAT) and the modified DISCERN (mDISCERN) tool.
Readability was assessed with the Çetinkaya–Uzun formula; statistical analysis used descriptive statistics and the Intraclass Correlation Coefficient (ICC).
Experts showed strong agreement, with ICC values of 0.80 for PEMAT and 0.82 for mDISCERN.
Responses were highly understandable (peaking at 94.4% for diagnosis questions) but less actionable, with surgical considerations the least actionable at 68.2%.
Most responses were rated "difficult", requiring university-level education to comprehend, with surgical topics the most linguistically complex.
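
As an illustration of how a PEMAT percentage such as those above is typically derived, here is a minimal sketch. It assumes the standard PEMAT convention (each item rated Agree = 1 or Disagree = 0, with "not applicable" items excluded from the denominator). The function name `pemat_score` and the example ratings are hypothetical and are not taken from the study.

```python
from typing import Optional, Sequence


def pemat_score(ratings: Sequence[Optional[int]]) -> float:
    """Return a PEMAT-style percentage score.

    Each rating is 1 (Agree), 0 (Disagree), or None (Not Applicable).
    N/A items are dropped before computing the percentage.
    """
    applicable = [r for r in ratings if r is not None]
    if not applicable:
        raise ValueError("at least one applicable item is required")
    if any(r not in (0, 1) for r in applicable):
        raise ValueError("ratings must be 0, 1, or None")
    return 100.0 * sum(applicable) / len(applicable)


# Hypothetical ratings for one response: 3 of 4 applicable items agreed.
example = [1, 1, 0, None, 1]
print(pemat_score(example))  # 75.0
```

Understandability and actionability would each be scored this way on their own item subsets; how the study aggregated scores across raters and questions is not stated in this summary.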

Abstract

Objective: This study assesses ChatGPT-4o's responses to common patient inquiries regarding urinary incontinence (UI), a condition that significantly impacts quality of life but often goes untreated due to low healthcare-seeking behavior. The evaluation focuses on four key metrics: understandability, actionability, reliability, and readability.

Material and Methods: In this non-human subject qualitative study, 13 patient-focused questions, derived from AUA/SUFU and EAU guidelines, were posed to ChatGPT-4o in Turkish. The questions were categorized into four themes: Definition, Diagnosis, Management, and Surgical Considerations. Three blinded experts (a urogynecologist, a urologist, and a pelvic floor physiotherapist) independently evaluated the responses using the Patient Education Materials Assessment Tool (PEMAT) for understandability and actionability and the modified DISCERN (mDISCERN) tool for reliability. Readability was measured using the Çetinkaya–Uzun formula, which is specifically designed for Turkish text. Statistical analysis included descriptive statistics and the Intraclass Correlation Coefficient (ICC) to determine inter-rater reliability.

Results: The experts showed strong agreement in their assessments, with inter-rater reliability scores of 0.80 (95% CI: 0.70-0.91) for PEMAT and 0.82 (95% CI: 0.70-0.91) for mDISCERN. The AI's responses were consistently highly understandable, particularly when explaining diagnoses (a peak score of 94.4%), yet they were considerably less actionable, meaning they often failed to provide clear, practical steps for patients to follow. This gap was most evident in surgical considerations, which were deemed the least actionable at 68.2%. The overall reliability of the content was rated as "fair" across all categories, with surgical information being the most reliable. Most responses were classified as "difficult," requiring a university-level education to comprehend, with surgery-related topics being the most linguistically complex.

Conclusion: While ChatGPT-4o yields comprehensible health information, its limited actionability and high linguistic complexity pose barriers to patients with lower health literacy.
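
To make the inter-rater reliability statistic more concrete, the following is a minimal sketch of one common ICC variant, ICC(2,1) (two-way random effects, absolute agreement, single rater, following Shrout and Fleiss). The summary does not state which ICC form the authors used, so this choice is an assumption, and the ratings below are illustrative textbook-style data, not the study's data.

```python
from typing import Sequence


def icc_2_1(scores: Sequence[Sequence[float]]) -> float:
    """ICC(2,1) for a subjects-by-raters table (rows = rated items)."""
    n = len(scores)      # number of rated items (e.g. responses)
    k = len(scores[0])   # number of raters
    grand = sum(sum(row) for row in scores) / (n * k)
    row_means = [sum(row) / k for row in scores]
    col_means = [sum(row[j] for row in scores) / n for j in range(k)]

    # Two-way ANOVA sums of squares.
    ss_rows = k * sum((m - grand) ** 2 for m in row_means)
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)
    ss_total = sum((v - grand) ** 2 for row in scores for v in row)
    ss_err = ss_total - ss_rows - ss_cols

    # Mean squares for items, raters, and residual error.
    msr = ss_rows / (n - 1)
    msc = ss_cols / (k - 1)
    mse = ss_err / ((n - 1) * (k - 1))

    return (msr - mse) / (msr + (k - 1) * mse + k * (msc - mse) / n)


# Illustrative ratings: 6 items scored by 4 raters (not study data).
ratings = [
    [9, 2, 5, 8],
    [6, 1, 3, 2],
    [8, 4, 6, 8],
    [7, 1, 2, 6],
    [10, 5, 6, 9],
    [6, 2, 4, 7],
]
print(round(icc_2_1(ratings), 2))  # 0.29
```

Values near 1 indicate that raters assign nearly identical scores to each item. The confidence intervals reported in the abstract would require an additional F-distribution step that this sketch omits.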

Authors

Ayşe Filiz Gökmen Karasu

Bezmiâlem Vakıf Üniversitesi

Betul Cinar

Bezmiâlem Vakıf Üniversitesi

Melda Kuyucu

Izmir University

Journals

Digital Health

Institutions

Bingöl University

Bezmiâlem Vakıf Üniversitesi

State Hospital
