Key points are not available for this paper at this time.
Abstract The aim of the study is to evaluate and compare the quality and readability of responses generated by five different artificial intelligence (AI) chatbots—ChatGPT, Bard, Bing, Ernie, and Copilot—to the top searched queries of erectile dysfunction (ED). Google Trends was used to identify ED-related relevant phrases. Each AI chatbot received a specific sequence of 25 frequently searched terms as input. Responses were evaluated using DISCERN, Ensuring Quality Information for Patients (EQIP), and Flesch-Kincaid Grade Level (FKGL) and Reading Ease (FKRE) metrics. The top three most frequently searched phrases were “erectile dysfunction cause”, “how to erectile dysfunction,” and “erectile dysfunction treatment.” Zimbabwe, Zambia, and Ghana exhibited the highest level of interest in ED. None of the AI chatbots achieved the necessary degree of readability. However, Bard exhibited significantly higher FKRE and FKGL ratings ( p = 0.001), and Copilot achieved better EQIP and DISCERN ratings than the other chatbots ( p = 0.001). Bard exhibited the simplest linguistic framework and posed the least challenge in terms of readability and comprehension, and Copilot’s text quality on ED was superior to the other chatbots. As new chatbots are introduced, their understandability and text quality increase, providing better guidance to patients.
Building similarity graph...
Analyzing shared references across papers
Loading...
Mehmet Fatih Şahin
Hüseyin Ateş
Anıl Keleş
Journal of Medical Systems
Tekirdağ Namık Kemal University
Bursa Technical University
Building similarity graph...
Analyzing shared references across papers
Loading...
Şahin et al. (Wed,) studied this question.
www.synapsesocial.com/papers/68e7079eb6db643587681eb9 — DOI: https://doi.org/10.1007/s10916-024-02056-0
Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context: