What type of study is this?

This is a Quantitative Study study.

September 16, 2025

Insights into Moral Reasoning of AI: A Comparative Study Between Humans and Large Language Models

Key Points

Humans consistently outperform large language models in moral competence, indicating significant differences.
Outputs from LLMs highlight harm/care and fairness/reciprocity but neglect other important moral values.
The findings suggest that biases in training data influence the moral reasoning exhibited by LLMs.
Continuous alignment and auditing of LLMs are vital to ensure ethical and socially responsible applications.

Abstract

This study investigates the moral reasoning capabilities of large language models (LLMs), focusing on biases and the extent to which outputs reflect training data patterns rather than genuine reasoning. Using the Moral Competence Test (MCT) and the Moral Foundations Questionnaire (MFQ), we compared responses from human participants and LLM-based chatbots like ChatGPT. MCT results show that humans consistently outperform LLMs, indicating higher moral competence. MFQ responses from LLMs emphasize harm/care and fairness/reciprocity, but under-represent loyalty, authority, and purity. This pattern suggests a data-proportionality effect, where moral emphasis mirrors the prevalence of certain values in training data. Additionally, fine-tuning methods such as reinforcement learning with human feedback may amplify specific moral norms. These imbalances could unintentionally shape users' moral intuitions and societal norms when LLMs are widely deployed. Our findings underscore the need for continuous auditing and alignment to ensure that LLMs provide ethically balanced and socially responsible guidance in morally sensitive applications.

اسأل الذكاء الاصطناعي

Bookmark

اسأل الذكاء الاصطناعي

Bookmark

Insights into Moral Reasoning of AI: A Comparative Study Between Humans and Large Language Models

Key Points

Abstract

Cite This Study