What question did this study set out to answer?

This research aims to develop a human-centered framework for assessing large language models and their various intelligences compared to humans.

May 10, 2026

Unraveling Generative AI from a Human Intelligence Perspective: A Battery of Experiments

Key Points

This research aims to develop a human-centered framework for assessing large language models and their various intelligences compared to humans.
Conducted extensive online experiments using a novel framework based on behavioral theory.
Assessed GPT-4 against humans in multiple intelligence domains including cognitive, emotional, and social intelligence.
Evaluated the impact of GPT-4 across various job roles to validate the framework's effectiveness.
GPT-4 outperformed humans in cognitive, emotional, and creative intelligence but lagged in social intelligence.
Identification of specific areas where GPT-4 struggles, such as understanding mental states and social interest.
Findings align with established labor market research regarding the integration and impact of LLMs.

Abstract

This study introduces a novel, human-centered framework for evaluating the holistic intelligence of large language models (LLMs), using behavioral theory and experimental benchmarks drawn from human intelligence. Through extensive online experiments, the framework reveals that GPT-4 outperforms humans in cognitive, emotional, and creative intelligence, but falls short in social intelligence, especially in social interest, self-efficacy, and understanding mental states. Beyond theoretical insight, the study validates this framework by assessing GPT-4’s impact across diverse job roles, finding results consistent with established labor market research. It also offers a reusable tool for firms and policymakers to evaluate LLM intelligence and forecast job-level impacts. This enables informed decisions about where and how to integrate LLMs, match models to specific job requirements, and identify risks in socially intensive roles. The framework provides a foundation for responsible LLM deployment, ensuring alignment with human-centered structures and supporting strategic workforce planning.

AI से पूछें

Bookmark