What type of study is this?

This is a quantitative study.

October 9, 2025 · Open Access

Do AIs Dream of Electric Butterflies? Benchmarking LLM Consciousness via Theory-Grounded Self-Reports

Key points

The study finds distinct cognitive profiles among the large language models tested, indicating that they engage differently with consciousness-related constructs.
Across 840 self-report responses from 8 models, performance differences are statistically robust, and some models show specialization in certain cognitive tasks.
ConsciousnessBench, a benchmark grounded in 5 leading scientific theories, offers an empirical method for assessing consciousness-relevant traits in language models.
The study does not deliver a verdict on AI consciousness, but its findings indicate that consciousness-related capacities in models can now be studied empirically.

Abstract

Are state-of-the-art large language models conscious, or capable of anything like consciousness? We introduce ConsciousnessBench: the first systematic benchmark designed to empirically evaluate consciousness-relevant traits in frontier language models, grounded in 5 leading scientific theories. We assess 8 advanced models via 840 self-report responses, finding not only statistically robust performance differences, but—more importantly—evidence of distinct model cognitive profiles and engagement strategies with consciousness-related constructs. Our results reveal that some models demonstrate theoretical fluency, specialization in certain cognitive tasks, or even phenomenological exploration, while others default to deflection. While we cannot deliver a definitive verdict on AI consciousness, our findings show that consciousness-related capacities—and their computational diversity—are now empirically tractable, even if not yet empirically decidable.

Cite This Study

Haoran Zheng studied this question.

synapsesocial.com/papers/68e70da790569dd607ee5abe https://doi.org/10.31234/osf.io/fqwp9_v1
