What question did this study set out to answer?

This research aims to evaluate the effectiveness and limitations of MBTI-based personality profiling utilizing large language models.

April 26, 2026Open Access

A critical analysis of MBTI-based personality profiling with large language models

Key Points

This research aims to evaluate the effectiveness and limitations of MBTI-based personality profiling utilizing large language models.
Analyzed recent work on machine learning and transformer models from 2020 to 2025.
Reviewed performance on datasets like Kaggle MBTI and MBTIBench, assessing accuracy and biases.
Investigated the application of MBTI instruments to evaluate AI-generated personalities.
LLM-based systems achieved 75%–85% accuracy at the dichotomy level but faced modest improvements over baselines.
Found issues like polarized predictions and overconfidence in AI outputs.
Identified that LLMs exhibit context-dependent, socially desirable personality-like traits.

Abstract

This paper critically analyzes MBTI-based personality profiling using Large Language Models (LLMs), examining both their use as tools for inferring human personality and as subjects evaluated through psychometric frameworks. We review recent work (2020–2025) spanning traditional machine learning, fine-tuned transformer models, and zero-shot prompting approaches across datasets such as Kaggle MBTI, PersonalityCafe, Pandora, and MBTIBench. While top-performing LLM-based systems report 75%–85% accuracy at the dichotomy level, improvements over baselines are often modest, domain-dependent, and sensitive to dataset biases. Recent benchmarks employing soft labels reveal systematic issues, including polarized predictions, overconfidence, and limited calibration relative to population trait distributions. Beyond predictive performance, we examine emerging research that applies MBTI instruments directly to LLMs, showing that models exhibit reproducible yet context-dependent “personality-like” profiles, often skewed toward socially desirable traits due to alignment training. These findings raise conceptual questions about whether stable internal dispositions can meaningfully be attributed to generative systems whose outputs vary across prompts and versions. We argue that MBTI-based modeling with LLMs faces three core challenges: psychometric limitations of the MBTI construct itself, methodological weaknesses in self-reported training data, and philosophical ambiguity regarding the notion of AI personality. The paper concludes by outlining ethical risks, evaluation gaps, and research directions for more rigorous, calibrated, and theoretically grounded personality modeling in artificial intelligence systems.

AI에게 질문

Bookmark

View Full Paper

Cite This Study

Tshimula et al. (Wed,) studied this question.

synapsesocial.com/papers/69edaa9b4a46254e215b30f8 https://doi.org/https://doi.org/10.3389/fncom.2026.1800284

AI에게 질문

Bookmark

View Full Paper