December 4, 2025Open Access

The Artificial Intelligence Quotient (AIQ): Measuring Machine Intelligence Based on Multi-Domain Complexity and Similarity

Puntos clave

Evaluation indicates that AIQ-based benchmarks can effectively assess intelligent capabilities in AI systems, promoting equity among diverse domains.
Test suites constructed through the AIQ framework facilitate understanding of machine intelligence across varying complexities and similarities.
The proposed AIQ framework aims to eliminate bias in multi-domain evaluation, enhancing the quality of AI assessments.
Using known complexities, this framework underscores the importance of comprehensive benchmarks for valid intelligence evaluation. ”],

Resumen

The development of AI systems and benchmarks has been rapidly increasing, yet there has been a disproportionately small amount of examination into the domains used to evaluate these systems. Most benchmarks introduce bias by focusing on a particular type of domain or combine different domains without consideration of their relative complexity or similarity. We propose the Artificial Intelligence Quotient (AIQ) framework as a means for measuring the similarity and complexity of domains in order to remove these biases and assess the scope of intelligent capabilities evaluated by a benchmark composed of multiple domains. These measures are evaluated with several intuitive experiments using simple domains with known complexities and similarities. We construct test suites using the AIQ framework and evaluate them using known AI systems to validate that AIQ-based benchmarks capture an agent’s intelligence.

Leer artículo completoexternamente

Me gusta

Guardar

Ver artículo completo