Contemporary artificial intelligence systems, despite their remarkable achievements in natural language processing and computer vision, operate on a fundamentally reductive paradigm: they treat language and visual concepts as abstract statistical patterns divorced from their historical genesis, cultural embeddedness, and existential purpose. This position paper identifies what we term the "Ontological Gap" in current transformer-based architectures the systematic absence of mechanisms that encode the raison d'être of concepts beyond their distributional correlations in training data. We propose Teleo-Transformers, a novel architectural framework that augments traditional attention mechanisms with a fourth dimension: Causal Embeddings that link vocabulary and visual concepts to their etymological roots, historical contexts, and existential meanings. Unlike existing approaches that treat "resistance" merely as a token statistically proximate to "conflict" and "war," our framework grounds it in its ontological foundation the rejection of humiliation and the assertion of dignity enabling more culturally coherent, historically authentic, and semantically grounded outputs. We demonstrate how this ontological dimension addresses critical limitations across both language and vision domains, including semantic hallucination in image generation, cultural bias in multimodal models, and temporal inconsistency in video synthesis. This paper establishes the theoretical foundation for a research program that treats language and vision not as probability distributions to be optimized, but as cultural inheritances and intentional acts to be understood. We present this work as foundational conceptual architecture preceding empirical implementation, inviting the research community to explore this overlooked dimension of artificial intelligence.
Building similarity graph...
Analyzing shared references across papers
Loading...
Momen Ghazouani
Building similarity graph...
Analyzing shared references across papers
Loading...
Momen Ghazouani (Wed,) studied this question.
www.synapsesocial.com/papers/698586498f7c464f2300a451 — DOI: https://doi.org/10.5281/zenodo.18477392