What question did this study set out to answer?

To improve multilingual information retrieval and reasoning using a hybrid neural-symbolic framework.

April 18, 2026

Robust Long-Context Multilingual Retrieval and Reasoning Enabled by Combined Neural and Symbolic Techniques

Key Points

To improve multilingual information retrieval and reasoning using a hybrid neural-symbolic framework.
Proposed a hybrid model called CROSS for cross-lingual retrieval.
Integrated multilingual embeddings for effective context narrowing.
Developed NeuroSymbolic Augmented Reasoning (NSAR) for structured fact extraction and code generation.
Evaluated on the mLongRR-V2 benchmark with seven languages and very long documents.
CROSS achieved up to 92% retrieval accuracy, surpassing neural-only baselines.
NSAR reduced reasoning failures by five times compared to previous methods.
Maintained consistent performance across varied languages and document lengths.

Abstract

Large language models (LLMs) are increasingly deployed for multilingual information retrieval and reasoning over very long documents, yet they often struggle with extracting dispersed facts and synthesizing robust answers across linguistic boundaries. In this work, we propose a hybrid neural-symbolic framework that integrates scalable cross-lingual retrieval with explicit symbolic reasoning. Our approach, CROSS (Cross-lingual Retrieval Optimized for Scalable Solutions), efficiently narrows massive multilingual contexts using multilingual embeddings, dramatically improving retrieval accuracy and mitigating the “lost-in-the-middle” problem. Building on this, we introduce NeuroSymbolic Augmented Reasoning (NSAR) , which prompts LLMs to extract structured facts and generate executable Python code, enabling deterministic and interpretable multitarget reasoning. We evaluate our methods on the mLongRR-V2 benchmark, spanning seven languages, 49 cross-lingual pairs, and documents up to 512,000 words. Our experiments show that compared to neural-only baselines, CROSS boosts a retrieval accuracy of up to 92% and NSAR reduces reasoning failures fivefold, while maintaining stable performance across languages and context sizes. These results establish a new standard for robust, scalable, and interpretable multilingual information extraction, demonstrating the promise of hybrid neural-symbolic architectures for future artificial intelligence systems.

Bookmark

Robust Long-Context Multilingual Retrieval and Reasoning Enabled by Combined Neural and Symbolic Techniques

Key Points

Abstract

Cite This Study