What question did this study set out to answer?

The research aims to compare the performance and operational characteristics of RAG and MCP pipelines in small LLM environments.

March 7, 2026Open Access

Performance and Operational Characteristics of RAG vs MCP Knowledge Access Pipelines in Small LLM Environments

Key Points

The research aims to compare the performance and operational characteristics of RAG and MCP pipelines in small LLM environments.
Empirical comparison of RAG and MCP pipelines
Used 105 Docker documentation queries
Tested with Qwen 2.5 7B and Llama 3.1 8B models
Evaluated peak accuracy and operational costs
Assessed tuning parameters and failure points
Tuned RAG achieved peak accuracy of 75.0% and naive RAG at 29.4% shows high configuration sensitivity
MCP parallel achieved 64.0% accuracy without parameter tuning
MCP reduced tuning parameters from 9 to 2 and failure points from 3 to 1
On 14B models, MCP outscored tuned RAG with 73.1% vs. 66.7%
Demonstrated a performance–operational cost trade-off favoring MCP for simplicity.

Abstract

Retrieval-Augmented Generation (RAG) is the standard approach for augmenting LLM knowledge, yet high operational complexity and personnel dependency remain persistent challenges. This study empirically compares RAG and MCP (Model Context Protocol)-based pipelines on 105 Docker documentation queries using Qwen 2.5 7B and Llama 3.1 8B models. Tuned RAG achieves peak accuracy (75.0%), but the 45.6 percentage point gap from naive RAG (29.4%) reveals extreme configuration sensitivity. MCP parallel achieves 64.0% without parameter tuning, while reducing tuning parameters from 9 to 2, failure points from 3 to 1, and eliminating external dependencies beyond the LLM. On 14B models, MCP (73.1%) outscored tuned RAG (66.7%), suggesting a possible tipping point as model capabilities improve. We present quantitative evidence for the performance–operational cost trade-off between RAG and MCP, demonstrating that MCP-inspired architectures constitute a viable alternative where operational simplicity is prioritized.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Authors

Hyunwoo Jeon

Taesung Kim

Kang Han

Actions

Institutions

Sejong University

Peace Corps

SNPedia

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Performance and Operational Characteristics of RAG vs MCP Knowledge Access Pipelines in Small LLM Environments

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Authors

Actions

Institutions

References and Citations

Citation Network

Connected Papers

Discussion

Cite this study