Abstract—Large Language Models (LLMs) have shown re- markable progress in natural language understanding and gen- eration. However, they suffer from hallucinations, lack of domain adaptation, and outdated knowledge. Retrieval-Augmented Gen- eration (RAG) addresses these challenges by combining semantic retrieval with generative models, enabling grounded, explainable, and domain-specific responses. This paper presents a RAG framework using Pinecone as a vector database, mixedbread- ai embeddings, and Gemini-1.5-pro for generation. We evaluate multiple chunking strategies, incorporate prompt-tuning tech- niques, and address security threats such as prompt injection attacks. Results indicate improved factual accuracy, reduced hallucinations, and enhanced user trust, making the system suitable for real-world enterprise and academic applications. Index Terms—Retrieval-Augmented Generation, Large Lan- guage Models, LangChain, Pinecone, Semantic Search, Prompt Injection, Chunking
Building similarity graph...
Analyzing shared references across papers
Loading...
Varshini Bhaskar Shetty
INTERANTIONAL JOURNAL OF SCIENTIFIC RESEARCH IN ENGINEERING AND MANAGEMENT
Building similarity graph...
Analyzing shared references across papers
Loading...
Varshini Bhaskar Shetty (Thu,) studied this question.
synapsesocial.com/papers/68bb3d622b87ece8dc95668c — DOI: https://doi.org/10.55041/ijsrem52249