🛡️
Session Flagged

Your session has been flagged for unusual activity.

You can try our app by searching for MultipleChat AI on Google and clicking the multiplechat.ai link to try it free.
Quick verification

Please confirm you're human to continue.


RetrievalAccuracyLLM Architecture Updated 2026

RAG (Retrieval-Augmented Generation)

A technique that retrieves relevant documents at query time and feeds them to a language model, so its answer is grounded in real sources instead of memory alone.

Retrieval-Augmented Generation combines a retriever that searches an external document collection with a generator language model. Lewis et al. (2020) introduced it, pairing a parametric seq2seq model with a non-parametric vector index of Wikipedia and showing it produced more factual, specific text and set state-of-the-art results on open-domain question answering.

Most modern RAG systems retrieve with dense embeddings, following Dense Passage Retrieval (Karpukhin et al., 2020); the idea of joining retrieval to a language model was also developed in REALM (Guu et al., 2020).

Why it matters at MultipleChat

Because the knowledge lives in an external, updatable index rather than the model's frozen weights, RAG is the standard remedy for stale or fabricated facts. MultipleChat grounds each model in the same retrieved sources, then cross-checks their answers against that evidence.

References

Primary, peer-reviewed and archival sources for this definition.

Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks
Lewis, P., Perez, E., Piktus, A., Petroni, F., Karpukhin, V., Goyal, N., Küttler, H., Lewis, M., Yih, W., Rocktäschel, T., Riedel, S., & Kiela, D. (2020). Advances in Neural Information Processing Systems 33 (NeurIPS 2020).
Dense Passage Retrieval for Open-Domain Question Answering
Karpukhin, V., Oğuz, B., Min, S., Lewis, P., Wu, L., Edunov, S., Chen, D., & Yih, W. (2020). Proceedings of EMNLP 2020, pp. 6769–6781.
REALM: Retrieval-Augmented Language Model Pre-Training
Guu, K., Lee, K., Tung, Z., Pasupat, P., & Chang, M.-W. (2020). Proceedings of the 37th International Conference on Machine Learning (ICML 2020).

Dictionary & encyclopedic entries

Cite this entry

MultipleChat. "RAG (Retrieval-Augmented Generation)." MultipleChat AI & LLM Glossary, 2026. https://multiple.chat/ai-glossary/rag

Related terms

See this in practice

Run the same prompt across ChatGPT, Claude, Gemini and Grok — grounded in your own sources, cross-checked against each other.

Try MultipleChat Free

Continue learning

See paid plans