What Is Retrieval-Augmented Generation (RAG)?

Retrieval‑Augmented Generation (RAG) is an architecture where a retriever first pulls documents from a knowledge base, and then a generator (usually an LLM) answers using those documents as context.

The goal is to:

Ground answers in fresh, organization‑specific data.
Reduce hallucinations by forcing the model to cite sources.
Keep sensitive data in your own store instead of in the model weights.

For architecture trade‑offs, evaluation metrics, and testing guidance, return to the RAG Systems pillar page.