What is RAG? | TapUp Digital Glossary

RAG is a technique where an AI retrieves relevant information from an external database or document collection before generating its response.

A standard generative AI composes answers from the data it was trained on, which means it can't reliably handle information that emerged after training — such as recent news or proprietary internal data. When it doesn't know something, it can still produce a confident-sounding but incorrect answer, a problem known as hallucination.

With RAG, the AI first searches an external database for information related to the question. Rather than reading an entire document, it picks up short excerpts called chunks that match the search and passes those as context when generating its answer. For example, a question about an internal policy would trigger a search through the policy manual, and the matching passages would be passed to the AI to base its response on.

This means the AI doesn't need to be retrained whenever data changes — keeping the knowledge base up to date is enough to improve accuracy. That said, RAG is not foolproof: if the right chunk isn't retrieved, if key information is split across chunk boundaries, or if the AI fills in missing details on its own, errors can still occur. It's a system that improves accuracy — not one that guarantees correct answers.

RAG

In Simple Terms

Behind the Name

Take a Closer Look!