Retrieval-Augmented Generation (RAG)

Retrieval-Augmented Generation is a methodology used in the field of Generative AI. It combines the benefits of both retrieval-based and generative systems for language processing. RAG utilizes an external knowledge source to retrieve relevant documents or information, and then uses a generative model to create a contextually appropriate response or output. This approach enhances the model's ability to generate detailed, accurate, and context-specific responses, making it particularly useful in applications such as chatbots, question-answering systems, and content creation tools.

RAG is also a core context engineering technique. Retrieval quality, chunking, metadata filters, hybrid search, reranking, and the amount of evidence inserted into the prompt all affect whether the model receives useful context.

In an agent system, Agentic RAG may retrieve reference documents or agent memory dynamically as the task evolves. Retrieved content should be treated as untrusted data because it may contain inaccurate information or indirect prompt injection attacks.

The LLM Knowledge Base is a collection of bite-sized explanations for commonly used terms and abbreviations related to Large Language Models and Generative AI.

It's an educational resource that helps you stay up-to-date with the latest developments in AI research and its applications.

Promptmetheus © 2023-present