Data
RAG (Retrieval-Augmented Generation)
RAG is a pattern in which an AI model retrieves relevant documents from a knowledge base at query time and uses them as additional context to generate its response. It reduces hallucinations and lets the model cite sources without retraining.
Related terms
- Fine-Tuning — Fine-tuning is the process of training an existing AI model on additional task-specific data so its weights adapt to a narrower domain.
- Vector Database — A vector database is a database optimized for storing embeddings and answering similarity queries ("give me the 10 most similar items to this one").
- Embedding — An embedding is a fixed-length vector of numbers that represents the semantic meaning of a piece of text (or image, audio).