Data
Context Window
A context window is the maximum amount of text (measured in tokens) that an AI model can process in a single inference call. Modern frontier models support 200K–1M tokens, but effective recall and reasoning quality often degrade well before that limit.
Related terms
- LLM (Large Language Model) — A large language model (LLM) is a neural network trained on broad text corpora that can generate, summarize, translate, classify, and reason about natural language.
- RAG (Retrieval-Augmented Generation) — RAG is a pattern in which an AI model retrieves relevant documents from a knowledge base at query time and uses them as additional context to generate its response.