Data
Token
A token is the unit of text an AI model processes — roughly ¾ of an English word on average. Models charge by input + output tokens, so prompt length and response length both drive cost.
Related terms
- Context Window — A context window is the maximum amount of text (measured in tokens) that an AI model can process in a single inference call.
- Inference — Inference is the process of running a trained AI model to produce outputs from inputs.