Data

Token

A token is the unit of text an AI model processes — roughly ¾ of an English word on average. Models charge by input + output tokens, so prompt length and response length both drive cost.

Related terms

  • Context Window A context window is the maximum amount of text (measured in tokens) that an AI model can process in a single inference call.
  • Inference Inference is the process of running a trained AI model to produce outputs from inputs.

← Back to glossary