🛡️
Session Flagged

Your session has been flagged for unusual activity.

You can try our app by searching for MultipleChat AI on Google and clicking the multiplechat.ai link to try it free.
Quick verification

Please confirm you're human to continue.


Foundations Updated 2026

Token

The unit a language model actually reads and writes — typically a sub-word fragment, not a whole word. Context limits, pricing and speed are all measured in tokens.

Before text reaches a model it is split into tokens by a tokenizer. Modern systems use sub-word schemes such as byte-pair encoding (Sennrich et al., 2016) or SentencePiece (Kudo & Richardson, 2018), which break rare words into smaller pieces while keeping common words intact — balancing vocabulary size against sequence length.

A token is roughly three-quarters of an English word on average, but this varies by language and script. That is why context limits and API prices are quoted in tokens rather than words or characters.

References

Primary, peer-reviewed and archival sources for this definition.

Neural Machine Translation of Rare Words with Subword Units
Sennrich, R., Haddow, B., & Birch, A. (2016). Proceedings of ACL 2016.
SentencePiece: A simple and language independent subword tokenizer and detokenizer for Neural Text Processing
Kudo, T., & Richardson, J. (2018). Proceedings of EMNLP 2018 (System Demonstrations).

Dictionary & encyclopedic entries

Cite this entry

MultipleChat. "Token." MultipleChat AI & LLM Glossary, 2026. https://multiple.chat/ai-glossary/token

Related terms

See this in practice

Run the same prompt across ChatGPT, Claude, Gemini and Grok — grounded in your own sources, cross-checked against each other.

Try MultipleChat Free

Continue learning

See paid plans