🛡️
Session Flagged

Your session has been flagged for unusual activity.

You can try our app by searching for MultipleChat AI on Google and clicking the multiplechat.ai link to try it free.
Quick verification

Please confirm you're human to continue.


DecodingPrompting Updated 2026

Temperature

A setting that controls randomness in a model's output: low temperature gives focused, repeatable answers; high temperature gives more varied, creative ones.

When a model picks the next token it produces a probability distribution via the softmax function. Temperature scales the logits before softmax by a factor of 1/T — a mechanism described by Hinton et al. (2015). Higher T flattens the distribution so unlikely tokens get a fairer chance; lower T sharpens it toward the single most probable token.

At temperature 0 the output is effectively deterministic — useful for extraction or code. Higher values suit brainstorming, but raise the risk of drift and hallucination.

References

Primary, peer-reviewed and archival sources for this definition.

Distilling the Knowledge in a Neural Network
Hinton, G., Vinyals, O., & Dean, J. (2015). NeurIPS 2014 Deep Learning Workshop (arXiv preprint).

Dictionary & encyclopedic entries

Cite this entry

MultipleChat. "Temperature." MultipleChat AI & LLM Glossary, 2026. https://multiple.chat/ai-glossary/temperature

Related terms

See this in practice

Run the same prompt across ChatGPT, Claude, Gemini and Grok — grounded in your own sources, cross-checked against each other.

Try MultipleChat Free

Continue learning

See paid plans