
Models & Products Updated 2026

Whisper

OpenAI's speech-recognition model, trained on a very large, diverse audio set to transcribe speech robustly across many languages and to translate it into English.

Whisper, introduced by Radford et al. (2022), is an automatic speech-recognition and speech-translation model trained with weak supervision on 680,000 hours of multilingual, multitask audio collected from the web. That scale let it generalize to many languages and recording conditions in a zero-shot setting, approaching human accuracy and robustness without dataset-specific fine-tuning.

Whisper is widely used to turn voice into text for transcription, captioning and voice interfaces, including voice input to AI assistants.
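To make the captioning use case concrete, here is a minimal Python sketch that turns timed transcript segments into SubRip (SRT) caption text. It assumes each segment is a dictionary with `start` and `end` times in seconds and a `text` string, which mirrors the shape of the `segments` list returned by the open-source `openai-whisper` package. The function names and the sample segments are hypothetical, written for illustration only, and nothing here calls the model itself.

```python
from typing import Iterable, Mapping


def format_timestamp(seconds: float) -> str:
    """Render a time in seconds as an SRT timestamp: HH:MM:SS,mmm."""
    total_ms = int(round(seconds * 1000))
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def segments_to_srt(segments: Iterable[Mapping]) -> str:
    """Turn timed transcript segments into SRT caption text.

    Assumption: every segment has "start", "end" (seconds) and "text" keys.
    """
    blocks = []
    for index, seg in enumerate(segments, start=1):
        start = format_timestamp(seg["start"])
        end = format_timestamp(seg["end"])
        text = seg["text"].strip()  # drop the leading space segments often carry
        blocks.append(f"{index}\n{start} --> {end}\n{text}")
    return "\n\n".join(blocks) + "\n"


# Hypothetical segments, hand-written for illustration (not real model output).
example_segments = [
    {"start": 0.0, "end": 2.5, "text": " Hello and welcome."},
    {"start": 2.5, "end": 5.04, "text": " Today we talk about speech recognition."},
]

if __name__ == "__main__":
    print(segments_to_srt(example_segments))
```

If the `openai-whisper` package is installed, the same function could in principle be fed `whisper.load_model("base").transcribe("audio.mp3")["segments"]`, where `"audio.mp3"` is a placeholder path. That step needs the model weights and ffmpeg, so it is left out of the sketch.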

References

Primary, peer-reviewed and archival sources for this definition.

Robust Speech Recognition via Large-Scale Weak Supervision (Whisper)
Radford, A., Kim, J. W., Xu, T., Brockman, G., McLeavey, C., & Sutskever, I. (2022). arXiv preprint (OpenAI).

Cite this entry

MultipleChat. "Whisper." MultipleChat AI & LLM Glossary, 2026. https://multiple.chat/ai-glossary/whisper
