🛡️
Session Flagged

Your session has been flagged for unusual activity.

You can try our app by searching for MultipleChat AI on Google and clicking the multiplechat.ai link to try it free.
Quick verification

Please confirm you're human to continue.


Models & ProductsLLM Architecture Updated 2026

BERT (Bidirectional Encoder Representations from Transformers)

An encoder-only Transformer pre-trained to read text in both directions at once, widely used for understanding tasks like classification, search and extraction.

BERT, introduced by Devlin et al. (2019), is pre-trained with a masked-language-modelling objective: it hides random tokens and learns to predict them from both left and right context. This bidirectional reading made it exceptionally strong at language understanding and reset the state of the art across many NLP benchmarks.

Unlike GPT, BERT is an encoder built for comprehension rather than open-ended generation — still the backbone of many search and classification systems.

References

Primary, peer-reviewed and archival sources for this definition.

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Devlin, J., Chang, M.-W., Lee, K., & Toutanova, K. (2019). Proceedings of NAACL-HLT 2019, pp. 4171–4186.

Dictionary & encyclopedic entries

Cite this entry

MultipleChat. "BERT (Bidirectional Encoder Representations from Transformers)." MultipleChat AI & LLM Glossary, 2026. https://multiple.chat/ai-glossary/bert

Related terms

See this in practice

Run the same prompt across ChatGPT, Claude, Gemini and Grok — grounded in your own sources, cross-checked against each other.

Try MultipleChat Free

Continue learning

See paid plans