Models & Products · LLM Architecture · Updated 2026

GPT (Generative Pre-trained Transformer)

A family of decoder-only Transformer models that are pre-trained to predict the next token on large text corpora, then adapted to downstream tasks.
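To make "predict the next token" concrete, here is a minimal sketch in plain Python. The helper names (`next_token_pairs`, `causal_mask`, `average_nll`) are hypothetical and chosen for illustration; they are not taken from any GPT codebase. The sketch shows how one token sequence yields a training target at every position, how a decoder-only model is restricted to looking leftward, and how the loss is averaged over the probabilities assigned to the true next tokens.

```python
import math

def next_token_pairs(tokens):
    """Return (context, target) pairs: each prefix predicts the token after it."""
    return [(tokens[:i], tokens[i]) for i in range(1, len(tokens))]

def causal_mask(n):
    """Lower-triangular mask: position i may attend to positions 0..i only."""
    return [[1 if j <= i else 0 for j in range(n)] for i in range(n)]

def average_nll(target_probs):
    """Mean negative log-likelihood of the probabilities given to the true next tokens."""
    return -sum(math.log(p) for p in target_probs) / len(target_probs)

# Illustrative usage with a toy, word-level "tokenisation" (an assumption;
# real models use subword tokens).
tokens = ["the", "cat", "sat"]
for context, target in next_token_pairs(tokens):
    print(context, "->", target)
```

Pre-training minimises this average loss over a large corpus; the mask is what makes the architecture "decoder-only" in the sense that no position can see tokens to its right.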

GPT denotes the generative pre-training recipe introduced by Radford et al. (2018): pre-train a decoder-only Transformer on unlabelled text, then fine-tune it on each labelled downstream task. Brown et al. (2020) scaled this to GPT-3 (175B parameters) and showed that a sufficiently large model can perform new tasks from a few examples placed in the prompt, with no gradient updates (few-shot, in-context learning), establishing the template most modern LLMs follow.
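Few-shot, in-context learning can be illustrated by how the prompt itself is assembled. The sketch below is an assumption-laden example: `build_few_shot_prompt` is a hypothetical helper, and the `=>` separator is one illustrative layout, not a required format. It shows that the "training examples" are simply text placed before the query, and the model's weights are never changed.

```python
def build_few_shot_prompt(task_description, examples, query):
    """Assemble a few-shot prompt: task line, solved examples, then the open query.

    The model is expected to continue the text after the final separator;
    no fine-tuning or gradient update is involved.
    """
    lines = [task_description]
    for source, target in examples:
        lines.append(f"{source} => {target}")
    lines.append(f"{query} =>")  # left open for the model to complete
    return "\n".join(lines)

# Illustrative usage (the example pairs are demonstration data).
prompt = build_few_shot_prompt(
    "Translate English to French:",
    [("sea otter", "loutre de mer"), ("cheese", "fromage")],
    "peppermint",
)
print(prompt)
```

Passing an empty `examples` list gives the zero-shot case; a single pair gives one-shot.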

OpenAI's ChatGPT is the best-known product built on this lineage, but "GPT" now broadly names the decoder-only, next-token-prediction approach.

References

Primary, peer-reviewed and archival sources for this definition.

Improving Language Understanding by Generative Pre-Training
Radford, A., Narasimhan, K., Salimans, T., & Sutskever, I. (2018). OpenAI Technical Report.
Language Models are Few-Shot Learners
Brown, T. B., Mann, B., Ryder, N., Subbiah, M., Kaplan, J., et al. (2020). Advances in Neural Information Processing Systems 33 (NeurIPS 2020).

Cite this entry

MultipleChat. "GPT (Generative Pre-trained Transformer)." MultipleChat AI & LLM Glossary, 2026. https://multiple.chat/ai-glossary/gpt
