🛡️
Session Flagged

Your session has been flagged for unusual activity.

You can try our app by searching for MultipleChat AI on Google and clicking the multiplechat.ai link to try it free.
Quick verification

Please confirm you're human to continue.


TrainingFoundations Updated 2026

Adam (Optimizer)

A widely used optimization algorithm that adapts the learning rate for each parameter, making neural-network training faster and more stable.

Kingma & Ba (2015) introduced Adam, which maintains running estimates of both the first and second moments of each parameter's gradients and uses them to scale per-parameter step sizes. It combines the benefits of earlier methods (momentum and adaptive learning rates) and works well with little tuning.

Adam and its variants are the default optimizers for training large language models and most modern neural networks.

References

Primary, peer-reviewed and archival sources for this definition.

Adam: A Method for Stochastic Optimization
Kingma, D. P., & Ba, J. (2015). International Conference on Learning Representations (ICLR 2015).

Dictionary & encyclopedic entries

Cite this entry

MultipleChat. "Adam (Optimizer)." MultipleChat AI & LLM Glossary, 2026. https://multiple.chat/ai-glossary/adam

Related terms

See this in practice

Run the same prompt across ChatGPT, Claude, Gemini and Grok — grounded in your own sources, cross-checked against each other.

Try MultipleChat Free

Continue learning

See paid plans