Alignment · Safety · Updated 2026

Alignment

The work of making an AI system's behaviour match human intent and values. It is the reason a well-aligned model follows instructions reliably and refuses harmful requests.

Alignment research aims to ensure capable models do what their designers and users actually want. Practical techniques include learning from human preferences (Christiano et al., 2017), instruction-following via RLHF (Ouyang et al., 2022), and rule-guided self-improvement as in Constitutional AI (Bai et al., 2022).
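To make "learning from human preferences" concrete, here is a minimal sketch of the pairwise preference loss used to fit a reward model in that line of work: a Bradley-Terry style probability that one response beats another, trained with a negative log-likelihood. The function names and the reward scores are hypothetical, chosen for illustration only. In a real system the scores would come from a learned neural reward model, not hand-picked numbers.

```python
import math


def preference_probability(reward_a: float, reward_b: float) -> float:
    """Bradley-Terry probability that response A is preferred over response B.

    Equivalent to sigmoid(reward_a - reward_b).
    """
    return 1.0 / (1.0 + math.exp(reward_b - reward_a))


def preference_loss(reward_chosen: float, reward_rejected: float) -> float:
    """Negative log-likelihood of the human's choice under the model above."""
    return -math.log(preference_probability(reward_chosen, reward_rejected))


# Hypothetical reward scores for two responses to the same prompt.
# The loss is small when the reward model ranks the human-chosen response higher,
# and large when it ranks the rejected response higher.
loss_when_model_agrees = preference_loss(reward_chosen=2.0, reward_rejected=0.0)
loss_when_model_disagrees = preference_loss(reward_chosen=0.0, reward_rejected=2.0)

print(f"model agrees with human:    {loss_when_model_agrees:.4f}")
print(f"model disagrees with human: {loss_when_model_disagrees:.4f}")
```

Minimising this loss over many human comparisons pushes the reward model to score preferred responses higher. A policy can then be optimised against that learned reward, which is the general shape of RLHF.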

Alignment is an open problem, not a solved checkbox: as models grow more capable, specifying and verifying intended behaviour becomes harder, not easier.

References

Primary, peer-reviewed and archival sources for this definition.

Deep Reinforcement Learning from Human Preferences

Christiano, P., Leike, J., Brown, T. B., Martic, M., Legg, S., & Amodei, D. (2017). Advances in Neural Information Processing Systems 30 (NeurIPS 2017).

Source arXiv:1706.03741

Training language models to follow instructions with human feedback

Ouyang, L., Wu, J., Jiang, X., Almeida, D., Wainwright, C. L., et al. (2022). Advances in Neural Information Processing Systems 35 (NeurIPS 2022).

Source arXiv:2203.02155

Constitutional AI: Harmlessness from AI Feedback

Bai, Y., Kadavath, S., Kundu, S., Askell, A., Kernion, J., et al. (2022). arXiv preprint (Anthropic).

Source arXiv:2212.08073

Dictionary & encyclopedic entries

Wikipedia — AI alignment
IBM — Think / Topics — What is AI alignment?

Cite this entry

MultipleChat. "Alignment." MultipleChat AI & LLM Glossary, 2026. https://multiple.chat/ai-glossary/alignment

Related terms

RLHF (Reinforcement Learning from Human Feedback)
System Prompt
Hallucination
