Evaluation Updated 2026

ROUGE

A family of automatic metrics for summarization that measure overlap between generated text and human reference summaries.

ROUGE (Recall-Oriented Understudy for Gisting Evaluation), introduced by Lin (2004), scores a summary by how much of its content matches human reference summaries, counted as overlapping n-grams (ROUGE-N) or as the longest common subsequence (ROUGE-L). It emphasizes recall, i.e. how much of the reference content the summary captures, and it became the standard automatic metric for summarization.
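To make the two kinds of overlap concrete, here is a minimal Python sketch of recall-only ROUGE-N and ROUGE-L for a single reference. It is an illustration under simplifying assumptions, not the original ROUGE package: tokenization is lowercasing plus whitespace splitting, there is no stemming or stopword handling, and the function names are hypothetical.

```python
from collections import Counter


def ngrams(tokens, n):
    """Count the n-grams (as tuples) in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def rouge_n_recall(candidate, reference, n=1):
    """Fraction of reference n-grams that also appear in the candidate."""
    # Assumption: naive lowercase + whitespace tokenization.
    cand = ngrams(candidate.lower().split(), n)
    ref = ngrams(reference.lower().split(), n)
    total = sum(ref.values())
    if total == 0:
        return 0.0
    # Clip each match at the candidate's count so repeats are not over-credited.
    overlap = sum(min(count, cand[gram]) for gram, count in ref.items())
    return overlap / total


def lcs_length(a, b):
    """Length of the longest common subsequence of two token lists."""
    prev = [0] * (len(b) + 1)
    for x in a:
        curr = [0]
        for j, y in enumerate(b, start=1):
            curr.append(prev[j - 1] + 1 if x == y else max(prev[j], curr[j - 1]))
        prev = curr
    return prev[-1]


def rouge_l_recall(candidate, reference):
    """LCS length divided by the reference length (recall side of ROUGE-L)."""
    cand = candidate.lower().split()
    ref = reference.lower().split()
    if not ref:
        return 0.0
    return lcs_length(cand, ref) / len(ref)


reference = "the cat sat on the mat"
candidate = "the cat lay on the mat"
print(rouge_n_recall(candidate, reference, n=1))  # 5 of 6 reference unigrams matched
print(rouge_n_recall(candidate, reference, n=2))  # 3 of 5 reference bigrams matched
print(rouge_l_recall(candidate, reference))       # LCS "the cat on the mat" has length 5
```

Full implementations typically also report precision and an F-measure and can take several references per summary; this sketch keeps only the recall side described above.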

Like BLEU, ROUGE measures surface overlap rather than meaning, so it complements rather than replaces human review.
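The gap between surface overlap and meaning can be seen in a small Python sketch. It uses a simplified unigram recall (lowercase, whitespace tokens, clipped counts) as a stand-in for ROUGE-1; the helper name and the example sentences are made up for illustration.

```python
from collections import Counter


def unigram_recall(candidate, reference):
    """Simplified ROUGE-1 recall: share of reference tokens found in the candidate."""
    cand = Counter(candidate.lower().split())
    ref = Counter(reference.lower().split())
    total = sum(ref.values())
    if total == 0:
        return 0.0
    # Clip matches at the candidate's count for each token.
    return sum(min(count, cand[tok]) for tok, count in ref.items()) / total


reference = "the company reported higher profits"
paraphrase = "the firm announced increased earnings"   # same meaning, different words
contradiction = "the company reported lower profits"   # opposite meaning, same words

print(unigram_recall(paraphrase, reference))      # 0.2: only "the" overlaps
print(unigram_recall(contradiction, reference))   # 0.8: four of five tokens overlap
```

The faithful paraphrase scores far below the contradictory sentence, which is the kind of case where a human reviewer catches what the overlap score misses.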

References

Primary, peer-reviewed and archival sources for this definition.

ROUGE: A Package for Automatic Evaluation of Summaries
Lin, C.-Y. (2004). Text Summarization Branches Out (ACL-04 Workshop), pp. 74–81.

Cite this entry

MultipleChat. "ROUGE." MultipleChat AI & LLM Glossary, 2026. https://multiple.chat/ai-glossary/rouge
