Evaluation

LiveBench

A frequently updated benchmark designed to resist contamination.

Definition

LiveBench is a benchmark that refreshes its questions regularly, drawing them from recent sources to limit contamination (the problem of test questions leaking into a model's training data). It scores language models across categories such as math, coding, reasoning, language, instruction following, and data analysis.
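
The idea behind using recent questions can be illustrated with a small sketch: a question published after a model's training cutoff cannot have appeared in its training data. The code below is not LiveBench's implementation. The `Question` record, its fields, the `fresh_questions` and `category_averages` helpers, and all dates and scores are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass
from datetime import date
from statistics import mean


@dataclass
class Question:
    category: str   # hypothetical label, e.g. "math" or "coding"
    released: date  # hypothetical: date the question was published
    score: float    # hypothetical: the model's score on it, in [0, 1]


def fresh_questions(questions, training_cutoff):
    """Keep only questions released after the model's training cutoff."""
    return [q for q in questions if q.released > training_cutoff]


def category_averages(questions):
    """Return the mean score for each category."""
    by_category = {}
    for q in questions:
        by_category.setdefault(q.category, []).append(q.score)
    return {cat: mean(scores) for cat, scores in by_category.items()}


# Made-up example data, not real benchmark results.
questions = [
    Question("math", date(2024, 1, 10), 1.0),
    Question("math", date(2024, 6, 5), 0.5),
    Question("coding", date(2024, 7, 1), 1.0),
    Question("coding", date(2024, 8, 1), 0.0),
]
cutoff = date(2024, 3, 1)  # assumed training cutoff for some model

fresh = fresh_questions(questions, cutoff)
averages = category_averages(fresh)
print(averages)
```

Only the three questions released after the assumed cutoff contribute to the per-category averages; the January question is dropped because the model could plausibly have seen it during training.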