All terms
Evaluation
SimpleQA
A factual question benchmark for measuring accuracy and hallucination.
Definition
SimpleQA is a benchmark of short, fact-based questions that measures whether a model answers accurately and resists guessing when it does not know. It is useful for gauging hallucination — confident wrong answers — and overall factual reliability.