The Public Standard for Real World AI Performance

Generic benchmarks only go so far.
Vals AI evaluates models on the real tasks each industry relies on.

Vals Index
Vals Index
Finance
Healthcare
Math
AIME

Challenging national math exam given to top high-school students

View Details
MATH 500

Academic math benchmark on probability, algebra, and trigonometry

View Details
MGSM

A multilingual benchmark for mathematical questions.

View Details
Academic
Education
Coding
Beta