Independent Evaluation, Unbiased Benchmarks

Testing AI on Real-World Tasks

We benchmark the world's leading AI models on rigorous, domain-specific tasks in finance, law, software, healthcare, and more. We run all of our own evaluations and create many of our benchmarks in-house.

Vals AI Updates

Fresh updates from our testing queue

benchmark
06/17/2026

Harvey's Legal Agent Benchmark Released

System

Accuracy

11.25% ± 2.17
9.58% ± 2.15
5.00% ± 1.65
4.17% ± 1.65
3.75% ± 1.43
3.75% ± 1.17
2.50% ± 0.84
1.67% ± 0.83
1.67% ± 0.84
0.42% ± 0.00
Showing the top 10 models from the benchmark. Visit the benchmark page to view more.

Industry Leaderboard

Independent benchmarks for industry-specific AI performance.

Industry
Benchmark