Benchmarks
Private question-answer benchmark over Canadian court cases.
Updated 12/11/2024
Benchmarking Model Performance on Contract Law Tasks
Evaluating language models on a wide range of open-source legal reasoning tasks.
Evaluating Language Models on a Corporate Finance Task
Evaluating Language Models on Tax Domain Questions