Mixtral 8x22B, a sparse mixture-of-experts model for large-scale inference.
Release Date: 2/15/2024
Avg. Accuracy: 76.8%
Latency: 6.45s
Performance by Benchmark
Academic Benchmarks
Proprietary Benchmarks (contact us to get access)
Cost Analysis
Input Cost: $1.20 / M Tokens
Output Cost: $1.20 / M Tokens
Input Cost (per char): N/A
Output Cost (per char): N/A
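As a rough illustration of how the per-token prices above translate into a per-request cost, here is a minimal Python sketch. The function name estimate_cost and the example token counts are hypothetical and not part of the listing; only the two $1.20 / M Tokens rates come from it.

```python
# Prices taken from the listing above, in USD per million tokens.
PRICE_PER_MILLION_INPUT = 1.20
PRICE_PER_MILLION_OUTPUT = 1.20


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Return the estimated USD cost of one request (hypothetical helper)."""
    total = (
        input_tokens * PRICE_PER_MILLION_INPUT
        + output_tokens * PRICE_PER_MILLION_OUTPUT
    )
    return total / 1_000_000


if __name__ == "__main__":
    # Example token counts are assumptions chosen for illustration.
    print(f"${estimate_cost(2_000, 500):.4f}")
```

Because the input and output rates are identical here, the cost depends only on the total token count; the two rates are kept separate so the sketch still applies if they differ.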