Llama 3.3 Instruct Turbo, 70B parameters with FP16 quantization.
Released Date: 12/6/2024
Avg. Accuracy:
71.1%Latency:
9.38sPerformance by Benchmark
Benchmarks
Accuracy
Rankings
Academic Benchmarks
Proprietary Benchmarks (contact us to get access)
Cost Analysis
Input Cost
$0.88 / M Tokens
Output Cost
$0.88 / M Tokens
Input Cost (per char)
$0.22 / M chars
Output Cost (per char)
$0.29 / M chars