Llama 2 Reference Model, 13B parameters with FP16 quantization.
Release Date: 7/18/2023
Avg. Accuracy: 53.8%
Latency: 1.41s
Performance by Benchmark
Benchmarks
Accuracy
Rankings
Academic Benchmarks
Proprietary Benchmarks (contact us to get access)
Cost Analysis
Input Cost: $0.20 / M Tokens
Output Cost: $0.20 / M Tokens
Input Cost (per char): $0.00 / M chars
Output Cost (per char): $0.01 / M chars
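As a rough illustration of how the per-token prices above translate into a per-request cost, the sketch below multiplies token counts by the listed $0.20 / M token rates. The function name `request_cost` and the example token counts are hypothetical and not part of the original listing. It assumes billing is linear in token count with no minimum charge.

```python
# Prices copied from the cost listing above, in USD per one million tokens.
INPUT_COST_PER_M_TOKENS = 0.20
OUTPUT_COST_PER_M_TOKENS = 0.20


def request_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request (hypothetical helper).

    Assumes cost scales linearly with token count and that no minimum
    charge or rounding rule applies.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (
        input_tokens * INPUT_COST_PER_M_TOKENS
        + output_tokens * OUTPUT_COST_PER_M_TOKENS
    )
    return total / 1_000_000


if __name__ == "__main__":
    # Example: a 1,500-token prompt with a 500-token completion.
    print(f"${request_cost(1500, 500):.6f}")
```

Because the input and output rates are the same for this model, only the total token count matters. The two rates are kept as separate constants so the sketch still applies if they differ.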