Gemini 2.0 Flash High (February 2025)
Performance overview across all HAL benchmarks
1
Benchmarks
1
Agents
1
Pareto Optimal Benchmarks
Benchmark Performance
On the Pareto Frontier? indicates whether this model achieved a Pareto-optimal trade-off between accuracy and cost on that benchmark. Models on the Pareto frontier represent the current state-of-the-art efficiency for their performance level.
| Benchmark | Agent | Accuracy | Cost | On the Pareto Frontier? |
|---|---|---|---|---|
|
Taubench Airline
|
TAU-bench Tool Calling | 28.00% | $0.31 | Yes |