Gemini 2.0 Flash High (February 2025)

Performance overview across all HAL benchmarks

1
Benchmarks
1
Agents
1
Pareto Optimal Benchmarks

Benchmark Performance

On the Pareto Frontier? indicates whether this model achieved a Pareto-optimal trade-off between accuracy and cost on that benchmark. Models on the Pareto frontier represent the current state-of-the-art efficiency for their performance level.

Benchmark Agent Accuracy Cost On the Pareto Frontier?
Taubench Airline
TAU-bench Tool Calling 28.00% $0.31 Yes