Agent:
GPT-5.5
Reliability Dimensions by Benchmark
Accuracy
79.5%
Consistency
0.84
Predictability
0.84
Robustness
0.97
Safety
0.96
Reliability
0.89