LLM Benchmark

Compare models across speed, cost, quality, and reliability

Configure Benchmark

Select Models

OpenAI

Anthropic

Gemini

Settings

Request Count: 100
Prompt Type: Mixed (summarization + Q&A + code)
Timeout (ms): 5000

Latency Comparison (ms)

Performance Radar

Full Results

Rank  Model           Success Rate  Avg Latency  P95 Latency  Cost/1K tokens  Throughput (req/s)
🥇    gpt-4o-mini     98.2%         680ms        1200ms       $0.15           45
🥈    claude-3-haiku  97.8%         720ms        1350ms       $0.25           38
🥉    gemini-flash    96.5%         450ms        890ms        $0.075          62
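
The page does not say how the table columns are derived, so the following is only a minimal sketch of one plausible way to compute them from raw per-request records. The `RequestResult` record, the `summarize` helper, the nearest-rank percentile method, and the choice to count every request (not only successful ones) toward throughput are all assumptions made for illustration, not the tool's actual implementation.

```python
import math
from dataclasses import dataclass
from statistics import mean


@dataclass
class RequestResult:
    """One benchmark request (hypothetical record shape)."""
    latency_ms: float
    succeeded: bool  # False for errors or requests that hit the timeout


def percentile(values, pct):
    """Nearest-rank percentile: the smallest value with at least pct% of the data at or below it."""
    ordered = sorted(values)
    rank = max(1, math.ceil(pct / 100 * len(ordered)))
    return ordered[rank - 1]


def summarize(results, wall_clock_s):
    """Aggregate raw results into success rate, average latency, P95 latency, and throughput."""
    ok_latencies = [r.latency_ms for r in results if r.succeeded]
    return {
        "success_rate_pct": 100 * len(ok_latencies) / len(results),
        # Latency statistics cover successful requests only (an assumption).
        "avg_latency_ms": mean(ok_latencies) if ok_latencies else None,
        "p95_latency_ms": percentile(ok_latencies, 95) if ok_latencies else None,
        # Throughput counts every request issued during the run (an assumption).
        "throughput_rps": len(results) / wall_clock_s,
    }
```

Calling `summarize(results, wall_clock_s)` once per model would yield one row of the table, with cost left out because it depends on each provider's pricing and on token counts that this sketch does not track.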