How smart is your AI model, really?
AI IQ intelligently estimates the IQ's of popular AI models
How AI IQ estimates model intelligence
- We pull in the performance ratings of models on objective industry-standard benchmarks
- We map each benchmark score to an implied IQ based on informed assessments of its difficulty
- We combine implied IQ scores into a single estimated IQ for each model
How AI EQ estimates emotional intelligence
- We pull in each model's EQ-Bench 3 Elo score and Arena Elo score
- We map each Elo score to an estimated EQ using calibrated piecewise-linear scales
- Anthropic models receive a 200-point Elo penalty on EQ-Bench to correct for family bias (judged by Claude)
- The composite EQ weights Arena 50% and EQ-Bench 50% (or uses one if only one is available)
Effective cost & isoquant curves
Effective cost adjusts each model's raw price by its token efficiency. Some models use fewer tokens to achieve the same result. We measure equivalent-token usage across benchmarks (ARC, SWE-bench, HLE, and others), then compute a geometric-mean multiplier relative to the median. A model that consistently uses fewer tokens gets a lower effective cost.
Isoquant curves are lines of equal preference. Models on the same curve give the same tradeoff between the selected metric and cost. Use the dropdown to switch between IQ, the four dimension IQs, or any of the 10 individual benchmarks. Use the slider to set your preference: slide left if the metric matters more, right if cost matters more. The curves reshape to reflect your tradeoff — models above and to the right of a curve are strictly better.
Speed & isoquant curves
Response time measures how long each model takes to return a complete answer, in seconds. Faster models appear further left on the chart.
Isoquant curves show lines of equal preference between the selected metric and speed. Use the dropdown to switch between IQ, the four dimension IQs, or any of the 10 individual benchmarks. Use the slider to set your tradeoff: slide left if the metric matters more, right if speed matters more. Models above and to the left of a curve are strictly better at that preference level.