AI IQ — Intelligently Measuring AI Intelligence

How smart is your AI model, really?

AI IQ intelligently estimates the IQs of popular AI models

AI Models by IQ

Each model's estimated IQ plotted on a standard normal IQ distribution
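To read a position on that curve as a percentile, here is a minimal sketch. It assumes the conventional IQ scaling of mean 100 and standard deviation 15, which this page does not state explicitly; the function name is illustrative.

```python
from statistics import NormalDist

# Assumption: conventional IQ scaling (mean 100, SD 15). Not stated on the page.
IQ_SCALE = NormalDist(mu=100, sigma=15)


def iq_percentile(iq: float) -> float:
    """Return the percentage of the reference distribution that falls below `iq`."""
    return 100 * IQ_SCALE.cdf(iq)


if __name__ == "__main__":
    for iq in (85, 100, 115, 130):
        print(f"IQ {iq}: percentile {iq_percentile(iq):.1f}")
```

Under this assumption, an estimated IQ of 130 sits two standard deviations above the mean.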

How AI IQ estimates model intelligence

We pull in the performance ratings of models on objective industry-standard benchmarks
We map each benchmark score to an implied IQ based on informed assessments of its difficulty
We combine implied IQ scores into a single estimated IQ for each model
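The three steps above can be sketched roughly as follows. This is only an illustration: the benchmark names, the calibration ranges, the linear mapping, and the plain average used to combine scores are all assumptions, not the actual AI IQ calibration.

```python
from statistics import mean

# Hypothetical calibration: benchmark -> (IQ implied by a 0% score, IQ implied by 100%).
# A harder benchmark is given a higher range. Values are illustrative only.
CALIBRATION = {
    "easy_benchmark": (70.0, 120.0),
    "hard_benchmark": (100.0, 160.0),
}


def implied_iq(benchmark: str, score_pct: float) -> float:
    """Map a benchmark score (0-100%) to an implied IQ with a linear scale (assumed)."""
    low, high = CALIBRATION[benchmark]
    clamped = min(max(score_pct, 0.0), 100.0)
    return low + (high - low) * clamped / 100.0


def estimated_iq(scores: dict) -> float:
    """Combine implied IQs into one estimate using a simple mean (assumed)."""
    return mean([implied_iq(name, score) for name, score in scores.items()])


if __name__ == "__main__":
    print(estimated_iq({"easy_benchmark": 80.0, "hard_benchmark": 25.0}))
```

The point of the per-benchmark calibration is that the same raw score implies a higher IQ on a harder benchmark.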

AI Models by EQ

Each model's estimated EQ plotted on a standard normal EQ distribution

How AI EQ estimates emotional intelligence

We pull in each model's EQ-Bench 3 Elo score and Arena Elo score
We map each Elo score to an estimated EQ using calibrated piecewise-linear scales
Anthropic models receive a 200-point Elo penalty on EQ-Bench to correct for family bias, since EQ-Bench is judged by Claude
The composite EQ weights Arena 50% and EQ-Bench 50% (or uses one if only one is available)
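A minimal sketch of the steps above. The 200-point penalty, the 50/50 weighting, and the single-source fallback come from the description; the anchor points in the two scales are hypothetical placeholders, not the calibrated values.

```python
from bisect import bisect_right
from typing import Optional

# Hypothetical (Elo, EQ) anchor points. The real calibrated scales are not published here.
EQ_BENCH_SCALE = [(800.0, 85.0), (1200.0, 100.0), (1600.0, 130.0)]
ARENA_SCALE = [(1000.0, 85.0), (1300.0, 100.0), (1500.0, 130.0)]

ANTHROPIC_EQ_BENCH_PENALTY = 200.0  # Elo points, as stated in the description


def piecewise_linear(points, x):
    """Interpolate linearly between anchors, clamping outside the anchored range."""
    xs = [p[0] for p in points]
    if x <= xs[0]:
        return points[0][1]
    if x >= xs[-1]:
        return points[-1][1]
    i = bisect_right(xs, x)
    (x0, y0), (x1, y1) = points[i - 1], points[i]
    return y0 + (y1 - y0) * (x - x0) / (x1 - x0)


def composite_eq(eq_bench_elo: Optional[float],
                 arena_elo: Optional[float],
                 is_anthropic: bool = False) -> Optional[float]:
    """Average the available EQ estimates; return None if neither score exists."""
    estimates = []
    if eq_bench_elo is not None:
        if is_anthropic:
            eq_bench_elo -= ANTHROPIC_EQ_BENCH_PENALTY
        estimates.append(piecewise_linear(EQ_BENCH_SCALE, eq_bench_elo))
    if arena_elo is not None:
        estimates.append(piecewise_linear(ARENA_SCALE, arena_elo))
    if not estimates:
        return None
    return sum(estimates) / len(estimates)


if __name__ == "__main__":
    print(composite_eq(1400.0, 1300.0))
    print(composite_eq(1400.0, 1300.0, is_anthropic=True))
```

Note that the penalty is applied to the Elo score before mapping, so its effect on EQ depends on which segment of the scale the model falls in.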

Effective Cost

IQ vs Effective Cost

Each model's estimated IQ plotted against its token-efficiency-adjusted cost per 1M output tokens

Effective cost & isoquant curves

Effective cost adjusts each model's raw price by its token efficiency. Some models use fewer tokens to achieve the same result. We measure equivalent-token usage across benchmarks (ARC, SWE-bench, HLE, and others), then compute a geometric-mean multiplier relative to the median. A model that consistently uses fewer tokens gets a lower effective cost.
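As a rough sketch of that calculation: the code below assumes "relative to the median" means the median token usage across models on each benchmark, and that the multiplier scales the raw price directly. Both are assumptions, and the model and benchmark names are placeholders.

```python
from statistics import geometric_mean, median


def token_multipliers(usage: dict) -> dict:
    """usage maps model -> {benchmark: tokens used}. Returns model -> multiplier.

    Each multiplier is the geometric mean, over benchmarks, of the model's
    token usage divided by the per-benchmark median across models (assumed).
    """
    benchmarks = {b for per_model in usage.values() for b in per_model}
    medians = {
        b: median([per_model[b] for per_model in usage.values() if b in per_model])
        for b in benchmarks
    }
    return {
        model: geometric_mean([tokens / medians[b] for b, tokens in per_model.items()])
        for model, per_model in usage.items()
    }


def effective_cost(raw_price_per_1m: float, multiplier: float) -> float:
    """Scale the raw price per 1M output tokens by the token-efficiency multiplier."""
    return raw_price_per_1m * multiplier


if __name__ == "__main__":
    usage = {
        "model_a": {"bench_x": 100, "bench_y": 100},
        "model_b": {"bench_x": 200, "bench_y": 200},
        "model_c": {"bench_x": 400, "bench_y": 400},
    }
    multipliers = token_multipliers(usage)
    print(effective_cost(10.0, multipliers["model_a"]))
```

A geometric mean keeps one unusually token-hungry benchmark from dominating the multiplier, which is a plausible reason for choosing it over an arithmetic mean.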

Isoquant curves are lines of equal preference. Models on the same curve give the same tradeoff between the selected metric and cost. Use the dropdown to switch between IQ, the four dimension IQs, or any of the 10 individual benchmarks. Use the slider to set your preference: slide left if the metric matters more, right if cost matters more. The curves reshape to reflect your tradeoff — models above and to the right of a curve are strictly better.

IQ Cost x:y = x cost halvings are required to justify a drop of y IQ points
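One way to make the x:y rule concrete is a score that stays constant along a curve. The sketch below assumes the curves are linear in the base-2 logarithm of cost, which matches the "halvings" wording but is not confirmed as the form the charts use; the function names are illustrative.

```python
from math import log2


def preference_score(iq: float, cost: float, halvings: float, iq_drop: float) -> float:
    """Score that is equal for all points on one isoquant under an x:y tradeoff.

    Each halving of cost is worth iq_drop / halvings IQ points (assumed log-linear form).
    """
    return iq - (iq_drop / halvings) * log2(cost)


def isoquant_iq(cost: float, ref_iq: float, ref_cost: float,
                halvings: float, iq_drop: float) -> float:
    """IQ a model at `cost` needs to lie on the curve through (ref_cost, ref_iq)."""
    return ref_iq + (iq_drop / halvings) * log2(cost / ref_cost)


if __name__ == "__main__":
    # With a 2:10 tradeoff, two cost halvings justify a 10-point IQ drop,
    # so these two models land on the same curve.
    print(preference_score(120.0, 8.0, halvings=2, iq_drop=10))
    print(preference_score(110.0, 2.0, halvings=2, iq_drop=10))
```

A model with a higher score than another lies on a better curve at that slider setting. The same form would apply to the response-time tradeoff, with seconds in place of cost.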

Response Time

IQ vs Response Time

Each model's estimated IQ plotted against its response time in seconds

Speed & isoquant curves

Response time measures how long each model takes to return a complete answer, in seconds. Faster models appear further left on the chart.

Isoquant curves show lines of equal preference between the selected metric and speed. Use the dropdown to switch between IQ, the four dimension IQs, or any of the 10 individual benchmarks. Use the slider to set your tradeoff: slide left if the metric matters more, right if speed matters more. Models above and to the left of a curve are strictly better at that preference level.

IQ Speed x:y = x response time halvings are required to justify a drop of y IQ points

10 benchmarks, 4 dimensions

How dimensions relate to composite IQ

2 benchmarks, 1 composite

Anthropic family-bias adjustment

How benchmarks relate to composite EQ