AI IQ

How smart is your AI model, really?

AI IQ intelligently estimates the IQs of popular AI models

AI Models by IQ
Each model's estimated IQ plotted on a standard normal IQ distribution

How AI IQ estimates model intelligence

  1. We pull in the performance ratings of models on objective industry-standard benchmarks
  2. We map each benchmark score to an implied IQ based on informed assessments of its difficulty
  3. We combine implied IQ scores into a single estimated IQ for each model
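
The three steps above can be sketched as follows. This is a minimal illustration, not the site's actual method: the benchmark names, the calibration anchors in `HYPOTHETICAL_SCALES`, the use of piecewise-linear interpolation for step 2, and the plain average in step 3 are all assumptions made for the example.

```python
from bisect import bisect_right
from statistics import mean


def interpolate(x, anchors):
    """Piecewise-linear interpolation through sorted (score, iq) anchors.

    Values outside the anchor range are clamped to the end points.
    """
    xs = [a[0] for a in anchors]
    ys = [a[1] for a in anchors]
    if x <= xs[0]:
        return ys[0]
    if x >= xs[-1]:
        return ys[-1]
    i = bisect_right(xs, x)
    x0, x1 = xs[i - 1], xs[i]
    y0, y1 = ys[i - 1], ys[i]
    return y0 + (y1 - y0) * (x - x0) / (x1 - x0)


# Hypothetical calibration: benchmark score (%) -> implied IQ.
# A harder benchmark maps the same score to a higher implied IQ.
HYPOTHETICAL_SCALES = {
    "benchmark_a": [(0.0, 80.0), (50.0, 110.0), (100.0, 140.0)],
    "benchmark_b": [(0.0, 90.0), (100.0, 150.0)],
}


def estimate_iq(scores):
    """Map each benchmark score to an implied IQ, then average them.

    The unweighted mean is an assumption; the real combination rule
    is not stated on this page.
    """
    implied = [
        interpolate(score, HYPOTHETICAL_SCALES[name])
        for name, score in scores.items()
    ]
    return mean(implied)


print(estimate_iq({"benchmark_a": 50.0, "benchmark_b": 50.0}))
```

With these made-up anchors, a score of 50 implies an IQ of 110 on `benchmark_a` and 120 on `benchmark_b`, so the combined estimate is 115.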

AI Models by EQ
Each model's estimated EQ plotted on a standard normal EQ distribution

How AI EQ estimates emotional intelligence

  1. We pull in each model's EQ-Bench 3 Elo score and Arena Elo score
  2. We map each Elo score to an estimated EQ using calibrated piecewise-linear scales
  3. Because EQ-Bench is judged by Claude, Anthropic models receive a 200-point Elo penalty on EQ-Bench to correct for family bias
  4. The composite EQ weights Arena 50% and EQ-Bench 50% (or uses whichever score is available if a model has only one)
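
As a rough sketch of the four steps above: the 200-point penalty and the 50/50 weighting come from the list, while the anchor points in `HYPOTHETICAL_EQBENCH_SCALE` and `HYPOTHETICAL_ARENA_SCALE` are invented for illustration and are not the site's calibration.

```python
FAMILY_BIAS_PENALTY = 200.0  # Elo points, applied to Anthropic models on EQ-Bench

# Hypothetical (elo, eq) anchors; the real calibrated scales are not given here.
HYPOTHETICAL_EQBENCH_SCALE = [(800.0, 85.0), (1200.0, 100.0), (1600.0, 130.0)]
HYPOTHETICAL_ARENA_SCALE = [(1000.0, 85.0), (1300.0, 100.0), (1500.0, 130.0)]


def piecewise_linear(x, anchors):
    """Interpolate linearly between sorted (x, y) anchors, clamping at the ends."""
    if x <= anchors[0][0]:
        return anchors[0][1]
    if x >= anchors[-1][0]:
        return anchors[-1][1]
    for (x0, y0), (x1, y1) in zip(anchors, anchors[1:]):
        if x0 <= x <= x1:
            return y0 + (y1 - y0) * (x - x0) / (x1 - x0)


def composite_eq(eqbench_elo=None, arena_elo=None, is_anthropic=False):
    """Average the available EQ estimates (50/50 when both are present)."""
    parts = []
    if eqbench_elo is not None:
        adjusted = eqbench_elo - FAMILY_BIAS_PENALTY if is_anthropic else eqbench_elo
        parts.append(piecewise_linear(adjusted, HYPOTHETICAL_EQBENCH_SCALE))
    if arena_elo is not None:
        parts.append(piecewise_linear(arena_elo, HYPOTHETICAL_ARENA_SCALE))
    if not parts:
        return None  # no Elo score available for this model
    return sum(parts) / len(parts)


print(composite_eq(eqbench_elo=1400.0, arena_elo=1300.0, is_anthropic=True))
print(composite_eq(eqbench_elo=1400.0))
```

Averaging whatever estimates exist gives the 50/50 split when both scores are present and falls back to the single score otherwise.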

IQ vs Effective Cost
Each model's estimated IQ plotted against its token-efficiency-adjusted cost per 1M output tokens

Effective cost & isoquant curves

Effective cost adjusts each model's raw price by its token efficiency. Some models use fewer tokens to achieve the same result. We measure equivalent-token usage across benchmarks (ARC, SWE-bench, HLE, and others), then compute a geometric-mean multiplier relative to the median. A model that consistently uses fewer tokens gets a lower effective cost.
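
The adjustment can be sketched as below. The paragraph above does not say exactly how the median enters the calculation, so this example assumes each model's token usage is divided by the per-benchmark median across models and the resulting ratios are combined with a geometric mean. The model names and token counts are made up.

```python
from math import exp, log
from statistics import median


def efficiency_multipliers(tokens_by_model):
    """Return {model: multiplier} from {model: {benchmark: tokens_used}}.

    A multiplier below 1 means the model uses fewer tokens than the
    median model; above 1 means it uses more.
    """
    benchmarks = set().union(*(usage.keys() for usage in tokens_by_model.values()))
    # Median token usage per benchmark, over the models that report it.
    medians = {
        b: median(usage[b] for usage in tokens_by_model.values() if b in usage)
        for b in benchmarks
    }
    multipliers = {}
    for model, usage in tokens_by_model.items():
        # Geometric mean computed as exp(mean(log(ratio))).
        log_ratios = [log(tokens / medians[b]) for b, tokens in usage.items()]
        multipliers[model] = exp(sum(log_ratios) / len(log_ratios))
    return multipliers


def effective_cost(raw_price_per_million, multiplier):
    """Scale the raw price per 1M output tokens by the efficiency multiplier."""
    return raw_price_per_million * multiplier


# Hypothetical token usage for three made-up models.
usage = {
    "model_a": {"arc": 500, "swe_bench": 2000},
    "model_b": {"arc": 1000, "swe_bench": 4000},
    "model_c": {"arc": 2000, "swe_bench": 8000},
}
multipliers = efficiency_multipliers(usage)
print(effective_cost(10.0, multipliers["model_a"]))
```

In this toy data `model_a` uses half the median tokens on every benchmark, so its multiplier is about 0.5 and a raw price of 10 becomes an effective cost of about 5.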

Isoquant curves are lines of equal preference: models on the same curve offer an equally attractive tradeoff between the selected metric and cost. Use the dropdown to switch between IQ, the four dimension IQs, or any of the 10 individual benchmarks. Use the slider to set your preference: slide left if the metric matters more, right if cost matters more. The curves reshape to reflect your tradeoff, and models above and to the right of a curve are strictly better than any model on that curve.

IQ:Cost x:y means x cost halvings are required to justify a drop of y IQ points
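
One way to read the x:y setting is as a score that stays constant along an isoquant. The sketch below assumes the tradeoff is linear in IQ and in log2 of cost, which matches the "halvings" wording but is not confirmed as the site's exact formula. The function name `tradeoff_score` is hypothetical.

```python
from math import log2


def tradeoff_score(iq, cost, halvings, iq_points):
    """Score that is equal for all models on the same isoquant.

    Under an x:y setting (x = halvings, y = iq_points), each halving
    of cost is worth y / x IQ points.
    """
    iq_per_halving = iq_points / halvings
    return iq - iq_per_halving * log2(cost)


# With a 2:10 setting, two cost halvings offset a 10-point IQ drop,
# so these two hypothetical models land on the same curve.
print(tradeoff_score(120, 8.0, halvings=2, iq_points=10))
print(tradeoff_score(110, 2.0, halvings=2, iq_points=10))
```

Both calls return the same score, because the second model costs a quarter as much (two halvings) and gives up exactly 10 IQ points.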

IQ vs Response Time
Each model's estimated IQ plotted against its response time in seconds

Speed & isoquant curves

Response time measures how long each model takes to return a complete answer, in seconds. Faster models appear further left on the chart.

Isoquant curves show lines of equal preference between the selected metric and speed. Use the dropdown to switch between IQ, the four dimension IQs, or any of the 10 individual benchmarks. Use the slider to set your tradeoff: slide left if the metric matters more, right if speed matters more. Models above and to the left of a curve are strictly better at that preference level.

IQ:Speed x:y means x response time halvings are required to justify a drop of y IQ points
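
To see what such a curve looks like, the sketch below traces the isoquant through a reference model: for each response time it returns the IQ that would be equally preferred. It assumes the curve is a straight line in IQ versus log2 of response time. The function name and the sample numbers are illustrative only.

```python
from math import log2


def iq_on_isoquant(ref_iq, ref_seconds, seconds, halvings, iq_points):
    """IQ at `seconds` that is equally preferred to the reference model.

    Under an x:y setting (x = halvings, y = iq_points), every halving
    of response time lowers the required IQ by y / x points.
    """
    return ref_iq + (iq_points / halvings) * log2(seconds / ref_seconds)


# Curve through a hypothetical model with IQ 120 that answers in 8 seconds,
# using a 2:10 setting.
curve = [iq_on_isoquant(120, 8.0, t, halvings=2, iq_points=10) for t in (2.0, 4.0, 8.0, 16.0)]
print(curve)
```

A model that answers in 2 seconds needs only an IQ of 110 to sit on the same curve, while one that takes 16 seconds needs 125.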