Quantitative Intelligence
Model Benchmarks
A comparison of frontier model capability against cost. Cost is plotted on a logarithmic scale because prices differ by orders of magnitude between the commodity and reasoning tiers.
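To illustrate why a logarithmic axis helps here, the following is a minimal sketch that compares where a cost lands on a linear axis versus a log axis. The function names and the example prices are hypothetical placeholders chosen for illustration; they are not real model pricing and not the method used to build the chart.

```python
import math


def linear_position(cost: float, min_cost: float, max_cost: float) -> float:
    """Return where `cost` falls on a linear axis, from 0.0 to 1.0."""
    return (cost - min_cost) / (max_cost - min_cost)


def log_position(cost: float, min_cost: float, max_cost: float) -> float:
    """Return where `cost` falls on a log10-scaled axis, from 0.0 to 1.0."""
    if min(cost, min_cost, max_cost) <= 0:
        raise ValueError("a log scale requires strictly positive costs")
    if min_cost >= max_cost:
        raise ValueError("min_cost must be less than max_cost")
    span = math.log10(max_cost) - math.log10(min_cost)
    return (math.log10(cost) - math.log10(min_cost)) / span


# Hypothetical prices for illustration only (not real model pricing).
example_costs = [0.1, 1.0, 10.0, 100.0]

for cost in example_costs:
    lin = linear_position(cost, example_costs[0], example_costs[-1])
    log = log_position(cost, example_costs[0], example_costs[-1])
    print(f"cost={cost:>6}: linear={lin:.3f}  log={log:.3f}")
```

On the linear axis the three cheapest placeholder values crowd into the bottom tenth of the range, while on the log axis each tenfold step is evenly spaced, which keeps low-cost and high-cost tiers readable on the same chart.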
Intelligence vs. Cost
The Performance Leaders
Models such as Claude 3.5 Sonnet and GPT-4o offer the best balance of performance and cost. DeepSeek V3 has recently set a new industry low for price relative to performance.
Reasoning & Logic
The o1 series represents a premium tier for advanced reasoning tasks. Although more expensive, these models score significantly higher on complex math and coding benchmarks.