Model Intelligence

o1

Name: o1
Author: OpenAI

Frontier Reasoning | by OpenAI | Released September 2024

OpenAI o1 introduced a new paradigm: test-time compute scaling. Rather than relying only on a larger parameter count, o1 "thinks" before answering, spending compute on an internal chain-of-thought reasoning trace before it produces a response. This yields markedly better performance on hard math, scientific reasoning, and competitive programming, at the cost of slower response times.

Context Window

128,000 tokens

License

Proprietary

Capabilities

Chain-of-thought reasoning (internal)
PhD-level science and math
Competitive programming
Multi-step logical deduction
Complex coding and debugging

Pricing Summary

Input: $15.00/1M tokens | Output: $60.00/1M tokens. This is significantly more expensive than GPT-4o; the premium is justified mainly for hard reasoning tasks.
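
To make the pricing concrete, the following is a minimal sketch of a per-request cost estimate using the two listed rates. The function name, parameter names, and example token counts are hypothetical and chosen only for illustration. The sketch assumes that hidden reasoning tokens are billed at the output rate, which is why a short visible answer can still cost much more than its length suggests.

```python
# Listed prices from the pricing summary, in USD per 1M tokens.
INPUT_USD_PER_MILLION = 15.00
OUTPUT_USD_PER_MILLION = 60.00


def estimate_cost_usd(input_tokens: int,
                      visible_output_tokens: int,
                      reasoning_tokens: int = 0) -> float:
    """Estimate the cost of one request in USD.

    Assumption: reasoning tokens are billed at the output rate,
    so they are added to the visible output tokens before pricing.
    """
    billed_output = visible_output_tokens + reasoning_tokens
    total = (input_tokens * INPUT_USD_PER_MILLION
             + billed_output * OUTPUT_USD_PER_MILLION)
    return total / 1_000_000


if __name__ == "__main__":
    # Hypothetical request: 2,000 prompt tokens, 500 visible answer
    # tokens, and 4,000 hidden reasoning tokens.
    cost = estimate_cost_usd(2_000, 500, 4_000)
    print(f"Estimated cost: ${cost:.2f}")
```

In this hypothetical request the reasoning tokens account for most of the bill, so any budget estimate for o1 should include them rather than only the visible answer length.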

Benchmarks

AIME 2024: 83.3% | MATH: 94.8% | GPQA Diamond: 78.3% | Codeforces Rating: 1807 (93rd percentile)

Competitive Positioning

o1 opens a new product category: reasoning models. A response from Anthropic is expected. Google's Gemini 2.0 Flash Thinking is a partial analog. At release, o1's benchmark performance on hard reasoning tasks set the frontier.

Recent Intel

May 3, 2026

OpenAI Models Edge Human Doctors in Early ER Diagnoses

Related Models

GPT-4o
o1-mini
o1-preview