61 Total Models Tracked
10 Providers
20 Release Months

Release Timeline

September 2025

Qwen

Qwen 3 Max

Flagship Qwen 3 model. Top-tier reasoning and coding, competitive with Claude Opus and GPT-4.1.

Input: $5.00/M Output: $20.00/M Context: 256K tokens
xAI

Grok 4

Next-generation xAI model with enhanced reasoning and coding capability. Competes with Claude Opus and o3 Pro tier.

Input: $5.00/M Output: $25.00/M Context: 256K tokens

August 2025

Qwen

Qwen 3 Coder

Latest Qwen coding-specialized model. Strong performance on HumanEval and competitive programming benchmarks.

Input: $0.500/M Output: $2.00/M Context: 256K tokens
Anthropic

Claude 4 Haiku

Updated Haiku model with improved reasoning over Claude 3.5 Haiku. Fast and affordable for high-volume tasks.

Input: $0.800/M Output: $4.00/M Context: 200K tokens

July 2025

DeepSeek

DeepSeek V3.2

Updated V3 model with improved general reasoning and multilingual capability. Strong value proposition.

Input: $0.300/M Output: $1.20/M Context: 128K tokens
Anthropic

Claude Sonnet 4 Lite

Lighter version of Claude Sonnet 4. Good balance of quality and cost for day-to-day coding.

Input: $1.00/M Output: $5.00/M Context: 200K tokens

June 2025

Google

Gemini 2.5 Flash Lite

The most affordable Gemini model. Ultra-low cost for high-volume, simple coding and text tasks.

Input: $0.037/M Output: $0.150/M Context: 1M tokens
Qwen

Qwen Coder Turbo V2

Updated Qwen Coder Turbo with improved code generation quality. Strong value for budget coding.

Input: $0.300/M Output: $1.20/M Context: 128K tokens
Mistral

Mistral Large 3

Latest Mistral flagship model. Improved coding and multilingual capability over Large 2.

Input: $2.00/M Output: $6.00/M Context: 128K tokens
Qwen

Qwen 3.6 Plus

Qwen's latest general-purpose model. Competitive with Claude Sonnet pricing.

Input: $3.00/M Output: $15.00/M Context: 128K tokens
OpenAI

OpenAI o3 Pro

Top-tier reasoning model combining o3's coding strength with extended compute. The most powerful OpenAI model for reasoning-heavy coding.

Input: $20.00/M Output: $80.00/M Context: 200K tokens

May 2025

Qwen

Qwen 3 Turbo

Fast and affordable Qwen 3 generation model. Good for high-volume tasks with improved quality over Qwen Turbo.

Input: $0.150/M Output: $0.600/M Context: 128K tokens
DeepSeek

DeepSeek Jiuge

Ultra-budget DeepSeek model for high-volume tasks. Competitive with Gemini Flash pricing.

Input: $0.150/M Output: $0.600/M Context: 128K tokens
xAI

Grok Code

xAI's coding-specialized model. Optimized for code generation, debugging, and software engineering tasks.

Input: $1.50/M Output: $7.50/M Context: 128K tokens
Anthropic

Claude Sonnet 4

Anthropic's balanced model for coding and general tasks. Best price-performance ratio in the Claude family.

Input: $3.00/M Output: $15.00/M Context: 200K tokens
Anthropic

Claude Opus 4

Anthropic's most powerful model. Best for complex reasoning and challenging coding tasks.

Input: $15.00/M Output: $75.00/M Context: 200K tokens

April 2025

Google

Gemini 2.5 Flash

Fast and affordable Google model. Great for high-volume coding and processing.

Input: $0.150/M Output: $0.600/M Context: 1M tokens
DeepSeek

DeepSeek Coder V3

Latest generation DeepSeek coding model. Improved code understanding and generation over V2.

Input: $0.270/M Output: $1.10/M Context: 128K tokens
OpenAI

GPT-4.1 mini

Cost-optimized GPT-4.1 variant. Strong coding capability at budget pricing, replacing GPT-4o mini for many use cases.

Input: $0.400/M Output: $1.60/M Context: 128K tokens
OpenAI

OpenAI o4-mini

Updated mini reasoning model. Similar pricing to o3-mini with updated capabilities.

Input: $1.10/M Output: $4.40/M Context: 200K tokens
OpenAI

GPT-4.1

Updated GPT-4 generation with improved instruction following and reduced hallucination. Better coding accuracy than GPT-4o.

Input: $2.00/M Output: $8.00/M Context: 128K tokens
Google

Gemini 2.5 Pro

Google's most capable model. Strong coding, multimodal understanding, and very competitive pricing.

Input: $1.25/M Output: $10.00/M Context: 1M tokens
xAI

Grok 3 Vision

xAI's multimodal vision model. Combines Grok 3's reasoning with image and diagram understanding.

Input: $5.00/M Output: $20.00/M Context: 128K tokens

March 2025

Google

Gemma 3 27B

Google's open-weight 27B model. Budget-friendly with strong coding capability and Google's research backing.

Input: $0.100/M Output: $0.400/M Context: 128K tokens
Mistral

Mistral Medium

Mid-tier Mistral model between Small and Large. Strong coding capability at a moderate price point.

Input: $0.400/M Output: $2.00/M Context: 128K tokens
OpenAI

OpenAI o1 Pro

Premium reasoning model with extended compute. Best-in-class for complex math, science, and advanced coding challenges.

Input: $20.00/M Output: $80.00/M Context: 200K tokens

February 2025

xAI

Grok 3 Mini

Cost-effective xAI model for high-volume tasks. Good balance of capability and affordability.

Input: $0.300/M Output: $0.500/M Context: 128K tokens
xAI

Grok 3

xAI's flagship model. Strong general-purpose capability with real-time knowledge access through X platform integration.

Input: $3.00/M Output: $15.00/M Context: 128K tokens
OpenAI

OpenAI o3

Next generation reasoning model. Improved coding and math over o1.

Input: $10.00/M Output: $40.00/M Context: 200K tokens

January 2025

Mistral

Mistral Small 3

Mistral's cost-effective model. Very affordable for general-purpose tasks.

Input: $0.100/M Output: $0.300/M Context: 32K tokens
Microsoft

Microsoft Phi-4

Microsoft's compact 14B model with strong reasoning and coding capability. Excellent value for small-scale deployments.

Input: $0.100/M Output: $0.300/M Context: 128K tokens
DeepSeek

DeepSeek Reasoner (R1)

DeepSeek's reasoning model. Comparable to OpenAI's o1 but at much lower cost.

Input: $0.550/M Output: $2.19/M Context: 128K tokens
OpenAI

OpenAI o3-mini

Affordable reasoning model for coding tasks. Best price-performance for algorithm-heavy work.

Input: $1.10/M Output: $4.40/M Context: 200K tokens

December 2024

Google

Gemini 2.0 Flash

Cheapest Google model. Fast responses for simple coding tasks.

Input: $0.100/M Output: $0.400/M Context: 1M tokens
Meta

Llama 3.3 70B

Meta's open-weight 70B model. Strong coding and general capability, widely supported across AI platforms.

Input: $0.250/M Output: $1.00/M Context: 128K tokens
DeepSeek

DeepSeek Chat V3

Very affordable general-purpose model from DeepSeek. Strong coding and reasoning at low cost.

Input: $0.270/M Output: $1.10/M Context: 128K tokens
Google

Gemini 2.0 Pro

Mid-tier Gemini 2.0 model. Better quality than Flash at a competitive price point for enterprise coding tasks.

Input: $2.50/M Output: $10.00/M Context: 1M tokens

October 2024

Anthropic

Claude 3.5 Haiku

Fast, cost-effective model for high-volume tasks. Great for code review and simple queries.

Input: $0.800/M Output: $4.00/M Context: 200K tokens
Anthropic

Claude 3.5 Sonnet

Previous generation Sonnet. Still excellent for coding tasks at the same price point.

Input: $3.00/M Output: $15.00/M Context: 200K tokens

September 2024

Reka

Reka Flash

Reka's fast multimodal model. Compact and efficient for high-volume tasks with vision capability.

Input: $0.200/M Output: $0.800/M Context: 128K tokens
OpenAI

OpenAI o1-mini

Cost-effective reasoning model. Good for coding tasks that require logical reasoning.

Input: $1.10/M Output: $4.40/M Context: 128K tokens
OpenAI

OpenAI o1

Reasoning model optimized for complex problem-solving. Excels at math, science, and advanced coding.

Input: $15.00/M Output: $60.00/M Context: 200K tokens

July 2024

Mistral

Mistral Nemo

Compact 12B open-weight model co-developed with NVIDIA. Excellent coding performance at minimal cost.

Input: $0.150/M Output: $0.150/M Context: 128K tokens
OpenAI

GPT-4o mini

Affordable small model. Fast and cost-effective for high-volume coding tasks.

Input: $0.150/M Output: $0.600/M Context: 128K tokens
Mistral

Mistral Large 2

Mistral's flagship model. Strong multilingual and coding capability.

Input: $2.00/M Output: $6.00/M Context: 128K tokens

June 2024

Mistral

Codestral

Mistral's dedicated coding model. Open-weight and highly optimized for code generation and completion.

Input: $0.300/M Output: $0.900/M Context: 128K tokens
DeepSeek

DeepSeek Coder V2

DeepSeek's coding-specialized model. Open-source and very affordable.

Input: $0.270/M Output: $1.10/M Context: 128K tokens
Qwen

Qwen Coder Turbo

Fast coding model from Qwen. Good price-performance for code generation.

Input: $0.250/M Output: $1.25/M Context: 128K tokens
Qwen

Qwen Coder Plus

Qwen model specifically optimized for coding tasks.

Input: $0.800/M Output: $4.00/M Context: 128K tokens

May 2024

Google

Gemini 1.5 Flash

Cheapest Gemini model. Good for high-volume, simple tasks.

Input: $0.075/M Output: $0.300/M Context: 1M tokens
OpenAI

GPT-4o

OpenAI's flagship multimodal model. Strong coding and reasoning at competitive pricing.

Input: $2.50/M Output: $10.00/M Context: 128K tokens

April 2024

Google

Gemini 1.5 Pro

Previous generation Google pro model. Good for general tasks.

Input: $1.25/M Output: $5.00/M Context: 1M tokens
OpenAI

GPT-4 Turbo

Previous generation high-performance model. Good for complex reasoning tasks.

Input: $10.00/M Output: $30.00/M Context: 128K tokens

March 2024

Anthropic

Claude 3 Haiku

Cheapest Claude model. Fast responses for simple tasks and basic coding.

Input: $0.250/M Output: $1.25/M Context: 200K tokens
Qwen

Qwen Max

Qwen's most powerful model. Strong reasoning and coding capabilities.

Input: $1.60/M Output: $6.40/M Context: 32K tokens

February 2024

Anthropic

Claude 3 Sonnet

First generation Sonnet. Balanced performance for general tasks.

Input: $3.00/M Output: $15.00/M Context: 200K tokens
Anthropic

Claude 3 Opus

First generation Opus. Highest reasoning capability in the Claude 3 family.

Input: $15.00/M Output: $75.00/M Context: 200K tokens

January 2024

Qwen

Qwen Turbo

Fastest and cheapest Qwen model. Good for high-volume tasks.

Input: $0.080/M Output: $0.240/M Context: 1M tokens
Qwen

Qwen Plus

Balanced Qwen model for general tasks. Good price-performance ratio.

Input: $0.400/M Output: $1.20/M Context: 128K tokens

March 2023

OpenAI

GPT-3.5 Turbo

Budget model for simple tasks. Being phased out but still widely used.

Input: $0.500/M Output: $1.50/M Context: 16K tokens
OpenAI

GPT-4

Original GPT-4. Most expensive OpenAI model, largely superseded by newer options.

Input: $30.00/M Output: $60.00/M Context: 8K tokens