Budget AI Models: Pricing Comparison 2026 | AI Dev Tools

GLM-4-Flash

Zhipu AI

Zhipu AI's ultra-cheap model. Near-free pricing for high-volume Chinese and English text tasks.

$0.010/M input $0.010/M output <$0.01 per medium feature

Amazon Nova Micro

Amazon

Amazon's most cost-effective model. Optimized for speed and low-cost text generation tasks.

$0.035/M input $0.140/M output ~$0.04 per medium feature

Gemini 2.5 Flash Lite

Google

The most affordable Gemini model. Ultra-low cost for high-volume, simple coding and text tasks.

$0.037/M input $0.150/M output ~$0.04 per medium feature

MiniMax Text 01

MiniMax

MiniMax's cost-effective text model. Optimized for high-volume Chinese text generation.

$0.050/M input $0.200/M output ~$0.06 per medium feature

Stable Code 3B

Stability AI

Stability AI's code-focused model. Small, efficient model for code completion and generation.

$0.050/M input $0.200/M output ~$0.06 per medium feature

Llama 3.1 8B

Phi-3 Mini

Microsoft

Microsoft's compact Phi-3 model. Small but capable model for edge and IoT deployment.

$0.050/M input $0.100/M output ~$0.04 per medium feature

Gemini 1.5 Flash

Google

Low-cost Gemini 1.5 model. Good for high-volume, simple tasks.

$0.075/M input $0.300/M output ~$0.09 per medium feature

Gemini 2.0 Flash Lite

Google

Low-cost Gemini 2.0 model. Well suited to high-volume, latency-sensitive applications.

$0.075/M input $0.300/M output ~$0.09 per medium feature

Qwen Turbo

Qwen

Fastest and cheapest Qwen model. Good for high-volume tasks.

$0.080/M input $0.240/M output ~$0.08 per medium feature

Gemini 2.0 Flash

Google

Low-cost Google model. Fast responses for simple coding tasks.

$0.100/M input $0.400/M output ~$0.12 per medium feature

Mistral Small 3

Mistral

Mistral's cost-effective model. Very affordable for general-purpose tasks.

$0.100/M input $0.300/M output ~$0.10 per medium feature

Microsoft Phi-4

Microsoft

Microsoft's compact 14B model with strong reasoning and coding capability. Excellent value for small-scale deployments.

$0.100/M input $0.300/M output ~$0.10 per medium feature

Gemma 3 27B

Google

Google's open-weight 27B model. Budget-friendly with strong coding capability and Google's research backing.

$0.100/M input $0.400/M output ~$0.12 per medium feature

GPT-4.1 Nano

OpenAI

OpenAI's smallest and cheapest GPT-4.1 model. Fast responses for simple tasks.

$0.100/M input $0.400/M output ~$0.12 per medium feature

Mistral Small 3

Mistral

Mistral's efficient small model. Great performance for its size at very competitive pricing.

$0.100/M input $0.300/M output ~$0.10 per medium feature

Phi-4 Mini

Microsoft

Microsoft's latest small model with improved coding ability. Better than Phi-3 for developer tasks.

$0.100/M input $0.300/M output ~$0.10 per medium feature

GPT-4o mini

OpenAI

Affordable small model. Fast and cost-effective for high-volume coding tasks.

$0.150/M input $0.600/M output ~$0.18 per medium feature

Gemini 2.5 Flash

Google

Fast and affordable Google model. Great for high-volume coding and processing.

$0.150/M input $0.600/M output ~$0.17 per medium feature

Mistral Nemo

Mistral

Compact 12B open-weight model co-developed with NVIDIA. Excellent coding performance at minimal cost.

$0.150/M input $0.150/M output ~$0.08 per medium feature

Qwen 3 Turbo

Qwen

Fast and affordable Qwen 3 generation model. Good for high-volume tasks with improved quality over Qwen Turbo.

$0.150/M input $0.600/M output ~$0.17 per medium feature

DeepSeek Jiuge

DeepSeek

Ultra-budget DeepSeek model for high-volume tasks. Competitive with Gemini Flash pricing.

$0.150/M input $0.600/M output ~$0.17 per medium feature

Yi-Lightning

01.ai

01.ai's cost-effective model. Competitive Chinese-English bilingual model at very low prices.

$0.150/M input $0.600/M output ~$0.17 per medium feature

Pixtral 12B

Mistral

Mistral's lightweight vision-language model. Affordable image understanding with good performance.

$0.150/M input $0.150/M output ~$0.08 per medium feature

Reka Flash

Reka

Reka's fast multimodal model. Compact and efficient for high-volume tasks with vision capability.

$0.200/M input $0.800/M output ~$0.23 per medium feature

Groq Gemma 2 9B

Groq

Google's Gemma 2 9B on Groq's LPU. Extremely fast small model for simple tasks.

$0.200/M input $0.200/M output ~$0.11 per medium feature

Qwen 2.5 Coder 32B

Qwen

Qwen's code-specialized 32B model. Trained on 130+ programming languages.

$0.200/M input $0.400/M output ~$0.15 per medium feature

Groq Mixtral 8x7B

Groq

Mixtral MoE on Groq's LPU. Fast, cost-effective inference for general tasks.

$0.240/M input $0.240/M output ~$0.13 per medium feature

Claude 3 Haiku

Anthropic

Cheapest Claude model. Fast responses for simple tasks and basic coding.

$0.250/M input $1.25/M output ~$0.34 per medium feature

Qwen Coder Turbo

Qwen

Fast coding model from Qwen. Good price-performance for code generation.

$0.250/M input $1.25/M output ~$0.34 per medium feature

Llama 3.3 70B

DeepSeek Chat V3

DeepSeek

Very affordable general-purpose model from DeepSeek. Strong coding and reasoning at low cost.

$0.270/M input $1.10/M output ~$0.31 per medium feature

DeepSeek Coder V2

DeepSeek

DeepSeek's coding-specialized model. Open-source and very affordable.

$0.270/M input $1.10/M output ~$0.31 per medium feature

DeepSeek Coder V3

DeepSeek

Latest generation DeepSeek coding model. Improved code understanding and generation over V2.

$0.270/M input $1.10/M output ~$0.31 per medium feature

DeepSeek V3.2

DeepSeek

Updated V3 model with improved general reasoning and multilingual capability. Strong value proposition.

$0.300/M input $1.20/M output ~$0.34 per medium feature

Grok 3 Mini

xAI

Cost-effective xAI model for high-volume tasks. Good balance of capability and affordability.

$0.300/M input $0.500/M output ~$0.21 per medium feature

Qwen Coder Turbo V2

Qwen

Updated Qwen Coder Turbo with improved code generation quality. Strong value for budget coding.

$0.300/M input $1.20/M output ~$0.34 per medium feature

Qwen Plus

Qwen

Balanced Qwen model for general tasks. Good price-performance ratio.

$0.400/M input $1.20/M output ~$0.38 per medium feature

GPT-4.1 mini

OpenAI

Cost-optimized GPT-4.1 variant. Strong coding capability at budget pricing, replacing GPT-4o mini for many use cases.

$0.400/M input $1.60/M output ~$0.46 per medium feature

Reka Edge

Reka

Reka's lightweight multimodal model. Affordable image and video understanding.

$0.400/M input $1.00/M output ~$0.34 per medium feature

GPT-3.5 Turbo

OpenAI

Budget model for simple tasks. Being phased out but still widely used.

$0.500/M input $1.50/M output ~$0.48 per medium feature

DeepSeek Reasoner (R1)

DeepSeek

DeepSeek's reasoning model. Comparable to OpenAI's o1 but at much lower cost.

$0.550/M input $2.19/M output ~$0.63 per medium feature

Claude 3.5 Haiku

Anthropic

Fast, cost-effective model for high-volume tasks. Great for code review and simple queries.

$0.800/M input $4.00/M output ~$1.24 per medium feature

Claude 4 Haiku

Anthropic

Updated Haiku model with improved reasoning over Claude 3.5 Haiku. Fast and affordable for high-volume tasks.

$0.800/M input $4.00/M output ~$1.24 per medium feature

Together Mistral Small 3

Together AI

Mistral Small 3 via Together AI. Efficient mid-size model for general tasks.

$0.800/M input $0.800/M output ~$0.44 per medium feature

Claude Sonnet 4 Lite

Anthropic

Lighter version of Claude Sonnet 4. Good balance of quality and cost for day-to-day coding.

$1.00/M input $5.00/M output ~$1.55 per medium feature

Perplexity Sonar

Perplexity

Perplexity's standard search model. Fast, cited answers at lower cost than Sonar Pro.

$1.00/M input $1.00/M output ~$0.55 per medium feature

OpenAI o1-mini

OpenAI

Cost-effective reasoning model. Good for coding tasks that require logical reasoning.

$1.10/M input $4.40/M output ~$1.27 per medium feature

OpenAI o3-mini

OpenAI

Affordable reasoning model for coding tasks. Best price-performance for algorithm-heavy work.

$1.10/M input $4.40/M output ~$1.27 per medium feature

OpenAI o4-mini

OpenAI

Updated mini reasoning model. Similar pricing to o3-mini with updated capabilities.

$1.10/M input $4.40/M output ~$1.27 per medium feature
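
Each card lists a price per million input tokens, a price per million output tokens, and a rough cost per "medium feature". The listing does not say how many tokens a medium feature is assumed to use, so the sketch below only illustrates the arithmetic. The token counts are hypothetical placeholders, and its results will not necessarily match the per-feature figures shown above.

```python
def estimate_cost(input_price_per_m, output_price_per_m, input_tokens, output_tokens):
    """Return the USD cost of one task, given prices per million tokens."""
    return (input_tokens * input_price_per_m
            + output_tokens * output_price_per_m) / 1_000_000


# Hypothetical token budget for one feature (an assumption, not from the listing).
ASSUMED_INPUT_TOKENS = 200_000
ASSUMED_OUTPUT_TOKENS = 50_000

# (input $/M, output $/M) copied from three of the cards above.
PRICES = {
    "GLM-4-Flash": (0.010, 0.010),
    "GPT-4o mini": (0.150, 0.600),
    "Claude 3.5 Haiku": (0.800, 4.00),
}

if __name__ == "__main__":
    for name, (input_price, output_price) in PRICES.items():
        cost = estimate_cost(input_price, output_price,
                             ASSUMED_INPUT_TOKENS, ASSUMED_OUTPUT_TOKENS)
        print(f"{name}: ~${cost:.4f} per feature (assumed token budget)")
```

Replacing the two assumed token counts with figures measured from your own workload gives a more meaningful comparison than any fixed per-feature estimate, since output-heavy tasks weight the output price far more than input-heavy ones.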