Budget AI Models
The most cost-effective AI models, priced under $1 per million input tokens. Best for high-volume tasks and cost optimization.
50 models in this category.
GLM-4-Flash
Zhipu AI: Zhipu AI's ultra-cheap model. Near-free pricing for high-volume Chinese and English text tasks.
Amazon Nova Micro
Amazon: Amazon's most cost-effective model. Optimized for speed and low-cost text generation tasks.
Gemini 2.5 Flash Lite
Google: The most affordable Gemini 2.5 model. Ultra-low cost for high-volume, simple coding and text tasks.
MiniMax Text 01
MiniMax: MiniMax's cost-effective text model. Optimized for high-volume Chinese text generation.
Stable Code 3B
Stability AI: Stability AI's code-focused model. Small, efficient model for code completion and generation.
Llama 3.1 8B
Meta: Meta's smallest Llama 3.1 model. Open weights, deploy anywhere. Great for self-hosted applications.
Phi-3 Mini
Microsoft: Microsoft's compact Phi-3 model. Small but capable model for edge and IoT deployment.
Gemini 1.5 Flash
Google: A low-cost Gemini 1.5 model. Good for high-volume, simple tasks.
Gemini 2.0 Flash Lite
Google: Google's most cost-effective Gemini 2.0 model. Great for high-volume, latency-sensitive applications.
Qwen Turbo
Qwen: Fastest and cheapest Qwen model. Good for high-volume tasks.
Gemini 2.0 Flash
Google: A low-cost Google model. Fast responses for simple coding tasks.
Mistral Small 3
Mistral: Mistral's cost-effective model. Very affordable for general-purpose tasks.
Microsoft Phi-4
Microsoft: Microsoft's compact 14B model with strong reasoning and coding capability. Excellent value for small-scale deployments.
Gemma 3 27B
Google: Google's open-weight 27B model. Budget-friendly with strong coding capability and Google's research backing.
GPT-4.1 Nano
OpenAI: OpenAI's smallest and cheapest GPT-4.1 model. Fast responses for simple tasks.
Mistral Small 3
Mistral: Mistral's efficient small model. Great performance for its size at very competitive pricing.
Phi-4 Mini
Microsoft: Microsoft's latest small model with improved coding ability. Better than Phi-3 Mini for developer tasks.
GPT-4o mini
OpenAI: Affordable small model. Fast and cost-effective for high-volume coding tasks.
Gemini 2.5 Flash
Google: Fast and affordable Google model. Great for high-volume coding and processing.
Mistral Nemo
Mistral: Compact 12B open-weight model co-developed with NVIDIA. Excellent coding performance at minimal cost.
Qwen 3 Turbo
Qwen: Fast and affordable Qwen 3 generation model. Good for high-volume tasks with improved quality over Qwen Turbo.
DeepSeek Jiuge
DeepSeek: Ultra-budget DeepSeek model for high-volume tasks. Priced competitively with Gemini Flash.
Yi-Lightning
01.ai: 01.ai's cost-effective model. Competitive Chinese-English bilingual model at very low prices.
Pixtral 12B
Mistral: Mistral's lightweight vision-language model. Affordable image understanding with good performance.
Reka Flash
Reka: Reka's fast multimodal model. Compact and efficient for high-volume tasks with vision capability.
Groq Gemma 2 9B
Groq: Google's Gemma 2 9B on Groq's LPU. Extremely fast small model for simple tasks.
Qwen 2.5 Coder 32B
Qwen: Qwen's code-specialized 32B model. Trained on 130+ programming languages.
Groq Mixtral 8x7B
Groq: Mixtral MoE on Groq's LPU. Fast, cost-effective inference for general tasks.
Claude 3 Haiku
Anthropic: Cheapest Claude model. Fast responses for simple tasks and basic coding.
Qwen Coder Turbo
Qwen: Fast coding model from Qwen. Good price-performance for code generation.
Llama 3.3 70B
Meta: Meta's open-weight 70B model. Strong coding and general capability, widely supported across AI platforms.
DeepSeek Chat V3
DeepSeek: Very affordable general-purpose model from DeepSeek. Strong coding and reasoning at low cost.
DeepSeek Coder V2
DeepSeek: DeepSeek's coding-specialized model. Open-source and very affordable.
DeepSeek Coder V3
DeepSeek: Latest-generation DeepSeek coding model. Improved code understanding and generation over V2.
DeepSeek V3.2
DeepSeek: Updated V3 model with improved general reasoning and multilingual capability. Strong value for the price.
Grok 3 Mini
xAI: Cost-effective xAI model for high-volume tasks. Good balance of capability and affordability.
Qwen Coder Turbo V2
Qwen: Updated Qwen Coder Turbo with improved code generation quality. Strong value for budget coding.
Qwen Plus
Qwen: Balanced Qwen model for general tasks. Good price-performance ratio.
GPT-4.1 mini
OpenAI: Cost-optimized GPT-4.1 variant. Strong coding capability at budget pricing, replacing GPT-4o mini for many use cases.
Reka Edge
Reka: Reka's lightweight multimodal model. Affordable image and video understanding.
GPT-3.5 Turbo
OpenAI: Budget model for simple tasks. Being phased out but still widely used.
DeepSeek Reasoner (R1)
DeepSeek: DeepSeek's reasoning model. Comparable to OpenAI's o1 but at much lower cost.
Claude 3.5 Haiku
Anthropic: Fast, cost-effective model for high-volume tasks. Great for code review and simple queries.
Claude 4 Haiku
Anthropic: Updated Haiku model with improved reasoning over Claude 3.5 Haiku. Fast and affordable for high-volume tasks.
Together Mistral Small 3
Together AI: Mistral Small 3 via Together AI. Efficient mid-size model for general tasks.
Claude Sonnet 4 Lite
Anthropic: Lighter version of Claude Sonnet 4. Good balance of quality and cost for day-to-day coding.
Perplexity Sonar
Perplexity: Perplexity's standard search model. Fast, cited answers at lower cost than Sonar Pro.
OpenAI o1-mini
OpenAI: Cost-effective reasoning model. Good for coding tasks that require logical reasoning.
OpenAI o3-mini
OpenAI: Affordable reasoning model for coding tasks. Strong price-performance for algorithm-heavy work.
OpenAI o4-mini
OpenAI: Updated mini reasoning model. Similar pricing to o3-mini with updated capabilities.
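
To make the category's threshold concrete, here is a minimal sketch of the arithmetic behind "under $1 per million input tokens". The function names and the example price of $0.10 per million tokens are hypothetical and are not taken from any model listed above. The sketch covers input tokens only, since that is how the category is defined; output-token pricing would need to be added separately.

```python
from decimal import Decimal

# Category threshold from the page header: under $1 per million input tokens.
BUDGET_THRESHOLD_PER_MILLION = Decimal("1.00")


def is_budget(price_per_million: Decimal) -> bool:
    """Return True if an input price (USD per million tokens) is under the threshold."""
    return price_per_million < BUDGET_THRESHOLD_PER_MILLION


def input_cost_usd(input_tokens: int, price_per_million: Decimal) -> Decimal:
    """Estimate the input-side cost in USD for a given number of input tokens."""
    return Decimal(input_tokens) * price_per_million / Decimal(1_000_000)


if __name__ == "__main__":
    # Hypothetical price, chosen only for illustration.
    example_price = Decimal("0.10")
    monthly_tokens = 250_000_000  # assumed monthly input volume

    print("Qualifies as budget:", is_budget(example_price))
    print("Estimated input cost (USD):", input_cost_usd(monthly_tokens, example_price))
```

Decimal is used instead of float so that small per-token prices do not pick up rounding noise when multiplied by large token volumes.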