GLM-4-Flash

Zhipu AI

Zhipu AI's lowest-cost model, priced at $0.010 per million tokens for both input and output. Suited to high-volume Chinese and English text tasks.

Context Window: 128K tokens
Released: 2024-08
Best For: High-volume text processing, Chinese NLP tasks

Ultra-low cost

Chinese-English

Fast

GLM-4-Flash Pricing

Token Type	Price per Million
Input tokens	$0.010
Output tokens	$0.010

Estimated Cost by Project Size

Realistic cost estimates for common coding scenarios. Assumes 30% cache hit rate where caching is available.

Scenario	Token Usage	Estimated Cost
Small Script (1K lines)	50K input / 30K output	<$0.01
Medium Feature (10K lines)	500K input / 200K output	<$0.01
Large Project (50K lines)	2,500K input / 1,000K output	$0.03
Code Review (5K lines)	250K input / 25K output	<$0.01
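
The estimates above follow from simple per-million-token arithmetic. The sketch below reproduces that calculation under stated assumptions: the function names `estimate_cost` and `format_cost` are hypothetical, the prices are the ones listed in the pricing table, and the 30% cache hit rate is ignored because no cached-token price is given here. Results may therefore differ slightly from the table.

```python
# Prices from the pricing table, in USD per million tokens.
GLM4_FLASH_INPUT_PER_M = 0.010
GLM4_FLASH_OUTPUT_PER_M = 0.010


def estimate_cost(input_tokens, output_tokens,
                  input_price_per_m=GLM4_FLASH_INPUT_PER_M,
                  output_price_per_m=GLM4_FLASH_OUTPUT_PER_M):
    """Return the uncached cost in USD for the given token counts."""
    total = input_tokens * input_price_per_m + output_tokens * output_price_per_m
    return total / 1_000_000


def format_cost(cost):
    """Format a cost the way the table does: amounts under one cent show as '<$0.01'."""
    return "<$0.01" if cost < 0.01 else f"${cost:.2f}"


# Token usage per scenario, copied from the table: (input, output).
SCENARIOS = {
    "Small Script (1K lines)": (50_000, 30_000),
    "Medium Feature (10K lines)": (500_000, 200_000),
    "Large Project (50K lines)": (2_500_000, 1_000_000),
    "Code Review (5K lines)": (250_000, 25_000),
}

if __name__ == "__main__":
    for name, (tokens_in, tokens_out) in SCENARIOS.items():
        print(f"{name}: {format_cost(estimate_cost(tokens_in, tokens_out))}")
```

Passing different per-million prices to `estimate_cost` gives a like-for-like figure for any other model, as long as the same token counts are used.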

Get Access to GLM-4-Flash

Ready to start using GLM-4-Flash? Get API access directly from Zhipu AI.


How Does GLM-4-Flash Compare?

Model	Input ($/M)	Medium Feature Cost
GLM-4-Flash	$0.010	$0.01
Amazon Nova Micro	$0.035	$0.04
Gemini 2.5 Flash Lite	$0.037	$0.04
MiniMax Text 01	$0.050	$0.06
Stable Code 3B	$0.050	$0.06
Llama 3.1 8B	$0.050	$0.04

Related Models

Claude Sonnet 4

Anthropic's balanced model for coding and general tasks. Best price-performance ratio in the Claude family.

$3.00/M input, $15.00/M output, ~$4.66 per medium feature

Claude Opus 4

Anthropic's most powerful model. Best for complex reasoning and challenging coding tasks.

$15.00/M input, $75.00/M output, ~$23.29 per medium feature

Claude 3.5 Sonnet

Previous-generation Sonnet. Still excellent for coding tasks, at the same price point as Claude Sonnet 4.

$3.00/M input, $15.00/M output, ~$4.66 per medium feature

Claude 3.5 Haiku

Fast, cost-effective model for high-volume tasks. Great for code review and simple queries.

$0.800/M input, $4.00/M output, ~$1.24 per medium feature
