GLM-4-Flash

Zhipu AI

Zhipu AI's ultra-cheap model. Near-free pricing for high-volume Chinese and English text tasks.

Context Window: 128K tokens Released: 2024-08 Best For: High-volume text processing, Chinese NLP tasks
  • Ultra-low cost
  • Chinese-English
  • Fast
  • GLM-4-Flash Pricing

    Token TypePrice per Million
    Input tokens$0.010
    Output tokens$0.010

    Estimated Cost by Project Size

    Realistic cost estimates for common coding scenarios. Assumes 30% cache hit rate where caching is available.

    ScenarioToken UsageEstimated Cost
    Small Script (1K lines) 50K input / 30K output <$0.01
    Medium Feature (10K lines) 500K input / 200K output <$0.01
    Large Project (50K lines) 2,500K input / 1,000K output $0.03
    Code Review (5K lines) 250K input / 25K output <$0.01

    Get Access to GLM-4-Flash

    Ready to start using GLM-4-Flash? Get API access directly from Zhipu AI.

    Get API Access → Try GLM-4-Flash Free →

    How Does GLM-4-Flash Compare?

    ModelInput ($/M)Medium Feature Cost
    GLM-4-Flash $0.010 $0.01 selected
    Amazon Nova Micro $0.035 $0.04 Compare
    Gemini 2.5 Flash Lite $0.037 $0.04 Compare
    MiniMax Text 01 $0.050 $0.06 Compare
    Stable Code 3B $0.050 $0.06 Compare
    Llama 3.1 8B $0.050 $0.04 Compare

    Related Models

    Categories