Grok 3 Vision

xAI

xAI's multimodal vision model. Combines Grok 3's reasoning with image and diagram understanding.

Context Window: 128K tokens Released: 2025-04 Best For: UI/UX analysis, diagram-to-code, visual debugging
  • Multimodal vision
  • Diagram understanding
  • Real-time knowledge
  • Grok 3 Vision Pricing

    Token TypePrice per Million
    Input tokens$5.00
    Output tokens$20.00

    Estimated Cost by Project Size

    Realistic cost estimates for common coding scenarios. Assumes 30% cache hit rate where caching is available.

    ScenarioToken UsageEstimated Cost
    Small Script (1K lines) 50K input / 30K output $0.78
    Medium Feature (10K lines) 500K input / 200K output $5.75
    Large Project (50K lines) 2,500K input / 1,000K output $28.75
    Code Review (5K lines) 250K input / 25K output $1.38

    Get Access to Grok 3 Vision

    Ready to start using Grok 3 Vision? Get API access directly from xAI.

    Get API Access →

    How Does Grok 3 Vision Compare?

    ModelInput ($/M)Medium Feature Cost
    Grok 3 Vision $5.00 $5.75 selected
    Grok 4 $5.00 $6.75 Compare
    Qwen 3 Max $5.00 $5.75 Compare
    Claude Sonnet 4 $3.00 $4.66 Compare
    Claude 3.5 Sonnet $3.00 $4.66 Compare
    Claude 3 Sonnet $3.00 $4.05 Compare

    Related Models

    Categories