Performance Scores

Overall

45
Rank #40 of 45 — Top 11%

SWE-bench

38
Rank #40 of 45 — Top 11%

LiveCodeBench

46
Rank #40 of 45 — Top 11%

HumanEval

68
Rank #40 of 45 — Top 11%

BigCodeBench

30
Rank #40 of 45 — Top 11%

Strengths & Weaknesses

Strengths

  • Very cheap
  • Fast

Weaknesses

  • Basic capability only

Compare with Similar-Priced Models

ModelOverall ScoreInput $/M
Claude 3 Haiku 45 $0.250
Claude 3.5 Haiku 52 $0.800
GPT-4o mini 58 $0.150
GPT-3.5 Turbo 40 $0.500
OpenAI o1-mini 70 $1.10
OpenAI o3-mini 80 $1.10