Performance Scores

Overall

60
Rank #28 of 45 — Top 38%

SWE-bench

54
Rank #28 of 45 — Top 38%

LiveCodeBench

62
Rank #29 of 45 — Top 36%

HumanEval

82
Rank #27 of 45 — Top 40%

BigCodeBench

45
Rank #28 of 45 — Top 38%

Strengths & Weaknesses

Strengths

  • Code-specialized

Weaknesses

  • Weaker at reasoning

Compare with Similar-Priced Models

ModelOverall ScoreInput $/M
Qwen Coder 60