Performance Scores

Overall

65
Rank #22 of 45 — Top 51%

SWE-bench

58
Rank #22 of 45 — Top 51%

LiveCodeBench

68
Rank #22 of 45 — Top 51%

HumanEval

84
Rank #24 of 45 — Top 47%

BigCodeBench

50
Rank #23 of 45 — Top 49%

Strengths & Weaknesses

Strengths

  • Code-specialized
  • Fast

Weaknesses

  • New model

Compare with Similar-Priced Models

ModelOverall ScoreInput $/M
Grok Code Fast 1 65