(composio.dev)

483 points mraniki | 1 comments | 31 Mar 25 12:09 UTC | HN request time: 0.277s | source

1. mvkel ◴[01 Apr 25 05:49 UTC] No.43543253[source]▶

I really wish people would stop evaluating a model's coding capability with one-shots.

The vast majority of coding energy is what comes next.

Even today, sonnet-3.5 is still the best "existing code base" model. Which is gratifying (to Anthropic) and/or alarming to everyone else

Gemini 2.5 Pro vs. Claude 3.7 Sonnet: Coding Comparison