←back to thread

Gemini 2.5 Pro vs. Claude 3.7 Sonnet: Coding Comparison

(composio.dev)

483 points mraniki | 1 comments | 31 Mar 25 12:09 UTC | HN request time: 0.312s | source

Show context

bratao ◴[31 Mar 25 12:46 UTC] No.43534359[source]▶

>>43534029 (OP) #

From my use case, the Gemini 2.5 is terrible. I have a complex Cython code in a single file (1500 lines) for a Sequence Labeling. Claude and o3 are very good in improving this code and following the commands. The Gemini always try to do unrelated changes. For example, I asked, separately, for small changes such as remove this unused function, or cache the arrays indexes. Every time it completely refactored the code and was obsessed with removing the gil. The output code is always broken, because removing the gil is not easy.

replies(10): >>43534409 #>>43534423 #>>43534434 #>>43534511 #>>43534695 #>>43534743 #>>43535378 #>>43536361 #>>43536527 #>>43536933 #

1. kristopolous ◴[31 Mar 25 16:02 UTC] No.43536527[source]▶

I mean it's really in how you use it.

The focus on benchmarks affords a tendency to generalize performance as if it's context and user independent.

Each model really is a different piece of software with different capabilities. Really fascinating to see how dramatically different people's assessments are