←back to thread

483 points mraniki | 1 comments | | HN request time: 0.208s | source
Show context
bratao ◴[] No.43534359[source]
From my use case, the Gemini 2.5 is terrible. I have a complex Cython code in a single file (1500 lines) for a Sequence Labeling. Claude and o3 are very good in improving this code and following the commands. The Gemini always try to do unrelated changes. For example, I asked, separately, for small changes such as remove this unused function, or cache the arrays indexes. Every time it completely refactored the code and was obsessed with removing the gil. The output code is always broken, because removing the gil is not easy.
replies(10): >>43534409 #>>43534423 #>>43534434 #>>43534511 #>>43534695 #>>43534743 #>>43535378 #>>43536361 #>>43536527 #>>43536933 #
1. kristopolous ◴[] No.43536527[source]
I mean it's really in how you use it.

The focus on benchmarks affords a tendency to generalize performance as if it's context and user independent.

Each model really is a different piece of software with different capabilities. Really fascinating to see how dramatically different people's assessments are