(www.anthropic.com)

2127 points bakugo | 1 comments | 24 Feb 25 18:28 UTC

jedberg ◴[24 Feb 25 18:46 UTC] No.43163242[source]▶

Last week, when Grok launched, the consensus was that its coding ability was better than Claude's. Anyone have a benchmark with this new model? Or just warm feelings?

replies(2): >>43163357 #>>43163414 #

1. minihat ◴[24 Feb 25 18:59 UTC] No.43163414[source]▶

>>43163242 #

Grok 3 with thinking is comparable to o1 for writing complex algorithms.

However, Grok sometimes loses the context where o1 seems not to. For this reason I still mostly use o1.

I have found both o1 and Grok 3 to be substantially better than any Claude offering.

Claude 3.7 Sonnet and Claude Code