←back to thread

2127 points bakugo | 3 comments | | HN request time: 1.468s | source
1. jedberg ◴[] No.43163242[source]
Last week when Grok launched the consensus was that its coding ability was better than Claude. Anyone have a benchmark with this new model? Or just warm feelings?
replies(2): >>43163357 #>>43163414 #
2. esafak ◴[] No.43163357[source]
They merely claimed that. I have not seen many people confirm that it is the best, let alone a consensus. I don't believe it is even available through an API yet.
3. minihat ◴[] No.43163414[source]
Grok 3 with thinking is comparable to o1 for writing complex algorithms.

However, Grok sometimes loses the context where o1 seems not to. For this reason I still mostly use o1.

I have found both o1 and Grok 3 to be substantially better than any Claude offering.