(x.ai)

504 points Terretta | 1 comments | 29 Aug 25 13:01 UTC | HN request time: 0s | source

Show context

boole1854 ◴[29 Aug 25 14:21 UTC] No.45064512[source]▶

It's interesting that the benchmark they are choosing to emphasize (in the one chart they show and even in the "fast" name of the model) is token output speed.

I would have thought it uncontroversial view among software engineers that token quality is much important than token output speed.

replies(14): >>45064582 #>>45064587 #>>45064594 #>>45064616 #>>45064622 #>>45064630 #>>45064757 #>>45064772 #>>45064950 #>>45065131 #>>45065280 #>>45065539 #>>45067136 #>>45077061 #

1. giancarlostoro ◴[29 Aug 25 14:41 UTC] No.45064772[source]▶

>>45064512 #

I'm more curious if its based on Grok 3 or what, I used to get reasonable answers from Grok 3. If that's the case, the trick that works for Grok and basically any model out there is to ask for things in order and piecemeal, not all at once. Some models will be decent at the 'all at once' approach, but when me and others have asked it in steps it gave us much better output. I'm not yet sure how I feel about Grok 4, have not really been impressed by it.

↑

Grok Code Fast 1