←back to thread

1 points thm | 1 comments | | HN request time: 0.206s | source
1. Nerd_Nest ◴[] No.44522753[source]
Interesting to see Grok making benchmark progress. I’m still waiting to see how it performs outside of controlled tests, especially in real-world use like coding, summarizing, or reasoning.