1 points thm | 2 comments
1. Nerd_Nest No.44522753
Interesting to see Grok making benchmark progress. I’m still waiting to see how it performs outside of controlled tests, especially in real-world use like coding, summarizing, or reasoning.