A faster model that outperforms its slower counterpart on multiple benchmarks? Can anyone explain why that makes sense? Are they just training on the benchmark test sets?
replies(4):
It could be anything: a different architecture, more data, RL, etc. It's probably RL. In recent months, top-tier labs seem to have "cracked" RL to a level not yet seen in open models, and by a large margin.
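For anyone wondering how RL can lift benchmark scores without training on the benchmarks themselves: the commonly described recipe is RL against an automatic verifier on tasks with checkable answers (math, code, etc.), so the model is rewarded for being correct rather than fed test data. Below is a minimal toy sketch of that loop under stated assumptions: the "policy" is a tabular softmax over answers to single-digit addition problems and the update is plain REINFORCE; everything here is made up for illustration, and real pipelines use a language model plus fancier optimizers (PPO/GRPO-style), not this.

```python
# Toy sketch of RL with verifiable rewards: sample an answer, check it with
# an automatic verifier, and reinforce correct samples. No benchmark data is
# ever trained on; the "benchmark" below is only used for evaluation.
import numpy as np

rng = np.random.default_rng(0)
ANSWERS = np.arange(2, 19)  # every possible sum of two digits in 1..9

# Policy parameters: one logit vector per prompt (a, b).
logits = {(a, b): np.zeros(len(ANSWERS))
          for a in range(1, 10) for b in range(1, 10)}

def sample(prompt):
    # Softmax over candidate answers for this prompt, then sample one.
    z = logits[prompt]
    p = np.exp(z - z.max())
    p /= p.sum()
    idx = rng.choice(len(ANSWERS), p=p)
    return idx, p

def benchmark_accuracy():
    # "Benchmark": greedy decoding over all prompts, never used for training.
    return np.mean([ANSWERS[np.argmax(logits[(a, b)])] == a + b
                    for a in range(1, 10) for b in range(1, 10)])

print(f"before RL: {benchmark_accuracy():.2f}")
lr = 1.0
for step in range(20000):
    a, b = rng.integers(1, 10, size=2)
    idx, p = sample((a, b))
    reward = 1.0 if ANSWERS[idx] == a + b else 0.0  # automatic verifier
    # REINFORCE: gradient of log p(idx) w.r.t. logits is (one_hot - p);
    # scale it by the reward so only verified-correct samples are pushed up.
    grad = -p
    grad[idx] += 1.0
    logits[(a, b)] += lr * reward * grad
print(f"after RL:  {benchmark_accuracy():.2f}")
```

In this toy the eval set happens to coincide with the training distribution, which is exactly what you can't assume for real models; the claim with the big labs is that RL on verifiable tasks transfers to held-out benchmarks, which would also explain a smaller, faster model scoring higher than a bigger base model.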