(www.lesswrong.com)

579 points paulpauper | 1 comments | 06 Apr 25 18:01 UTC | HN request time: 0.224s | source

1. OtherShrezzing ◴[06 Apr 25 20:15 UTC] No.43604605[source]▶

Assuming that the models getting better at SWE benchmarks and math tests would translate into positive outcomes in all other domains could be an act of spectacular hubris by the big frontier labs, which themselves are chock-full of mathematicians and software engineers.

↑

Recent AI model progress feels mostly like bullshit