(openai.com)

277 points simianwords | 1 comments | 06 Sep 25 07:41 UTC | HN request time: 0.2s | source

Show context

robertclaus ◴[06 Sep 25 21:35 UTC] No.45153076[source]▶

While I get the academic perspective of sharing these insights, this article comes across as corporate justifying/complaining that their model's score is lower than it should be on the leaderboards... by saying the leaderboards are wrong.

Or an even darker take is that its coorporate saying they won't prioritize eliminating hallucinations until the leaderboards reward it.

replies(1): >>45153412 #

1. skybrian ◴[06 Sep 25 22:20 UTC] No.45153412[source]▶

>>45153076 #

Yes, it's self-interested because they want to improve the leaderboards, which will help GPT-5 scores, but in the other hand, the changes they suggest seem very reasonable and will hopefully help everyone in the industry do better.

And I'm sure other people will complain if notice that changing the benchmarks makes things worse.

↑

Why language models hallucinate