(openai.com)

277 points simianwords | 1 comments | 06 Sep 25 07:41 UTC

cainxinth ◴[06 Sep 25 21:24 UTC] No.45152987[source]▶

I find the leaderboard argument a little strange. All their enterprise clients are clamoring for more reliability from them. If they could train a model that conceded ignorance instead of guessing, and thus avoided hallucinations, why aren't they doing that? Because of leaderboard optics?

replies(1): >>45153065 #

1. ospray ◴[06 Sep 25 21:33 UTC] No.45153065[source]▶

>>45152987 #

I think they are trying to communicate that their benchmark scores will go down as they tackle hallucinations. Honestly, I am surprised they didn't just say, "We think all benchmarks need an incorrect-vs-abstention ratio so our cautious, honest model can do well on that." Although they did seem to hint that's what they want.

Why language models hallucinate