
183 points WolfOliver | 2 comments
crooked-v ◴[] No.45066121[source]
For me it's simple: even the best models are "lazy" and will confidently declare they're finished when they're obviously not, and the immense increase in training effort it took to get ChatGPT 5's mild benchmark improvements suggests that this laziness won't go away anytime soon.
replies(2): >>45066370 #>>45066507 #
1. worldsayshi ◴[] No.45066370[source]
Sounds like it's partially about a nuanced trade-off. It can just as well be too eager and add changes I didn't ask for. Being lazy is better than continuing on a bad path.
replies(1): >>45067426 #
2. crooked-v ◴[] No.45067426[source]
There's a long distance between "nuanced behavior" and what it actually does now, which is "complete 6 items of an explicit 10-item task list and then ask the user again if they want to continue".