←back to thread

688 points crescit_eundo | 9 comments | | HN request time: 1.126s | source | bottom
Show context
swiftcoder ◴[] No.42144784[source]
I feel like the article neglects one obvious possibility: that OpenAI decided that chess was a benchmark worth "winning", special-cases chess within gpt-3.5-turbo-instruct, and then neglected to add that special-case to follow-up models since it wasn't generating sustained press coverage.
replies(8): >>42145306 #>>42145352 #>>42145619 #>>42145811 #>>42145883 #>>42146777 #>>42148148 #>>42151081 #
scott_w ◴[] No.42145811[source]
I suspect the same thing. Rather than LLMs “learning to play chess,” they “learnt” to recognise a chess game and hand over instructions to a chess engine. If that’s the case, I don’t feel impressed at all.
replies(5): >>42146086 #>>42146152 #>>42146383 #>>42146415 #>>42156785 #
1. fires10 ◴[] No.42146086[source]
Recognize and hand over to a specialist engine? That might be useful for AI. Maybe I am missing something.
replies(5): >>42146145 #>>42146293 #>>42146329 #>>42147558 #>>42151536 #
2. worewood ◴[] No.42146145[source]
It's because this is standard practice since the early days - there's nothing newsworthy in this at all.
3. generic92034 ◴[] No.42146293[source]
How do you think AI are (correctly) solving simple mathematical questions which they have not trained for directly? They hand it over to a specialist maths engine.
replies(1): >>42149781 #
4. nerdponx ◴[] No.42146329[source]
It is and would be useful, but it would be quite a big lie to the public, but more importantly to paying customers, and even more importantly to investors.
replies(1): >>42148826 #
5. scott_w ◴[] No.42147558[source]
If I was sold a general AI problem solving system, I’d feel ripped off if I learned that I needed to build my own problem solver and hook it up after I’d paid my money…
6. anon84873628 ◴[] No.42148826[source]
The problem is simply that the company has not been open about how it works, so we're all just speculating here.
7. internetter ◴[] No.42149781[source]
This is a relatively recent development (<3 months), at least for OpenAI, where the model will generate code to solve math and use the response
replies(1): >>42151065 #
8. cruffle_duffle ◴[] No.42151065{3}[source]
They’ve been doing that a lot longer than three months. ChatGPT has been handing stuff off to python for a very long time. At least for my paid account anyway.
9. skydhash ◴[] No.42151536[source]
Wasn't that the basis of computing and technology in general? Here is one tedious thing, let's have a specific tool that handles it instead of wasting time and efforts. The fact is that properly using the tool takes training and most of current AI marketing are hyping that you don't need that. Instead, hand over the problem to a GPT and it will "magically" solve it.