(dynomight.substack.com)

696 points crescit_eundo | 1 comments | 14 Nov 24 17:05 UTC | HN request time: 0.205s | source

1. wufufufu ◴[15 Nov 24 16:54 UTC] No.42148616[source]▶

> And then I tried gpt-3.5-turbo-instruct. This is a closed OpenAI model, so details are very murky.

How do you know it didn't just write a script that uses a chess engine and then execute the script? That IMO is the easiest explanation.

Also, I looked at the gpt-3.5-turbo-instruct example victory. One side played with 70% accuracy and the other was 77%. IMO that's not on par with 27XX ELO.

↑

Something weird is happening with LLMs and chess