←back to thread

695 points crescit_eundo | 1 comments | | HN request time: 0.203s | source
Show context
snickerbockers ◴[] No.42144943[source]
Does it ever try an illegal move? OP didn't mention this and I think it's inevitable that it should happen at least once, since the rules of chess are fairly arbitrary and LLMs are notorious for bullshitting their way through difficult problems when we'd rather they just admit that they don't have the answer.
replies(2): >>42145004 #>>42145793 #
sethherr ◴[] No.42145004[source]
Yes, he discusses using a grammar to restrict to only legal moves
replies(4): >>42147380 #>>42148708 #>>42150800 #>>42152205 #
yshui ◴[] No.42150800[source]
I suspect the models probably memorized some chess openings, and afterwards they are just playing random moves with the help of the grammar.
replies(1): >>42151787 #
1. gs17 ◴[] No.42151787[source]
I suspect that as well, however, 3.5-turbo-instruct has been noted by other people to do much better at generating legal chess moves than the other models. https://github.com/adamkarvonen/chess_gpt_eval gave models "5 illegal moves before forced resignation of the round" and 3.5 had very few illegal moves, while 4 lost most games due to illegal moves.