
688 points crescit_eundo | 1 comment
bryan0 No.42142557
I remember one of the early "breakthroughs" for LLMs in chess was that they could actually play legal moves(!) In all of these games, are the models always playing legal moves? I don't think the article says. The fact that an LLM can reliably play legal moves even 20+ moves into a chess game is somewhat remarkable. It needs to have an accurate representation of the board state even though it was only trained on next-token prediction.
replies(3): >>42142607 #>>42142610 #>>42143119 #
1. pama No.42142607
The author explains what they did: restrict the move options to valid ones when possible (for open models, where a grammar can be enforced during inference); otherwise, sample the model for a valid move up to ten times and, if none of the samples is legal, pick a random valid move.
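
For what it's worth, the retry-then-fallback part could look roughly like the sketch below. This is not the author's code: `pick_move`, `sample_move`, and `legal_moves` are hypothetical names, and the model call is stubbed out as a plain callable that returns a move string. The grammar-constrained path for open models is not shown.

```python
import random
from typing import Callable, Optional, Sequence


def pick_move(
    sample_move: Callable[[], str],   # hypothetical: one call = one sample from the model
    legal_moves: Sequence[str],       # hypothetical: legal moves in the current position
    max_tries: int = 10,
    rng: Optional[random.Random] = None,
) -> str:
    """Ask the model up to max_tries times; fall back to a random legal move."""
    if not legal_moves:
        raise ValueError("no legal moves available")
    rng = rng or random.Random()
    legal = set(legal_moves)
    for _ in range(max_tries):
        candidate = sample_move().strip()
        if candidate in legal:
            # The model produced a legal move, so use it as-is.
            return candidate
    # Every sample was illegal: pick uniformly among the legal moves.
    return rng.choice(list(legal_moves))
```

Note that with this setup every move that reaches the board is legal by construction, so the games alone don't tell you how often the model's raw output was illegal. You would have to log the rejected samples to measure that.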