(dynomight.substack.com)

696 points crescit_eundo | 1 comments | 14 Nov 24 17:05 UTC


bryan0 ◴[14 Nov 24 23:48 UTC] No.42142557[source]▶

I remember one of the early "breakthroughs" for LLMs in chess was that they could actually play legal moves(!) In all of these games, are the models always playing legal moves? I don't think the article says. The fact that an LLM can even reliably play legal moves 20+ moves into a chess game is somewhat remarkable. It needs to have an accurate representation of the board state even though it was only trained on next-token prediction.

replies(3): >>42142607 #>>42142610 #>>42143119 #

1. pama ◴[14 Nov 24 23:55 UTC] No.42142607[source]▶

>>42142557 #

The author explains what they did: restrict the move options to valid ones when possible (for open models with the ability to enforce grammar during inference), or sample the model for a valid move up to ten times and, if none of those attempts is legal, pick a random valid move.
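For the second case (no grammar enforcement), the retry-then-random-fallback logic might look roughly like this. It is only a sketch, not the author's code: `choose_move`, the `propose` callback (a stand-in for whatever call queries the model), and the move strings are all hypothetical names, and I'm assuming the random move is used only after every attempt fails.

```python
import random
from typing import Callable, Optional, Sequence


def choose_move(
    legal_moves: Sequence[str],
    propose: Callable[[], str],
    max_attempts: int = 10,
    rng: Optional[random.Random] = None,
) -> str:
    """Ask the model for a move up to max_attempts times, else pick at random.

    `propose` is a hypothetical stand-in for a model call that returns
    one candidate move as text.
    """
    if not legal_moves:
        raise ValueError("no legal moves available")
    rng = rng or random.Random()
    legal = set(legal_moves)
    for _ in range(max_attempts):
        candidate = propose().strip()
        if candidate in legal:
            # First legal suggestion from the model wins.
            return candidate
    # Every attempt was illegal: fall back to a uniformly random legal move.
    return rng.choice(list(legal_moves))


if __name__ == "__main__":
    # Fake "model" that answers with an illegal move first, then a legal one.
    replies = iter(["Ke9", "e4"])
    print(choose_move(["e4", "d4", "Nf3"], lambda: next(replies)))
```

One consequence of this design is that the game never stalls on an illegal move, but a weak model's results get mixed with random play, so counting how often the fallback fires would be worth doing.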


Something weird is happening with LLMs and chess