Most active commenters

viraptor(6)
chongli(3)

←back to thread

Something weird is happening with LLMs and chess

(dynomight.substack.com)

Show context

niobe ◴[15 Nov 24 00:40 UTC] No.42142885[source]▶

>>42138289 (OP) #

I don't understand why educated people expect that an LLM would be able to play chess at a decent level.

It has no idea about the quality of it's data. "Act like x" prompts are no substitute for actual reasoning and deterministic computation which clearly chess requires.

replies(20): >>42142963 #>>42143021 #>>42143024 #>>42143060 #>>42143136 #>>42143208 #>>42143253 #>>42143349 #>>42143949 #>>42144041 #>>42144146 #>>42144448 #>>42144487 #>>42144490 #>>42144558 #>>42144621 #>>42145171 #>>42145383 #>>42146513 #>>42147230 #

viraptor ◴[15 Nov 24 01:17 UTC] No.42143060[source]▶

>>42142885 #

This is a puzzle given enough training information. LLM can successfully print out the status of the board after the given moves. It can also produce a not-terrible summary of the position and is able to list dangers at least one move ahead. Decent is subjective, but that should beat at least beginners. And the lowest level of stockfish used in the blog post is lowest intermediate.

I don't know really what level we should be thinking of here, but I don't see any reason to dismiss the idea. Also, it really depends on whether you're thinking of the current public implementations of the tech, or the LLM idea in general. If we wanted to get better results, we could feed it way more chess books and past game analysis.

replies(2): >>42143139 #>>42143871 #

grugagag ◴[15 Nov 24 01:33 UTC] No.42143139[source]▶

>>42143060 #

LLMs like GPT aren’t built to play chess, and here’s why: they’re made for handling language, not playing games with strict rules and strategies. Chess engines, like Stockfish, are designed specifically for analyzing board positions and making the best moves, but LLMs don’t even "see" the board. They’re just guessing moves based on text patterns, without understanding the game itself.

Plus, LLMs have limited memory, so they struggle to remember previous moves in a long game. It’s like trying to play blindfolded! They’re great at explaining chess concepts or moves but not actually competing in a match.

replies(5): >>42143316 #>>42143409 #>>42143940 #>>42144497 #>>42150276 #

1. viraptor ◴[15 Nov 24 02:03 UTC] No.42143316[source]▶

>>42143139 #

> but LLMs don’t even "see" the board

This is a very vague claim, but they can reconstruct the board from the list of moves, which I would say proves this wrong.

> LLMs have limited memory

For the recent models this is not a problem for the chess example. You can feed whole books into them if you want to.

> so they struggle to remember previous moves

Chess is stateless with perfect information. Unless you're going for mind games, you don't need to remember previous moves.

> They’re great at explaining chess concepts or moves but not actually competing in a match.

What's the difference between a great explanation of a move and explaining every possible move then selecting the best one?

replies(6): >>42143465 #>>42143481 #>>42143484 #>>42143533 #>>42145323 #>>42146931 #

2. mjcohen ◴[15 Nov 24 02:33 UTC] No.42143465[source]▶

>>42143316 (TP) #

Chess is not stateless. Three repetitions of same position is a draw.

replies(1): >>42144802 #

3. cool_dude85 ◴[15 Nov 24 02:36 UTC] No.42143481[source]▶

>>42143316 (TP) #

>Chess is stateless with perfect information. Unless you're going for mind games, you don't need to remember previous moves.

In what sense is chess stateless? Question: is Rxa6 a legal move? You need board state to refer to in order to decide.

replies(1): >>42143555 #

4. sfmz ◴[15 Nov 24 02:36 UTC] No.42143484[source]▶

>>42143316 (TP) #

Chess is not stateless. En Passant requires last move and castling rights requires nearly all previous moves.

https://adamkarvonen.github.io/machine_learning/2024/01/03/c...

replies(1): >>42143592 #

5. ethbr1 ◴[15 Nov 24 02:46 UTC] No.42143533[source]▶

>>42143316 (TP) #

> Chess is stateless with perfect information.

It is not stateless, because good chess isn't played as a series of independent moves -- it's played as a series of moves connected to a player's strategy.

> What's the difference between a great explanation of a move and explaining every possible move then selecting the best one?

Continuing from the above, "best" in the latter sense involves understanding possible future moves after the next move.

Ergo, if I looked at all games with the current board state and chose the next move that won the most games, it'd be tactically sound but strategically ignorant.

Because many of those next moves were making that next move in support of some broader strategy.

replies(2): >>42143634 #>>42144422 #

6. aetherson ◴[15 Nov 24 02:50 UTC] No.42143555[source]▶

>>42143481 #

They mean that you only need board position, you don't need the previous moves that led to that board position.

There are at least a couple of exceptions to that as far as I know.

replies(2): >>42143938 #>>42144645 #

7. viraptor ◴[15 Nov 24 02:57 UTC] No.42143592[source]▶

>>42143484 #

Ok, I did go too far. But castling doesn't require all previous moves - only one bit of information carried over. So in practice that's board + 2 bits per player. (or 1 bit and 2 moves if you want to include a draw)

replies(1): >>42143633 #

8. aaronchall ◴[15 Nov 24 03:06 UTC] No.42143633{3}[source]▶

>>42143592 #

Castling requires no prior moves by either piece (King or Rook). Move the King once and back early on, and later, although the board looks set for castling, the King may not.

replies(1): >>42143643 #

9. viraptor ◴[15 Nov 24 03:07 UTC] No.42143634[source]▶

>>42143533 #

> it's played as a series of moves connected to a player's strategy.

That state belongs to the player, not to the game. You can carry your own state in any game you want - for example remember who starts with what move in rock paper scissors, but that doesn't make that game stateful. It's the player's decision (or bot's implementation) to use any extra state or not.

I wrote "previous moves" specifically (and the extra bits already addressed elsewhere), but the LLM can carry/rebuild its internal state between the steps.

replies(1): >>42143743 #

10. viraptor ◴[15 Nov 24 03:08 UTC] No.42143643{4}[source]▶

>>42143633 #

Yes, which means you carry one bit of extra information - "is castling still allowed". The specific moves that resulted in this bit being unset don't matter.

replies(1): >>42143680 #

11. aaronchall ◴[15 Nov 24 03:16 UTC] No.42143680{5}[source]▶

>>42143643 #

Ok, then for this you need minimum of two bits - one for kingside Rook and one for the queenside Rook, both would be set if you move the King. You also need to count moves since the last exchange or pawn move for the 50 move rule.

replies(1): >>42143705 #

12. viraptor ◴[15 Nov 24 03:23 UTC] No.42143705{6}[source]▶

>>42143680 #

Ah, that one's cool - I've got to admit I've never heard of the 50 move rule.

replies(1): >>42143935 #

13. ethbr1 ◴[15 Nov 24 03:32 UTC] No.42143743{3}[source]▶

>>42143634 #

If we're talking about LLMs, then the state belongs to it.

So even if the rules of chess are (mostly) stateless, the resulting game itself is not.

Thus, you can't dismiss concerns about LLMs having difficulty tracking state by saying that chess is stateless. It's not, in that sense.

14. User23 ◴[15 Nov 24 04:19 UTC] No.42143935{7}[source]▶

>>42143705 #

Also the 3x repetition rule.

replies(1): >>42144595 #

15. User23 ◴[15 Nov 24 04:20 UTC] No.42143938{3}[source]▶

>>42143555 #

The correct phrasing would be is it a Markov process?

16. lxgr ◴[15 Nov 24 06:35 UTC] No.42144422[source]▶

>>42143533 #

> good chess isn't played as a series of independent moves -- it's played as a series of moves connected to a player's strategy.

Maybe good chess, but not perfect chess. That would by definition be game-theoretically optimal, which in turn implies having to maintain no state other than your position in a large but precomputable game tree.

replies(1): >>42144634 #

17. chipsrafferty ◴[15 Nov 24 07:16 UTC] No.42144595{8}[source]▶

>>42143935 #

And 5x repetition rule

18. chongli ◴[15 Nov 24 07:25 UTC] No.42144634{3}[source]▶

>>42144422 #

Right, but your position also includes whether or not you still have the right to castle on either side, whether each pawn has the right to capture en passant or not, the number of moves since the last pawn move or capture (for tracking the 50 move rule), and whether or not the current position has ever appeared on the board once or twice prior (so you can claim a draw by threefold repetition).

So in practice, your position actually includes the log of all moves to that point. That’s a lot more state than just what you can see on the board.

19. chongli ◴[15 Nov 24 07:28 UTC] No.42144645{3}[source]▶

>>42143555 #

Yes, 4 exceptions: castling rights, legal en passant captures, threefold repetition, and the 50 move rule. You actually need quite a lot of state to track all of those.

replies(1): >>42147799 #

20. Someone ◴[15 Nov 24 08:01 UTC] No.42144802[source]▶

>>42143465 #

Yes, there’s state there that’s not in the board position, but technically, threefold repetition is not a draw. Play can go on. https://en.wikipedia.org/wiki/Threefold_repetition:

“The game is not automatically drawn if a position occurs for the third time – one of the players, on their turn, must claim the draw with the arbiter. The claim must be made either before making the move which will produce the third repetition, or after the opponent has made a move producing a third repetition. By contrast, the fivefold repetition rule requires the arbiter to intervene and declare the game drawn if the same position occurs five times, needing no claim by the players.”

21. cowl ◴[15 Nov 24 09:41 UTC] No.42145323[source]▶

>>42143316 (TP) #

> Chess is stateless with perfect information. Unless you're going for mind games, you don't need to remember previous moves.

while it can be played as stateless, remembering previous moves gives you insight into potential strategy that is being build.

22. jackcviers3 ◴[15 Nov 24 13:52 UTC] No.42146931[source]▶

>>42143316 (TP) #

You can feed them whole books, but they have trouble with recall for specific information in the middle of the context window.

23. fjkdlsjflkds ◴[15 Nov 24 15:33 UTC] No.42147799{4}[source]▶

>>42144645 #

It shouldn't be too much extra state. I assume that 2 bits should be enough to cover castling rights (one for each player), whatever is necessary to store the last 3 moves should cover legal en passant captures and threefold repetition, and 12 bits to store two non-overflowing 6 bit counters (time since last capture, and time since last pawn move) should cover the 50 move rule.

So... unless I'm understanding something incorrectly, something like "the three last moves plus 17 bits of state" (plus the current board state) should be enough to treat chess as a memoryless process. Doesn't seem like too much to track.

replies(1): >>42148093 #

24. chongli ◴[15 Nov 24 16:02 UTC] No.42148093{5}[source]▶

>>42147799 #

Threefold repetition does not require the three positions to occur consecutively. So you could conceivably have a position repeat itself for first on the 1st move, second time on the 25th move, and the third time on the 50th move of a sequence and then players could claim a draw by threefold repetition or 50 move rule at the same time!

This means you do need to store the last 50 board positions in the worst case. Normally you need to store less because many moves are irreversible (pawns cannot go backwards, pieces cannot be un-captured).

replies(1): >>42150660 #

25. fjkdlsjflkds ◴[15 Nov 24 20:33 UTC] No.42150660{6}[source]▶

>>42148093 #

Ah... gotcha. Thanks for the clarification.

↑