(dynomight.substack.com)

696 points crescit_eundo | 1 comments | 14 Nov 24 17:05 UTC | HN request time: 0.202s | source

Show context

Havoc ◴[15 Nov 24 01:32 UTC] No.42143134[source]▶

My money is on a fluke inclusion of more chess data in that models training.

All the other models do vaguely similarly well in other tasks and are in many cases architecturally similar so training data is the most likely explanation

replies(2): >>42143272 #>>42143307 #

1. bhouston ◴[15 Nov 24 01:56 UTC] No.42143272[source]▶

>>42143134 #

Yeah. This.

↑

Something weird is happening with LLMs and chess