
696 points crescit_eundo | 1 comment
1. kmeisthax No.42143243
If tokenization is such a big problem, then why aren't we training new base models on partially de-tokenized data? E.g., during training, randomly substitute some percentage of the input tokens with their individual letters.
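
The augmentation the comment proposes could be sketched roughly as follows. This is a hypothetical illustration, not an established training recipe: it operates on token strings rather than real tokenizer IDs, and the function name and parameters are made up for the example.

```python
import random

def char_dropout(tokens, p=0.1, rng=None):
    """With probability p, replace a multi-character token with its
    individual characters, so the model sometimes sees the same text
    as single-character tokens during training."""
    rng = rng or random.Random()
    out = []
    for tok in tokens:
        if len(tok) > 1 and rng.random() < p:
            out.extend(tok)  # split the token into single characters
        else:
            out.append(tok)  # keep the original token
    return out

# Toy example: token strings stand in for real tokenizer output.
tokens = ["The", " straw", "berry", " has", " three", " r", "s"]
print(char_dropout(tokens, p=0.5, rng=random.Random(0)))
```

Note the substitution preserves the underlying text exactly (the concatenation of the output equals the concatenation of the input), so it changes only the segmentation the model is exposed to, not the training data itself.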