←back to thread

296 points todsacerdoti | 1 comments | | HN request time: 0.369s | source
1. fooker ◴[] No.44371956[source]
‘Bytes’ is tokenization.

There’s no reason to assume it’s the best solution. It might be the case that a better tokenization scheme is needed for math, reasoning, video, etc models.