
350 points kashifr | 1 comments
tiahura ◴[] No.44501872[source]
Can anyone estimate how much of the 3B is necessitated by multi-language support?
replies(3): >>44502099 #>>44509476 #>>44509763 #
1. rockinghigh ◴[] No.44502099[source]
The vocabulary size is fairly small (128,256) for a multilingual model. I would guess it doesn't require many additional parameters to support these 5 languages, as many tokens can be shared across them.
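
To put a rough number on that, here is a back-of-the-envelope sketch of how many parameters the token embedding table takes. Only the vocabulary size (128,256) comes from this thread. The hidden size of 2,048, the tied input/output embeddings, and treating "3B" as exactly 3,000,000,000 are all assumptions made for illustration, so take the result as an order-of-magnitude estimate.

```python
def embedding_params(vocab_size: int, hidden_size: int, tied: bool = True) -> int:
    """Parameters in the token embedding table.

    If the output head is not tied to the input embeddings, the table
    is effectively counted twice.
    """
    table = vocab_size * hidden_size
    return table if tied else 2 * table


def embedding_share(vocab_size: int, hidden_size: int,
                    total_params: int, tied: bool = True) -> float:
    """Fraction of the total parameter count spent on embeddings."""
    return embedding_params(vocab_size, hidden_size, tied) / total_params


VOCAB_SIZE = 128_256            # from the comment above
HIDDEN_SIZE = 2_048             # ASSUMPTION: not stated in the thread
TOTAL_PARAMS = 3_000_000_000    # ASSUMPTION: "3B" taken as a round number

if __name__ == "__main__":
    n = embedding_params(VOCAB_SIZE, HIDDEN_SIZE)
    share = embedding_share(VOCAB_SIZE, HIDDEN_SIZE, TOTAL_PARAMS)
    print(f"{n:,} embedding parameters")
    print(f"{share:.1%} of the total")
```

Under those assumptions the whole table is about 263M parameters, a bit under 9% of 3B. That covers every token, English included, so it is an upper bound on what the vocabulary for the extra languages could cost. It says nothing about how much capacity in the transformer layers goes to them.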