←back to thread

262 points rain1 | 1 comments | | HN request time: 0.211s | source
Show context
kamranjon ◴[] No.44444101[source]
This is somehow missing the Gemma and Gemini series of models from Google. I also think that not mentioning the T5 series of models is strange from a historical perspective because they sort of pioneered many of the concepts in transfer learning and kinda kicked off quite a bit of interest in this space.
replies(1): >>44444690 #
rain1 ◴[] No.44444690[source]
The Gemma models are too small to be included in this list.

You're right the T5 stuff is very important historically but they're below 11B and I don't have much to say about them. Definitely a very interesting and important set of models though.

replies(2): >>44445159 #>>44448467 #
1. kamranjon ◴[] No.44448467[source]
Since you included GPT-2, everything from Google including T5 would qualify for the list I would think.