https://arxiv.org/abs/2501.00663
https://arxiv.org/pdf/2504.13173
Is there any other company that's openly publishing their research on AI at this level? Google should get a lot of credit for this.
https://arxiv.org/abs/2501.00663
https://arxiv.org/pdf/2504.13173
Is there any other company that's openly publishing their research on AI at this level? Google should get a lot of credit for this.
To wit, it's dangerous to assume the value of this idea based on the lack of public implementations.
You don't necessarily have to prove it out on large foundation models first. Can it beat out a 32b parameter model, for example?
While they do have lots of money and many people, they don't have infinite money and specifically only have so much hot infrastructure to spread around. You'd expect they have to gradually build up the case that a large scale experiment is likely enough to yield a big enough advantage over what's already claiming those resources.
So, I think they could default on doing it for small demonstrators.
Is that supposed to be a long time? Seems fair that companies don't rush to open up their models.
Student: Look, a well known financial expert placed what could potentially be a hundred dollar bill on the ground, other well-known financial experts just leave it there!
> In fact, the UL2 20B model (at Google) was trained by leaving the job running accidentally for a month.
https://www.yitay.net/blog/training-great-llms-entirely-from...