Google Titans architecture, helping AI have long-term memory

(research.google)

584 points Alifatisk | 3 comments | 07 Dec 25 12:23 UTC | HN request time: 0s | source

Show context

okdood64 ◴[07 Dec 25 14:05 UTC] No.46181759[source]▶

From the blog:

Is there any other company that's openly publishing their research on AI at this level? Google should get a lot of credit for this.

replies(12): >>46181829 #>>46182057 #>>46182168 #>>46182358 #>>46182633 #>>46183087 #>>46183462 #>>46183546 #>>46183827 #>>46184875 #>>46186114 #>>46189989 #

mapmeld ◴[07 Dec 25 15:02 UTC] No.46182168[source]▶

>>46181759 #

Well it's cool that they released a paper, but at this point it's been 11 months and you can't download a Titans-architecture model code or weights anywhere. That would put a lot of companies up ahead of them (Meta's Llama, Qwen, DeepSeek). Closest you can get is an unofficial implementation of the paper https://github.com/lucidrains/titans-pytorch

replies(7): >>46182351 #>>46182946 #>>46184154 #>>46185017 #>>46186942 #>>46187280 #>>46188385 #

alyxya ◴[07 Dec 25 16:34 UTC] No.46182946[source]▶

>>46182168 #

The hardest part about making a new architecture is that even if it is just better than transformers in every way, it’s very difficult to both prove a significant improvement at scale and gain traction. Until google puts in a lot of resources into training a scaled up version of this architecture, I believe there’s plenty of low hanging fruit with improving existing architectures such that it’ll always take the back seat.

replies(5): >>46183227 #>>46184404 #>>46184696 #>>46186138 #>>46186853 #

1. p1esk ◴[07 Dec 25 19:36 UTC] No.46184404[source]▶

>>46182946 #

Until google puts in a lot of resources into training a scaled up version of this architecture

If Google is not willing to scale it up, then why would anyone else?

replies(1): >>46187379 #

2. 8note ◴[08 Dec 25 01:44 UTC] No.46187379[source]▶

>>46184404 (TP) #

chatgpt is an example on why.

replies(1): >>46193265 #

3. falcor84 ◴[08 Dec 25 15:23 UTC] No.46193265[source]▶

>>46187379 #

You think that this might be another ChatGPT/Docker/Hadoop case, where Google comes up with the technology but doesn't care to productize it?

↑