
nubg No.46181850
Very interesting. Is it correct for me to imagine it as some kind of "LoRA" that's continuously adapted as the model goes through its day?

If so, could there perhaps be a step where the LoRA is merged back into the main model?

That would be like sleeping :-)
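
Mechanically, that merge step would just fold the low-rank product back into the dense weights. A rough sketch (the names and the alpha/rank scaling follow the common LoRA convention; nothing here is specific to this work):

    import torch

    def merge_lora(base_weight, lora_A, lora_B, alpha, rank):
        # base_weight: (out_features, in_features) dense layer weight
        # lora_A: (rank, in_features), lora_B: (out_features, rank)
        # Fold the adapter in: W' = W + (alpha / rank) * B @ A
        return base_weight + (alpha / rank) * (lora_B @ lora_A)

After that, the adapter can be discarded and the layer behaves as if the update had always been part of the base weights.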

andy12_ No.46183818
Kind of. You could in theory use LoRA for this, but it probably wouldn't have enough capacity to be a proper substitute for the attention mechanism. Instead, a full MLP is trained as input chunks are processed.
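
A toy sketch of the shape of that idea (the denoising objective, shapes, and learning rate here are illustrative assumptions, not the actual training recipe):

    import torch
    import torch.nn as nn

    dim = 64
    # A small MLP acting as a fast-weight memory in place of attention.
    memory = nn.Sequential(
        nn.Linear(dim, 4 * dim), nn.GELU(), nn.Linear(4 * dim, dim)
    )
    opt = torch.optim.SGD(memory.parameters(), lr=1e-2)

    stream = torch.randn(10, 16, dim)  # toy input: 10 chunks of 16 tokens
    for chunk in stream:
        # Take a gradient step on each chunk as it arrives, so the MLP's
        # weights come to encode the context seen so far. A denoising
        # objective stands in for whatever self-supervised loss is used.
        corrupted = chunk + 0.1 * torch.randn_like(chunk)
        loss = ((memory(corrupted) - chunk) ** 2).mean()
        opt.zero_grad()
        loss.backward()
        opt.step()
    # Reading from the memory is then just a forward pass: memory(query)

The point is that the memory's capacity lives in the MLP's full weights rather than in a low-rank delta, which is why LoRA alone probably wouldn't cut it.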