←back to thread

584 points Alifatisk | 1 comments | | HN request time: 0s | source
Show context
nasvay_factory ◴[] No.46183887[source]
I wrote about that a while ago: https://paxamans.github.io/blog/titans/
replies(1): >>46185805 #
moffkalast ◴[] No.46185805[source]
Are there any pretrained models with this architecture yet or is it all still completely theoretical beyond Google's unverifiable claims? They published the original Titans paper last year and nobody seems to have built on the idea.
replies(2): >>46187050 #>>46187535 #
1. djrhails ◴[] No.46187050[source]
https://github.com/lucidrains/titans-pytorch - is the only public iteration.

But no one appears to have taken the risk/time to properly validate it.