←back to thread

584 points Alifatisk | 1 comments | | HN request time: 0.435s | source
Show context
kgeist ◴[] No.46182122[source]
>The model uses this internal error signal (the gradient) as a mathematical equivalent of saying, "This is unexpected and important!" This allows the Titans architecture to selectively update its long-term memory only with the most novel and context-breaking information

So one can break a model by consistently feeding it with random, highly improbable junk? Everything would be registered as a surprise and get stored, impacting future interactions

replies(6): >>46182150 #>>46182410 #>>46182651 #>>46183200 #>>46183413 #>>46193429 #
1. pmichaud ◴[] No.46182150[source]
I’m guessing that this is the first thing they thought of and the problem only exists in the superficial gloss you’re responding to?