(transformer-circuits.pub)

168 points 1wheel | 1 comments | 21 May 24 15:15 UTC | HN request time: 0.219s | source

Show context

optimalsolver ◴[21 May 24 15:10 UTC] No.40429473[source]▶

>what the model is "thinking" before writing its response

An actual "thinking machine" would be constantly running computations on its accumulated experience in order to improve its future output and/or further compress its sensory history.

An LLM is doing exactly nothing while waiting for the next prompt.

replies(5): >>40429486 #>>40429493 #>>40429606 #>>40429761 #>>40429847 #

viking123 ◴[21 May 24 15:39 UTC] No.40429847[source]▶

>>40429473 #

I might be a complete brainlet so excuse my take, but when animals think and do things, the weights in the brain are constantly being adjusted, old connections pruned out and new ones made right? But once LLM is trained, that's kind of it? Nothing there changes when we discuss with it. As far as I understand from what I read, even our memories are just somehow in the connections between the neurons

replies(1): >>40429888 #

1. whimsicalism ◴[21 May 24 15:41 UTC] No.40429888[source]▶

>>40429847 #

my understanding was that once you are of age, brain pruning and malleability is relatively small

↑

Scaling Monosemanticity: Extracting Interpretable Features from Claude 3 Sonnet