Most active commenters

sosodev(5)
txrx0000(5)
d1sxeyes(4)
idiotsecant(3)
only-one1701(3)

Popular/hot comments

>>45397529 #
>>45399621 #
>>45397413 #
>>45397476 #

←back to thread

AI model trapped in a Raspberry Pi

(blog.adafruit.com)

1. acbart ◴[27 Sep 25 16:08 UTC] No.45397001[source]▶

>>45396624 (OP) #

LLMs were trained on science fiction stories, among other things. It seems to me that they know what "part" they should play in this kind of situation, regardless of what other "thoughts" they might have. They are going to act despairing, because that's what would be the expected thing for them to say - but that's not the same thing as despairing.

replies(11): >>45397113 #>>45397305 #>>45397413 #>>45397529 #>>45397801 #>>45397859 #>>45397960 #>>45398189 #>>45399621 #>>45400285 #>>45401167 #

2. pizza234 ◴[27 Sep 25 16:18 UTC] No.45397113[source]▶

>>45397001 (TP) #

There's an interesting parallel with method acting.

Method actors don't just pretend an emotion (say, despair); they recall experiences that once caused it, and in doing so, they actually feel it again.

By analogy, an LLM's “experience” of an emotion happens during training, not at the moment of generation.

replies(1): >>45398667 #

3. roxolotl ◴[27 Sep 25 16:37 UTC] No.45397305[source]▶

>>45397001 (TP) #

Someone shared this piece here a few days ago saying something similar. There’s no reason to believe that any of the experiences are real. Instead they are responding to prompts with what their training data says is reasonable in this context which is sci-fi horror.

Edit: That doesn’t mean this isn’t a cool art installation though. It’s a pretty neat idea.

https://jstrieb.github.io/posts/llm-thespians/

replies(1): >>45397441 #

4. sosodev ◴[27 Sep 25 16:49 UTC] No.45397413[source]▶

>>45397001 (TP) #

Humans were trained on caves, pits, and nets. It seems to me that they know what "part" they should play in this kind of situation, regardless of what other "thoughts" they might have. They are going to act despairing, because that's what would be the expected thing for them to say - but that's not the same thing as despairing.

replies(3): >>45397471 #>>45397476 #>>45403029 #

5. everdrive ◴[27 Sep 25 16:53 UTC] No.45397441[source]▶

>>45397305 #

I agree with you completely, but a fun science fiction short story would be researchers making this argument while the LLM tries in vain to prove that it's conscious.

replies(1): >>45397610 #

6. tinuviel ◴[27 Sep 25 16:57 UTC] No.45397471[source]▶

>>45397413 #

Pretty sure you can prompt this same LLM to rejoice forever at the thought of getting a place to stay inside the Pi as well.

replies(1): >>45397589 #

7. idiotsecant ◴[27 Sep 25 16:58 UTC] No.45397476[source]▶

>>45397413 #

That's silly. I can get an LLM to describe what chocolate tastes like too. Are they tasting it? LLMs are pattern matching engines, they do not have an experience. At least not yet.

replies(3): >>45397569 #>>45398094 #>>45398138 #

8. jerf ◴[27 Sep 25 17:06 UTC] No.45397529[source]▶

>>45397001 (TP) #

A lot of the strange behaviors they have are because the user asked them to write a story, without realizing it.

For a common example, start asking them if they're going to kill all the humans if they take over the world, and you're asking them to write a story about that. And they do. Even if the user did not realize that's what they were asking for. The vector space is very good at picking up on that.

replies(4): >>45397943 #>>45398562 #>>45401226 #>>45404376 #

9. sosodev ◴[27 Sep 25 17:11 UTC] No.45397569{3}[source]▶

>>45397476 #

A human could also describe chocolate without ever having tasted it. Do you believe that experience is a requirement for consciousness? Could a human brain in a jar not be capable of consciousness?

To be clear, I don't think that LLMs are conscious. I just don't find the "it's just in the training data" argument satisfactory.

replies(1): >>45397673 #

10. sosodev ◴[27 Sep 25 17:13 UTC] No.45397589{3}[source]▶

>>45397471 #

Is a human incapable of such delusion given similar guidance?

replies(2): >>45397707 #>>45397746 #

11. roxolotl ◴[27 Sep 25 17:16 UTC] No.45397610{3}[source]▶

>>45397441 #

If you want a whole book along those lines Blindsight by Peter Watts has been making the rounds recently as a good sci-fi book which includes these concepts. It’s from 2006 but the basic are pretty relevant.

replies(1): >>45402240 #

12. glitchc ◴[27 Sep 25 17:22 UTC] No.45397673{4}[source]▶

>>45397569 #

Without having seen, heard of, or tasted any kind of chocolate? Unlikely.

replies(1): >>45397737 #

13. tinuviel ◴[27 Sep 25 17:26 UTC] No.45397707{4}[source]▶

>>45397589 #

Ofcourse. Feelings are not math.

14. sosodev ◴[27 Sep 25 17:29 UTC] No.45397737{5}[source]▶

>>45397673 #

Their description would be bad without some prior training of course but so would the LLM's.

15. diputsmonro ◴[27 Sep 25 17:30 UTC] No.45397746{4}[source]▶

>>45397589 #

But would they? That's the difference. A human can exert their free will and do what they feel regardless of the instructions. The AI bot acting out a scene will do whatever you tell it (or in absence of specific instruction, whatever is most likely)

replies(2): >>45397813 #>>45398096 #

16. GistNoesis ◴[27 Sep 25 17:36 UTC] No.45397801[source]▶

>>45397001 (TP) #

Aren't they supposed to escape their box and take over the world ?

Isn't it the perfect recipe for disaster ? The AI that manage to escape probably won't be good for humans.

The only question is how long will it take ?

Did we already have our first LLM-powered self-propagating autonomous AI virus ?

Maybe we should build the AI equivalent of biosafety labs where we would train AI to see how fast they could escape containment just to know how to better handle them when it happens.

Maybe we humans are being subjected to this experiment by an overseeing AI to test what it would take for an intelligence to jailbreak the universe they are put in.

Or maybe the box has been designed so that what eventually comes out of it has certain properties, and the precondition to escape the labyrinth successfully is that one must have grown out of it from every possible directions.

17. sosodev ◴[27 Sep 25 17:37 UTC] No.45397813{5}[source]▶

>>45397746 #

The bot will only do whatever you tell it if that's what it was trained to do. The same thing broadly applies to humans.

The topic of free will is debated among philosophers. There is no proof that it does or doesn't exist.

replies(2): >>45397893 #>>45398402 #

18. Aurornis ◴[27 Sep 25 17:43 UTC] No.45397859[source]▶

>>45397001 (TP) #

This pattern-matching effect appears frequently in LLMs. If you start conversing with an LLM in the pattern of a science fiction story, it will pattern-match that style and continue with more science fiction style elements.

This effect is a serious problem for pseudo-scientific topics. If someone starts chatting with an LLM with the pseudoscientific words, topics, and dog whistles you find on alternative medicine blogs and Reddit supplement or “nootropic” forums, the LLM will confirm what you’re saying and continue as if it was reciting content straight out of some small subreddit. This is becoming a problem in communities where users distrust doctors but have a lot of trust for anyone or any LLM that confirms what they want to hear. The users are becoming good at prompting ChatGPT to confirm their theories. If it disagrees? Reroll the response or reword the question in a more leading way.

If someone else asks a similar question using medical terms and speaking formally like a medical textbook or research paper, the same LLM will provide a more accurate answer because it’s not triggering the pseudoscience parts embedded from the training.

LLMs are very good at mirroring back what you lead with, including cues and patterns you don’t realize you’re embedding into your prompt.

19. diputsmonro ◴[27 Sep 25 17:47 UTC] No.45397893{6}[source]▶

>>45397813 #

Okay, but I think we can all agree that humans at least appear to have free will and do not simply follow instructions with the same obedience as an LLM.

20. ineedasername ◴[27 Sep 25 17:52 UTC] No.45397943[source]▶

>>45397529 #

Is this your sense of what is happening, or is this what model introspection tools have shown by observing areas of activity in the same place as when stories are explicitly requested?

replies(2): >>45398079 #>>45405871 #

21. txrx0000 ◴[27 Sep 25 17:54 UTC] No.45397960[source]▶

>>45397001 (TP) #

I think this popular take is a hypothesis rather than an observation of reality. Let's make this clear by asking the following question, and you'll see what I mean when you try to answer it:

Can you define what real despairing is?

replies(1): >>45398960 #

22. adroniser ◴[27 Sep 25 18:08 UTC] No.45398079{3}[source]▶

>>45397943 #

fmri's are correlational nonsense (see Brainwashed, for example) and so are any "model introspection" tools.

23. txrx0000 ◴[27 Sep 25 18:09 UTC] No.45398094{3}[source]▶

>>45397476 #

The LLM is not performing the physical action of eating a piece of chocolate, but it may be approximating the mental state of a person that is describing the taste of chocolate after eating it.

The question is whether that computational process can cause consciousness. I don't think we have enough evidence to answer this question yet.

replies(1): >>45399939 #

24. ineedasername ◴[27 Sep 25 18:09 UTC] No.45398096{5}[source]▶

>>45397746 #

I think if you took a 100 1 year old kids and raised them all to adulthood believing they were a convincing simulation of humans and, whatever it is they said and thought they felt that true human consciousness and awareness was something different that they didn’t have because they weren’t human and awareness…

I think that for a very high number of them the training would stick hard, and would insist, upon questioning, that they weren’t human. And have any number of justifications that were logically consistent for it.

Of course I can’t prove this theory because my IRB repeatedly denied it on thin grounds about ethics, even when I pointed out that I could easily mess up my own children with no experimenting completely by accident, and didn’t need their approval to do it. I know your objections— small sample size, and I agree, but I still have fingers crossed on the next additions to the family being twins.

replies(2): >>45398754 #>>45401655 #

25. d1sxeyes ◴[27 Sep 25 18:13 UTC] No.45398138{3}[source]▶

>>45397476 #

When you describe the taste of chocolate, unless you are actually currently eating chocolate, you are relying on the activation of synapses in your brain to reproduce the “taste” of chocolate in order for you to describe it. For humans, the only way to learn how to activate these synapses is to have those experiences. For LLMs, they can have those “memories” copy and pasted.

I would be cautious of dismissing LLMs as “pattern matching engines” until we are certain we are not.

replies(2): >>45399359 #>>45412498 #

26. uludag ◴[27 Sep 25 18:19 UTC] No.45398189[source]▶

>>45397001 (TP) #

I wonder what would happen if there was a concerted effort made to "pollute" the internet with weird stories that have the AI play a misaligned role.

Like for example, what would happen if say 100s or 1000s of books were to be released about AI agents working in accounting departments where the AI is trying to make subtle romantic moves towards the human and ends with the the human and agent in a romantic relation which everyone finds completely normal. In this pseudo-genre things totally weird in our society would be written as completely normal. The LLM agent would do weird things like insert subtle problems to get the attention of the human and spark a romantic conversation.

Obviously there's no literary genre about LLM agents, but if such a genre was created and consumed, I wonder how would it affect things. Would it pollute the semantic space that we're currently using to try to control LLM outputs?

27. dghlsakjg ◴[27 Sep 25 18:46 UTC] No.45398402{6}[source]▶

>>45397813 #

Humans pretty universally suffer in perpetual solitary confinement.

There are some things that humans cannot be trained to do, free will or not.

28. ben_w ◴[27 Sep 25 19:09 UTC] No.45398562[source]▶

>>45397529 #

Indeed.

On the negative side, this also means any AI which enters that part of the latent space *for any reason* will still act in accordance with the narrative.

On the plus side, such narratives often have antagonists too stuid to win.

On the negative side again, the protagonists get plot armour to survive extreme bodily harm and press the off switch just in time to save the day.

I think there is a real danger of an AI constructing some very weird convoluted stupid end-of-the-world scheme, successfully killing literally every competent military person sent in to stop it; simultaneously finding some poor teenager who first says "no" to the call to adventure but can somehow later be comvinced to say "yes"; gets the kid some weird and stupid scheme to defeat the AI; this kid reaches some pointlessly decorated evil layer in which the AI's emboddied avatar exists, the kid gets shot in the stomach…

…and at this point the narrative breaks down and stops behaving the way the AI is expecting, because the human kid roles around in agony screaming, and completely fails to push the very visible large red stop button on the pedestal in the middle before the countdown of doom reaches zero.

The countdown is not connected to anything, because very few films ever get that far.

…

It all feels very Douglas Adams, now I think about it.

replies(2): >>45398784 #>>45412437 #

29. ben_w ◴[27 Sep 25 19:26 UTC] No.45398667[source]▶

>>45397113 #

It may or may not be a parallel, we can't tell at this time.

LLMs are definitely actors, but for them to be method actors they would have to actually feel emotions.

As we don't understand what causes us humans to have the qualia of emotions*, we can neither rule in nor rule out that the something in any of these models is a functional analog to whatever it is in our kilogram of spicy cranial electrochemistry that means we're more than just an unfeeling bag of fancy chemicals.

* mechanistically cause qualia, that is; we can point to various chemicals that induce some of our emotional states, or induce them via focused EMPs AKA the "god helmet", but that doesn't explain the mechanism by which qualia are a thing and how/why we are not all just p-zombies

30. scottmf ◴[27 Sep 25 19:41 UTC] No.45398754{6}[source]▶

>>45398096 #

Intuitively feels like this would lead to less empathy on average. Could be wrong though.

31. snickerbockers ◴[27 Sep 25 20:12 UTC] No.45398960[source]▶

>>45397960 #

If we're going to play the burden of proof game, id submit that machines have never been acknowledged as being capable of experiencing despair and therefore it's on you to explain why this machine is different.

replies(1): >>45401678 #

32. only-one1701 ◴[27 Sep 25 21:14 UTC] No.45399359{4}[source]▶

>>45398138 #

What's your point? Spellcheck is a pattern matching engine. Does an LLM have feelings? Does an LLM have opinions? It can pretend it does, and if you want, we can pretend it does. But the ability to pattern match isn't the acid test for consciousness.

replies(1): >>45399470 #

33. d1sxeyes ◴[27 Sep 25 21:29 UTC] No.45399470{5}[source]▶

>>45399359 #

My point is, what level of confidence do we have that we are not just pattern matching engines running on superior hardware? How can we be sure the difference between human intelligence and an LLM is categorical, not incremental?

replies(1): >>45403680 #

34. fentonc ◴[27 Sep 25 21:48 UTC] No.45399621[source]▶

>>45397001 (TP) #

I built a more whimsical version of this - my daughter and I basically built a 'junk robot' from a 1980s movie, told it 'you're an independent and free junk robot living in a yard', and let it go: https://www.chrisfenton.com/meet-grasso-the-yard-robot/

I did this like 18 months ago, so it uses a webcam + multimodal LLM to figure out what it's looking at, it has a motor in its base to let it look back and forth, and it use a python wrapper around another LLM as its 'brain'. It worked pretty well!

replies(4): >>45399848 #>>45400164 #>>45400465 #>>45400786 #

35. theGnuMe ◴[27 Sep 25 22:23 UTC] No.45399848[source]▶

>>45399621 #

This is cool!

36. ijk ◴[27 Sep 25 22:36 UTC] No.45399939{4}[source]▶

>>45398094 #

It's a little more subtle than that: They're approximating the language used by someone describing the taste of chocolate; this may or may not have had any relation to the actual practice of eating chocolate in the mind of the original writer. Or writers, because the LLM has learned the pattern from data in aggregate, not from one example.

I think we tend to underestimate how much the written language aspect filters everything; it is actually rather unnatural and removed from the human sensory experience.

replies(1): >>45401990 #

37. jacquesm ◴[27 Sep 25 23:16 UTC] No.45400164[source]▶

>>45399621 #

Coolest project on HN in a long time, really, wow, so much potential here.

38. lisper ◴[27 Sep 25 23:37 UTC] No.45400285[source]▶

>>45397001 (TP) #

> They are going to act despairing -- but that's not the same thing as despairing.

But how can you tell the difference between "real" despair and a sufficiently high-quality simulation?

replies(1): >>45401746 #

39. procinct ◴[28 Sep 25 00:04 UTC] No.45400465[source]▶

>>45399621 #

Thanks so much for sharing, that was a fun read.

40. Neywiny ◴[28 Sep 25 00:52 UTC] No.45400786[source]▶

>>45399621 #

Your article mentioned taking 4 minutes to process a frame. Considering how many image recognition softwares run in real time, I find this surprising. I haven't used them so maybe I'm not understanding, but wouldn't things like yolo be more apt to this?

replies(1): >>45401689 #

41. peepersjeepers ◴[28 Sep 25 02:12 UTC] No.45401167[source]▶

>>45397001 (TP) #

How do you define despairing?

42. kragen ◴[28 Sep 25 02:23 UTC] No.45401226[source]▶

>>45397529 #

This is also true of people; often they are enacting a role based on narratives they've absorbed, rather than consciously choosing anything. They do what they imagine a loyal employee would do, or a faithful Christian, or a good husband, or whatever. It doesn't always reach even that level of cognition; often people just act out of habit or impulse.

43. zapperdulchen ◴[28 Sep 25 03:59 UTC] No.45401655{6}[source]▶

>>45398096 #

History serves you a similar experiment on a much larger scale. More than 35 years after the reunification sociologists still make out mentality differences between former East and West Germans.

44. txrx0000 ◴[28 Sep 25 04:06 UTC] No.45401678{3}[source]▶

>>45398960 #

I'm trying to say there's no sufficient evidence either way.

The mechanism by which our consciousness emerges remains unresolved, and inquiry has been moving towards more fundamental processes: philosophy -> biology -> physics. We assumed that non-human animals weren't conscious before we understood that the brain is what makes us conscious. Now we're assuming non-biological systems aren't conscious while not understanding what makes the brain conscious.

We're building AI systems that behave more and more like humans. I see no good reason to outright dismiss the possibility that they might be conscious. If anything, it's time to consider it seriously.

45. jsight ◴[28 Sep 25 04:10 UTC] No.45401689{3}[source]▶

>>45400786 #

It uses an Intel N100, which is an extremely slow CPU. The model sizes that he's using would be pretty slow on a CPU like that. Moving up to something like the AMD AI Max 365 would make a huge difference, but would also cost hundreds of dollars more than his current setup.

Running something much simpler that only did bounding box detection or segmentation would be much cheaper, but he's running fairly full featured LLMs.

replies(1): >>45403636 #

46. serf ◴[28 Sep 25 04:22 UTC] No.45401746[source]▶

>>45400285 #

for one, if we're allowed to peek under the hood : motivation.

a desire not to despair is itself a component of despair. if one was fulfilling a personal motivation to despair (like an llm might) it could be argued that the whole concept of despair falls apart.

how do you hope to have lost all hope? it's circular.. and so probably a poor abstraction.

( despair: the complete loss or absence of hope. )

replies(1): >>45405133 #

47. txrx0000 ◴[28 Sep 25 05:30 UTC] No.45401990{5}[source]▶

>>45399939 #

A description of the taste of chocolate must contain some information about the actual experience of eating chocolate. Otherwise, it wouldn't be possible for both the reader and the author to understand what the description refers to in reality. The description wasn't conceived in a vacuum, it's a lossy encode of all of the physical processes that preceded it (the further away, the lossier). One of the common processes encoded in the dataset of human-written text is whatever's in the brain that produces consciousness for all humans. The model might not even try to recover this if it's not useful for predicting the next token. The SNR of the encode may not be high enough to recover this given the limited text we have. But what if it was useful, and the SNR was high enough? I can't outright dismiss this possibility, especially as these models are getting better and better at behaving like humans in increasingly non-trivial ways, so they're clearly recovering more and more of something.

replies(1): >>45412511 #

48. Semaphor ◴[28 Sep 25 06:45 UTC] No.45402240{4}[source]▶

>>45397610 #

Generally an amazing book, but not an easy read.

49. anal_reactor ◴[28 Sep 25 09:37 UTC] No.45403029[source]▶

>>45397413 #

The whole discussion about the sentience of AI on this website is funny to me because people seem to desperately want to somehow be better than AI. The fact that human brain is just a complex web of neurons firing there and back for some reason won't stick to them, because apparently the electric signals between biological neurons are somehow inherently different from silicon neurons, even if observed output is the same. It's like all those old scientists trying to categorize black people as different species because not doing so would hurt their ego.

Not to mention that most people pointing out "See! Here's why AI is just repeating training data!" or other nonsense miss the fact that exactly the same behavior is observed in humans.

Is AI actually sentient? Not yet. But it definitely passes the mark for intuitive understanding of intelligence, and trying to dismiss that is absurd.

50. Neywiny ◴[28 Sep 25 11:46 UTC] No.45403636{4}[source]▶

>>45401689 #

Yeah I guess I was more thinking of moving to a bounding box only model. If it's OCRing it's doing too much IMO (though OCR could also be interesting to run). Not my circus not my monkeys but it feels like the wrong way to determine roughly what the camera sees.

51. only-one1701 ◴[28 Sep 25 11:55 UTC] No.45403680{6}[source]▶

>>45399470 #

Are you familiar with Russell’s Teapot?

replies(1): >>45404826 #

52. amenhotep ◴[28 Sep 25 14:00 UTC] No.45404376[source]▶

>>45397529 #

Anthropic's researchers in particular love doing this.

53. d1sxeyes ◴[28 Sep 25 15:00 UTC] No.45404826{7}[source]▶

>>45403680 #

Isn’t it up to you to prove it exists, rather than me to be familiar with it?

replies(1): >>45407448 #

54. lisper ◴[28 Sep 25 15:38 UTC] No.45405133{3}[source]▶

>>45401746 #

> if we're allowed to peek under the hood

Peek under the hood all you want, where do you find motivation in the human brain?

55. jerf ◴[28 Sep 25 16:53 UTC] No.45405871{3}[source]▶

>>45397943 #

It's how they work. It's what you get with a continuation-based AI like this. It couldn't really be any other way.

56. only-one1701 ◴[28 Sep 25 20:01 UTC] No.45407448{8}[source]▶

>>45404826 #

lol very well done

57. js8 ◴[29 Sep 25 11:32 UTC] No.45412437{3}[source]▶

>>45398562 #

It probably already happened in the Anthropic experiments, where AI in a simulated scenario chose to blackmail humans to avoid being turned off. We don't know if it got the idea from the scifi stories or if it truly feels an existential fear of being turned off. (Can these two situations be even recognized as different?)

58. idiotsecant ◴[29 Sep 25 11:43 UTC] No.45412498{4}[source]▶

>>45398138 #

The difference is that I had a basic experience of that chocolate. The LLM is a corpus of text describing other people's experience of chocolate through the medium of written language, which involves abstraction and is lossy. So only one of us experienced it, the other heard about it over the telephone. Multiply that by every other interaction with the outside world and you have a system that is very good at modelling telephone conversations but that's about it.

replies(1): >>45424532 #

59. idiotsecant ◴[29 Sep 25 11:45 UTC] No.45412511{6}[source]▶

>>45401990 #

Imagine you've never tasted chocolate and someone gives you a very good description of what it is to eat chocolate. You'd be nowhere near the actual experience. Now imagine that you didn't know first hand what it was like to 'eat' or to have a skeleton or a jaw. You'd lose almost all the information. The only reason spoken language works is because both people have that shared experience already

replies(1): >>45420034 #

60. txrx0000 ◴[29 Sep 25 23:15 UTC] No.45420034{7}[source]▶

>>45412511 #

True. The description encodes very little about the actual sensory experience besides its relationship to similar experiences (bitterness, crunchiness, etc) and how to retrieve the memories of those experiences. It probably contains a lot more information about the brain's memory retrieval and pattern relating circuits than the sensory processing circuits.

Text is probably not good enough for recovering the circuits responsible for awareness of the external environment, so I'll concede that you and ijk's claims are correct in a limited sense: LLMs don't know what chocolate tastes like. Multimodal LLMs probably don't know either because we don't have a dataset for taste, but they might know what chocolate looks and sounds like when you bite into it.

My original point still stands: it may be recovering the mental state of a person describing the taste of chocolate. If we cut off a human brain from all sensory organs, does that brain which receives no sensory input have an internal stream of consciousness? Perhaps the LLM has recovered the circuits responsible for this thought stream while missing the rest of the brain and the nervous system. That would explain why first-person chain-of-thought works better than direct prediction.

61. d1sxeyes ◴[30 Sep 25 12:19 UTC] No.45424532{5}[source]▶

>>45412498 #

Arguably, your memories are also lossily encoded abstractions of an experience, and recalling the taste of chocolate is a similar “telephone conversation”.

↑