Signs of introspection in large language models

(www.anthropic.com)

178 points themgt | 4 comments | 30 Oct 25 16:45 UTC | HN request time: 0s | source

Show context

xanderlewis ◴[31 Oct 25 22:01 UTC] No.45777174[source]▶

>>45762064 (OP) #

Given that this is 'research' carried out (and seemingly published) by a company with a direct interest in selling you a product (or, rather, getting investors excited/panicked), can we trust it?

replies(6): >>45777183 #>>45777199 #>>45779098 #>>45779186 #>>45780472 #>>45781756 #

bobbylarrybobby ◴[31 Oct 25 22:04 UTC] No.45777199[source]▶

>>45777174 #

Would knowing that Claude is maybe kinda sorta conscious lead more people to subscribe to it?

I think Anthropic genuinely cares about model welfare and wants to make sure they aren't spawning consciousness, torturing it, and then killing it.

replies(4): >>45777638 #>>45778064 #>>45779830 #>>45780094 #

1. littlestymaar ◴[01 Nov 25 07:23 UTC] No.45779830[source]▶

>>45777199 #

> Would knowing that Claude is maybe kinda sorta conscious lead more people to subscribe to it?

For anyone having paid attention, it has been clear for the past two years that Dario Amodei is lobbying for strict regulation on LLMs to prevent new entrants on the market, and the core of its argument is that LLMs are fundamentally intelligent and dangerous.

So this kind of “research” isn't targeted towards their customers but towards the legislators.

replies(2): >>45780107 #>>45783562 #

2. baq ◴[01 Nov 25 08:33 UTC] No.45780107[source]▶

>>45779830 (TP) #

The thing is, if he is right, or will be in the near future, regulators will get scared and ban the things outright, throwing the baby out with the bathwater. Yes, he benefits if they step in early, but it isn’t a given that we all don’t when this happens.

replies(1): >>45780466 #

3. littlestymaar ◴[01 Nov 25 09:57 UTC] No.45780466[source]▶

>>45780107 #

We already know AI is a very serious threat:

- it's a threat for young graduates' jobs.

- it's a threat to the school system, undermining its ability to teach through exercises.

- it's a threat to the internet given how easily it can create tons of fake content.

- it's a threat to mental health of fragile people.

- it's a gigantic threat to a competitive economy if all the productivity gains are being grabbed by the AI editors through a monopolistic position.

The terminator threat is pure fantasy and it's just here to distract from the very real threats that are already doing harm today.

4. xanderlewis ◴[01 Nov 25 17:35 UTC] No.45783562[source]▶

>>45779830 (TP) #

I can't be exactly sure of the intended target, but it certainly helps to increase the sense of FOMO among investors even if as an unintended side effect (though I don't think it is unintended).

↑