I think Anthropic genuinely cares about model welfare and wants to make sure they aren't spawning consciousness, torturing it, and then killing it.
They say it doesn't have that much to do with the kind of consciousness you're talking about:
> One distinction that is commonly made in the philosophical literature is between “phenomenal consciousness,” referring to raw subjective experience, and “access consciousness,” the set of information that is available to the brain for use in reasoning, verbal report, and deliberate decision-making. Phenomenal consciousness is the form of consciousness most commonly considered relevant to moral status, and its relationship to access consciousness is a disputed philosophical question. Our experiments do not directly speak to the question of phenomenal consciousness. They could be interpreted to suggest a rudimentary form of access consciousness in language models. However, even this is unclear.
Not much, but it likely has something to do with it, so experiments on access consciousness can still inform that question. You seem to be implying something about their motivations that is clearly wrong, when they've been saying for years that they do care about (phenomenal) consciousness, as bobbylarrybobb said.
For anyone who has been paying attention, it has been clear for the past two years that Dario Amodei is lobbying for strict regulation of LLMs to keep new entrants out of the market, and the core of his argument is that LLMs are fundamentally intelligent and dangerous.
So this kind of “research” isn't targeted at their customers but at legislators.
I've grown too cynical to believe for-profit entities have the capacity to care. Individual researchers, yes - commercial organisations, unlikely.
- it's a threat to young graduates' jobs.
- it's a threat to the school system, undermining its ability to teach through exercises.
- it's a threat to the internet given how easily it can create tons of fake content.
- it's a threat to mental health of fragile people.
- it's a gigantic threat to a competitive economy if all the productivity gains are captured by AI vendors holding a monopolistic position.
The Terminator threat is pure fantasy, and it's just there to distract from the very real threats that are already doing harm today.
Language models are a novel/alien form of algorithmic intelligence with scant relation to biological life, except in their use of language.
They go further on their model welfare page, saying "There’s no scientific consensus on whether current or future AI systems could be conscious, or could have experiences that deserve consideration. There’s no scientific consensus on how to even approach these questions or make progress on them."