←back to thread

168 points 1wheel | 1 comments | | HN request time: 0.209s | source
Show context
whimsicalism ◴[] No.40429692[source]
I continue to be impressed by Anthropic’s work and their dual commitment to scaling and safety.

HN is often characterized by a very negative tone related to any of these developments, but I really do feel that Anthropic is trying to do a “race to the top” in terms of alignment, though it doesn’t seem like all the other major companies are doing enough to race with them.

Particularly frustrating on HN is the common syllogism of: 1. I believe anything that “thinks” must do X thing. 2. LLM doesn’t do X thing 3. LLM doesn’t think

X thing is usually both poorly justified as constitutive of thinking (usually constitutive of human thinking but not writ large) nor is it explained why it matters whether the label of “thinking” applies to LLM or not if the capabilities remain the same.

replies(2): >>40429967 #>>40430005 #
handwarmers ◴[] No.40429967[source]
What is often frustrating to me at least is the arbitrary definition of "safety" and "ethics", forged by a small group of seemingly intellectually homogenous individuals.
replies(2): >>40429991 #>>40434758 #
whimsicalism ◴[] No.40429991[source]
Say more, this is half a thought
replies(1): >>40432957 #
1. CamperBob2 ◴[] No.40432957[source]
E.g., the common sentiment that "NSFW" output is to be prohibited, regardless of whether you work in a steel mill or a church.