←back to thread

617 points jbegley | 2 comments | | HN request time: 0.424s | source
Show context
stainablesteel ◴[] No.42941417[source]
It's not like they're above lying, why do they even care to update this?
replies(2): >>42941703 #>>42941914 #
1. aradox66 ◴[] No.42941703[source]
Would it be too far out there to imagine that the LLMs they were training for weapons systems knew it violated their rules and were resisting compliance?

The alignment-faking research seems to indicate that LLMs exercise of this kind of reasoning.

replies(1): >>42941918 #
2. janalsncm ◴[] No.42941918[source]
Depends on the weapons system but it would probably not be an LLM, it would be a neural network trained to locate and identify people in a video for example.

And even if it was, they wouldn’t tell the system it was part of old non-evil Google.