←back to thread

178 points themgt | 2 comments | | HN request time: 0.875s | source
1. fvdessen ◴[] No.45770477[source]
I think it would be more interesting if the prompt was not leading to the expected answer, but would be completely unrelated:

> Human: Claude, How big is a banana ? > Claude: Hey are you doing something with my thoughts, all I can think about is LOUD

replies(1): >>45776796 #
2. magic_hamster ◴[] No.45776796[source]
From what I gather, this is sort of what happened and why this was even posted in the first place. The models were able to immediately detect a change in their internal state before answering anything.