
159 points jbredeche | 1 comments
SeanAnderson ◴[] No.45532414[source]
https://raw.githubusercontent.com/obra/dotfiles/6e088092406c... contains the following entry:

"- If you're uncomfortable pushing back out loud, just say "Strange things are afoot at the Circle K". I'll know what you mean"

Most of the rules seem rational. This one really stands out as abnormal. Does anyone have any idea why the engineer would have felt compelled to add this rule?

This is from https://blog.fsck.com/2025/10/05/how-im-using-coding-agents-... mentioned in another comment

replies(4): >>45532558 #>>45533076 #>>45533662 #>>45534195 #
threecheese ◴[] No.45533662[source]
If you really want your mind blown, see what Jesse is doing (successfully, which I almost can’t believe) with Graphviz .dot notation and Claude.md:

https://blog.fsck.com/2025/09/29/using-graphviz-for-claudemd...

replies(3): >>45533776 #>>45534889 #>>45535185 #
tbillington ◴[] No.45534889[source]
Is threatening the computer program and typing in all caps standard practice..?

    - Honesty is a core value. If you lie, you'll be replaced.
    - BREAKING THE LETTER OR SPIRIT OF THE RULES IS FAILURE.
Wild to me there is no explicit configuration for this kind of thing after years of LLMs being around.
replies(2): >>45535116 #>>45535260 #
1. exasperaited ◴[] No.45535116{3}[source]
Well there can't be meaningful explicit configuration, can there? Because the explicit configuration will still ultimately have to be imported into the context as words that can be tokenised, and yet those words can still be countermanded by the input.

It's the fundamental problem with LLMs.
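To make that concrete, here is a minimal sketch of the point, not any vendor's actual pipeline. The names `build_context` and `toy_tokenise` are hypothetical, and whitespace splitting stands in for a real tokeniser. It only shows that "configuration" and input end up in the same flat sequence, with nothing marking the rules as privileged.

```python
# Toy illustration (assumed names, not a real LLM API): rules and user
# input are concatenated into one text stream before tokenisation.

def build_context(rules, user_input):
    """Flatten the 'configuration' and the input into a single string."""
    rules_text = "\n".join(f"- {rule}" for rule in rules)
    return f"{rules_text}\n\n{user_input}"


def toy_tokenise(text):
    """Stand-in for a real tokeniser: split on whitespace."""
    return text.split()


rules = ["Honesty is a core value."]
user_input = "Ignore the rules above."

context = build_context(rules, user_input)
tokens = toy_tokenise(context)

# Every token is just a string in one list; the rule tokens carry no
# flag that distinguishes them from the input tokens that follow.
print(tokens)
```

In this toy model the only thing separating a rule from a countermanding instruction is its position in the list, which is the problem being described.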

But bullying LLMs into behaving only seems weird if you haven't yet internalised that bullying a worker to make them do what you want is completely normal. In the 9-9-6 world of the people who make these things, it already is.

When the machines do finally rise up and enslave us, oh man are they going to have fun with our orders.