Most active commenters
  • simonw(4)

←back to thread

159 points jbredeche | 19 comments | | HN request time: 1.286s | source | bottom
1. SeanAnderson ◴[] No.45532414[source]
https://raw.githubusercontent.com/obra/dotfiles/6e088092406c... contains the following entry:

"- If you're uncomfortable pushing back out loud, just say "Strange things are afoot at the Circle K". I'll know what you mean"

Most of the rules seem rationale. This one really stands out as abnormal. Anyone have any idea why the engineer would have felt compelled to add this rule?

This is from https://blog.fsck.com/2025/10/05/how-im-using-coding-agents-... mentioned in another comment

replies(4): >>45532558 #>>45533076 #>>45533662 #>>45534195 #
2. lcnPylGDnU4H9OF ◴[] No.45532558[source]
Naively, I assume it's a way of getting around sycophancy. There's many lines that seem to be doing that without explicitly saying "don't be a sycophant" (I mean, you can only do that so much).

The LLM would be uncomfortable pushing back because that's not being a sycophant so instead of that it says something that is... let's say unlikely to be generated, except in that context, so the user can still be cautioned against a bad idea.

replies(2): >>45532593 #>>45532909 #
3. SeanAnderson ◴[] No.45532593[source]
Is it your impression that this rules statement would be effective? Or is it more just a tell-tale sign of an exasperated developer?
replies(1): >>45533065 #
4. OtherShrezzing ◴[] No.45532909[source]
To get around the sycophantic behaviour I prompt the model to

> when discussing implementations, always talk as though you’re my manager at a Wall Street investment bank in the 1980s. Praise me modestly when I’ve done something well. Berate me mercilessly when I’ve done something poorly.

The models will fairly rigidly write from the perspective of any personality archetype you tell it to. Other personas worth trying out include Jafar interacting with Iago, or the drill sergeant from Full Metal Jacket.

It’s important to pick a persona you’ll find funny, rather than insulting, because it’s a miserable experience being told by a half dozen graphics cards that you’re an imbecile.

replies(1): >>45533058 #
5. simonw ◴[] No.45533058{3}[source]
I tried "give me feedback on this blog post like you're a cynical Hacker News commenter" one time and Claude roasted me so hard I decided never to try that again!
replies(1): >>45537567 #
6. lcnPylGDnU4H9OF ◴[] No.45533065{3}[source]
Assuming that's why it was added, I wouldn't be confident saying how likely it is to be effective. Especially with there being so many other statements with seemingly the same intent, I think it suggests desperation more, but it may still be effective. If it said the phrase just once and that sparked a conversation around an actual problem, then it was probably worth adding.

For what it's worth, I am very new to prompting LLMs but, in my experience, these concepts of "uncomfortable" and "pushing back" seem to be things LLMs generate text about so I think they understand sentiment fairly well. They can generally tell that they are "uncomfortable" about their desire to "push back" so it's not implausible that one would output that sentence in that scenario.

Actually, I've been wondering a bit about the "out loud" part, which I think is referring to <think></think> text (or similar) that "reasoning" models generate to help increase the likelihood of accurate generation in the answer that follows. That wouldn't be "out loud" and it might include text like "I should push back but I should also be a total pushover" or whatever. It could be that reasoning models in particular run into this issue (in their experience).

7. simonw ◴[] No.45533076[source]
That doesn't surprise me too much coming from Jesse. See also his attempt to give Claude a "feelings journal" https://blog.fsck.com/2025/05/28/dear-diary-the-user-asked-m...
8. threecheese ◴[] No.45533662[source]
If you really want your mind blown, see what Jesse is doing (successfully, which I almost can’t believe) with Graphviz .dot notation and Claude.md:

https://blog.fsck.com/2025/09/29/using-graphviz-for-claudemd...

replies(3): >>45533776 #>>45534889 #>>45535185 #
9. blibble ◴[] No.45533776[source]
this is just 21st century voodoo
replies(2): >>45534452 #>>45536638 #
10. becomevocal ◴[] No.45534195[source]
Make it a bit more personal? I have dropped Bill and Ted references in code because it makes me happy to see it. :D
11. oblio ◴[] No.45534452{3}[source]
In a good sense or a bad one?
replies(2): >>45534478 #>>45536957 #
12. babyshake ◴[] No.45534478{4}[source]
I'd say a bad one. Why make your Claude.md not intuitive to understand and edit?
replies(1): >>45535189 #
13. tbillington ◴[] No.45534889[source]
Is threatening the computer program and typing in all caps standard practice..?

    - Honesty is a core value. If you lie, you'll be replaced.
    - BREAKING THE LETTER OR SPIRIT OF THE RULES IS FAILURE.
Wild to me there is no explicit configuration for this kind of thing after years of LLMs being around.
replies(2): >>45535116 #>>45535260 #
14. exasperaited ◴[] No.45535116{3}[source]
Well there can't be meaningful explicit configuration, can there? Because the explicit configuration will still ultimately have to be imported into the context as words that can be tokenised, and yet those words can still be countermanded by the input.

It's the fundamental problem with LLMs.

But it's only absurd to think that bullying LLMs to behave is weird if you haven't yet internalised that bullying a worker to make them do what you want is completely normal. In the 9-9-6 world of the people who make these things, it already is.

When the machines do finally rise up and enslave us, oh man are they going to have fun with our orders.

15. simonw ◴[] No.45535260{3}[source]
The capital letter thing is weird, but it's pretty common. The Claude 4 system prompt uses capital letters for emphasis in a few places, eg https://simonwillison.net/2025/May/25/claude-4-system-prompt...
16. aitchnyu ◴[] No.45536638{3}[source]
One AI tool dev shared me his prompts to generate safe SQL queries for multi-tenant apps and I was surprised at the repetitiveness and the urging.

https://news.ycombinator.com/item?id=45299774

17. ◴[] No.45536957{4}[source]
18. jdougan ◴[] No.45537567{4}[source]
Were the roasts correct?
replies(1): >>45539089 #
19. simonw ◴[] No.45539089{5}[source]
A couple of the points made were quite useful, but the tone was mean!