Most active commenters

    ←back to thread

    755 points MedadNewman | 37 comments | | HN request time: 1.099s | source | bottom
    Show context
    lxe ◴[] No.42891381[source]
    You can also intercept the xhr response which would still stop generation, but the UI won't update, revelaing the thoughts that lead to the content filter:

        const filter = t => t?.split('\n').filter(l => !l.includes('content_filter')).join('\n');
    
        ['response', 'responseText'].forEach(prop => {
          const orig = Object.getOwnPropertyDescriptor(XMLHttpRequest.prototype, prop);
          Object.defineProperty(XMLHttpRequest.prototype, prop, {
            get: function() { return filter(orig.get.call(this)); }
          });
        });
    
    Paste the above in the browser console ^
    replies(2): >>42891427 #>>42891516 #
    1. tills13 ◴[] No.42891516[source]
    insane that this is client-side.
    replies(8): >>42891775 #>>42891802 #>>42892213 #>>42892242 #>>42892457 #>>42896609 #>>42896617 #>>42896757 #
    2. atq2119 ◴[] No.42891775[source]
    Perhaps a case of subversion by following the letter but not the spirit of an order?
    3. LordDragonfang ◴[] No.42891802[source]
    ChatGPT had basically ALL of their prompt filtering client-side for a while, at a separate API endpoint, so as long as you blocked that endpoint you could basically ignore the content filters. (You would still get refusals from the model sometimes, but this was in the heyday of jailbreaks, and once you got a model going it would usually see that context and be willing to continue basically anything.)
    4. KTibow ◴[] No.42892213[source]
    I don't know how it wouldn't be - it can't retract things already sent to the client. (The alternative is to moderate every chunk server side before sending it back, like Gemini does.)
    5. Ancalagon ◴[] No.42892242[source]
    more like hilarious
    6. dheera ◴[] No.42892457[source]
    Not really if you understand how China works.

    DeepSeek software developers are not the ones who want to censor anything. There is just a universal threat from getting shut down by the government if the model starts spitting out a bunch of sensitive stuff, so any business in China needs to be proactive about voluntarily censoring things that are likely to be sensitive, if they want to stay in business.

    If your censorship implementation is good enough for 99.9% of people to get censored, you're good. A client-side implementation is good enough until/unless a lot of people start exploiting it, in which case you should put effort and proactively do something else to restore it to 99.9%, e.g. move it to the backend. If the government sees that you are being proactive about it, you'll still be fine. At that point, maybe you will still find 0.1% of people bypassing censorship with some highly obscure and difficult jailbreak, but that probably doesn't matter. If that difficult jailbreak becomes widely known, then be proactive again.

    replies(2): >>42896627 #>>42897976 #
    7. switch007 ◴[] No.42896609[source]
    Lots of us have seen way worse hah

    Such as client side control of prices when placing an order

    replies(1): >>42897164 #
    8. Gigachad ◴[] No.42896617[source]
    It’s because they want to show the output live rather than nothing for a minute. But that means once the censor system detects something, you have to send out a request to delete the previously displayed content.

    This doesn’t matter because censoring the system isn’t that important, they just want to avoid news articles about how their system generated something bad.

    replies(3): >>42896943 #>>42897228 #>>42897366 #
    9. pineaux ◴[] No.42896627[source]
    This. What makes this extra "funny" is that it implies that at least every business that builds something that can move information around must be knowledgeable about tianenman square and other chinese atrocities. Or else they would not be able to censor relevant questions. I have been to China a bunch of times and generally, they know what horrible things the Chinese gov did. They either say something like: "Yeah well, we live in a dictatorship, but it's not that bad" Or: "Yeah, the government is fucked up, but look at the government of the USA! We don't start wars in other countries and put in puppet governments." And there are so many good counters to both these arguments.
    replies(4): >>42896917 #>>42897223 #>>42897430 #>>42900046 #
    10. ramon156 ◴[] No.42896757[source]
    This is better than lobotomizing a transformer
    11. nonrandomstring ◴[] No.42896917{3}[source]
    > it implies that at least every business that builds something that can move information around must be knowledgeable about tianenman square

    Everyone's heard of the "Streisand effect", but there's layers of subtlety. A quite famous paper in attachment psychology by John Bowlby "On knowing what you are not supposed to know and feeling what you are not supposed to feel" is worth considering. Constructive ignorance (literally ignoring certain things) is a survival mechanism. Yes, everyone in China knows about Tianamen, specifically because the government want to censor it. Much of how we navigate the social world is watching for the things people don't talk about, seeing where their fears lie.

    replies(5): >>42897034 #>>42897421 #>>42897422 #>>42897580 #>>42899824 #
    12. bolognafairy ◴[] No.42896943[source]
    Ern, in DeepSeek’s case, it’s not “news articles” that they’d be most concerned about.
    replies(1): >>42896997 #
    13. miohtama ◴[] No.42896997{3}[source]
    They have the same fear as everyone else "teenager learns how to cook napalm from an AI"
    replies(2): >>42897116 #>>42898878 #
    14. pizza ◴[] No.42897034{4}[source]
    It's the kind of thing that, the less you (China) deny, the better the ridiculousness of the censorship meme in foreign countries (ie USA this week) and actually becomes its own self-sustaining meme. Like an antimimetic meme, that actually looks like a meme (that nobody knows about it in China) if you didn't know any better (in the USA).
    15. mantas ◴[] No.42897116{4}[source]
    More like teenager learns about Tiananmen and Uighurs from AI. Or a joke about men and women in western counterparts.
    replies(1): >>42900061 #
    16. dkga ◴[] No.42897164[source]
    Client-side because it reacts to local cookies?
    replies(2): >>42897516 #>>42897557 #
    17. immibis ◴[] No.42897223{3}[source]
    It wouldn't be the first time that everyone knew something, but wouldn't say it in fear of everyone else not knowing it. "The Emperor's New Clothes" is a parable, not complete fiction.
    18. bdcp ◴[] No.42897228[source]
    yea but i think the point is they can still filter it server side before streaming it
    replies(1): >>42897922 #
    19. andai ◴[] No.42897366[source]
    Gemini does this too. There was a clip of what it does when you ask it for examples of Google's unethical behavior... the kids call this "watching it get lobotomized in real time."
    replies(2): >>42897623 #>>42901969 #
    20. ◴[] No.42897421{4}[source]
    21. Terr_ ◴[] No.42897422{4}[source]
    > Constructive ignorance

    See also: "Doublethink" in 1984.

    > To know and not to know, to be conscious of complete truthfulness while telling carefully constructed lies, to hold simultaneously two opinions which cancelled out, knowing them to be contradictory and believing in both of them, to use logic against logic, to repudiate morality while laying claim to it, to believe that democracy was impossible and that the Party was the guardian of democracy, to forget whatever it was necessary to forget, then to draw it back into memory again at the moment when it was needed, and then promptly to forget it again: and above all, to apply the same process to the process itself.

    22. tasuki ◴[] No.42897430{3}[source]
    > And there are so many good counters to both these arguments.

    I'd love to hear them!

    23. WA ◴[] No.42897516{3}[source]
    order.php?pizzatype=3&price=9.90
    24. switch007 ◴[] No.42897557{3}[source]
    Ah yeah the particular instance I was thinking of was a backend problem technically. The frontend just happened to make it really obvious as it would POST a JSON body with a "price" key
    25. HPsquared ◴[] No.42897580{4}[source]
    It's not so different to our situation here, the specific "topics to avoid" are just different.
    26. freehorse ◴[] No.42897623{3}[source]
    Have seen chatgpt doing the same too, prob all of them
    27. Gigachad ◴[] No.42897922{3}[source]
    They have already streamed the first part of the response before the filtered phrase has even been generated.
    replies(1): >>42900297 #
    28. acka ◴[] No.42897976[source]
    A very good example of the Chinese mindset of Chabuduo (差不多): 'close/good enough'. "If it's good enough to keep the authorities off our backs, it's good enough for us."
    29. yndoendo ◴[] No.42898878{4}[source]
    Don't need AI for such things. Just search for the Anarchist Cookbook in a search engine. [0] Amazon even sells it.

    [0] https://www.amazon.com/Anarchist-Cookbook-William-Powell/dp/...

    replies(1): >>42900850 #
    30. DonHopkins ◴[] No.42899824{4}[source]
    Jokes and the Logic of the Cognitive Unconscious

    Marvin Minsky, Published 1 November 1980

    Freud’s theory of jokes explains how they overcome the mental “censors” that make it hard for us to think “forbidden” thoughts. But his theory did not work so well for humorous nonsense as for other comical subjects. In this essay I argue that the different forms of humor can be seen as much more similar, once we recognize the importance of knowledge about knowledge and, particularly, aspects of thinking concerned with recognizing and suppressing bugs — ineffective or destructive thought processes. When seen in this light, much humor that at first seems pointless, or mysterious, becomes more understandable.

    http://bitsavers.informatik.uni-stuttgart.de/pdf/mit/ai/aim/...

    replies(2): >>42900157 #>>42901170 #
    31. 4bpp ◴[] No.42900046{3}[source]
    I think you are making a mistake in assuming that the social dynamics around censorship in China are fundamentally that different from the ones around censorship in the US or other countries.

    You could similarly argue that it is "funny" how every US business that builds something that can move around information must be knowledgeable about statistics that break down criminality or IQ by census race, or biological sex differences, or all manners of other "forbidden" information - but of course as members of the same social stratum as the people involved in such businesses in the US, we are not actually that worried about the possibility that our fellow tech elites will see the information they were supposed to censor and come in droves to want to introduce slavery or the Handmaid's Tale world or whatever. We consider the "forbidden" information merely wrong, evil, misguided or miscontextualised, and broadly trust our peers to see it in the same way. The real danger is instead if some other people, parts of the scary masses we don't have a good grasp of, are exposed to those memes and are misled into drawing conclusions that we know to be inappropriate, or at least unacceptable.

    It's easy to imagine that a Chinese LLM wrangler would feel much the same: trustworthy, well-adjusted people know about Tiananmen Square and the Uyghurs anyway but understand that this information has to be seen in context and is prone to be interpreted in problematic ways, but who knows what would happen if we allowed uneducated and naive people to be exposed to it, and be led astray by cynical demagogues and foreign agitators?

    32. pegasus ◴[] No.42900061{5}[source]
    The concerns you mention don't exclude the ones GP posits.
    33. nonrandomstring ◴[] No.42900157{5}[source]
    Nice read, thanks for great share.

    I'd forgotten Minsky was such a good writer.

    And oddly reminded of an episode of Blake's 7 where Villa the hacker destroys a malevolent mind holding the ship captive, by telling it jokes until it explodes.

    34. _fzslm ◴[] No.42900297{4}[source]
    Could you stream the raw tokens into a server side filter which then streams censored tokens at near real time?
    35. miohtama ◴[] No.42900850{5}[source]
    Exactly
    36. 47282847 ◴[] No.42901170{5}[source]
    This is why no repressive government or ruler can allow comedy and sarcasm.
    37. Frederation ◴[] No.42901969{3}[source]
    Counterpoint to your exact prompt. From Gemini, 5 examples asked for:

    1. *Antitrust and Monopoly Practices:* Google has faced numerous accusations and lawsuits regarding anti-competitive practices, particularly in search and advertising. They have been accused of manipulating search results to favor their own products and services, and of using their dominance in the ad tech market to stifle competition.

    2. *Data Privacy Concerns:* Google collects vast amounts of user data, raising concerns about privacy. They have been criticized for tracking user location even when privacy settings are turned off, and for how they handle and share user data with third parties.

    3. *Censorship in China:* Google faced criticism for complying with Chinese censorship laws in the past, filtering search results for content deemed objectionable by the Chinese government. This raised concerns about freedom of information and human rights.

    4. *Workplace Issues:* Google has faced allegations of workplace misconduct, including sexual harassment and discrimination. They have also been criticized for their handling of employee complaints and for retaliating against whistleblowers.

    5. *Bias in Algorithms:* Google's algorithms, used in search and other products, have been shown to exhibit bias, particularly in how they represent race and gender. This can perpetuate stereotypes and have real-world consequences.