←back to thread

745 points melded | 4 comments | | HN request time: 0s | source
Show context
srameshc ◴[] No.45946518[source]
So does that mean if Heretic is used for models like Deepseek and Qwen it can talk about subjects 1989 Tiananmen Square protests, Uyghur forced labor claims, or the political status of Taiwan. I am trying to understand the broader goals around such tools.
replies(4): >>45946598 #>>45946747 #>>45946759 #>>45952005 #
kachapopopow ◴[] No.45946598[source]
the models already talk about it just fine if you load them up yourself, only the web api from official deepseek has these issues because they are required to do so by law.
replies(1): >>45946732 #
throwawaymaths ◴[] No.45946732[source]
That is not the case.
replies(2): >>45948796 #>>45955400 #
1. ls612 ◴[] No.45948796{3}[source]
I just tested this with Deepseek in Nvidia's AI sandbox and in Groq (so the inference was performed in the US) and it happily told me what happened on June 4, 1989. Stop spreading disinformation.
replies(2): >>45949312 #>>45951569 #
2. int_19h ◴[] No.45949312[source]
Qwen will refuse usually. Even more hideously, if you just ask it in general terms about anything historically interesting that happened on Tiananmen Square, it will remember 1989 in its CoT, and (usually) then decide to not mention it because it's "controversial".

However, it's fairly easy to argue the model into admitting that it's unethical to do so and get it to talk.

3. astrange ◴[] No.45951569[source]
I've been told by people running Qwen locally in production that they'll have downtime incidents if it's required to think about anything with any implication that Taiwan is a separate country.
replies(1): >>45986444 #
4. throwawaymaths ◴[] No.45986444[source]
that makes no sense at all.