(substack.com)

755 points MedadNewman | 1 comments | 31 Jan 25 19:41 UTC


ks2048 ◴[31 Jan 25 20:16 UTC] No.42891407[source]▶

Part of the blog is hypothesizing that the censorship is in a separate filtering stage rather than the model itself. But, the example of hex encoding doesn't prove or disprove that at all, does it? Can't you just check on a version running open-source weights?
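For anyone who wants to try this against open weights, here is a minimal sketch of what "speaking in hex" amounts to: the prompt is sent as hex-encoded UTF-8 bytes and the reply is decoded the same way. The function names `to_hex` and `from_hex` are made up for illustration, and the space-separated format is an assumption, not necessarily what the blog post used.

```python
def to_hex(text: str) -> str:
    """Encode text as space-separated hex bytes (UTF-8)."""
    return " ".join(f"{b:02x}" for b in text.encode("utf-8"))


def from_hex(hex_text: str) -> str:
    """Decode space-separated hex bytes back into text."""
    # bytes.fromhex ignores the spaces between byte pairs.
    return bytes.fromhex(hex_text).decode("utf-8")


if __name__ == "__main__":
    encoded = to_hex("Hi")
    print(encoded)            # 48 69
    print(from_hex(encoded))  # Hi
```

Running the same encoded prompt against the hosted chat and against local weights would be one way to see which layer the refusals come from.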

replies(2): >>42891485 #>>42891489 #

amrrs ◴[31 Jan 25 20:23 UTC] No.42891485[source]▶

>>42891407 #

I ran the distilled models locally, and some of the censorship is still there.

But their hosted chat also has some keyword-based filters: the moment the model generates the Chinese president's name or other controversial keywords, the "thinking" stops abruptly!
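A rough sketch of how a keyword filter on the output stream could produce that behavior. This is a guess at the mechanism, not DeepSeek's actual code: `filtered_stream` and the `BLOCKLIST` entry are hypothetical placeholders.

```python
from typing import Iterable, Iterator

# Hypothetical placeholder; the real keyword list (if one exists) is unknown.
BLOCKLIST = {"forbidden topic"}


def filtered_stream(tokens: Iterable[str],
                    blocklist: set = BLOCKLIST) -> Iterator[str]:
    """Yield tokens until the accumulated text contains a blocked keyword."""
    seen = ""
    for tok in tokens:
        seen += tok
        if any(keyword in seen.lower() for keyword in blocklist):
            return  # stop abruptly, mid-response
        yield tok


if __name__ == "__main__":
    plain = ["The ", "forbidden ", "topic", " is", " ..."]
    print(list(filtered_stream(plain)))   # cut off once the keyword completes

    hexed = ["66 6f 72 62 ", "69 64 64 65 ", "6e"]
    print(list(filtered_stream(hexed)))   # passes through untouched
```

A filter like this only matches literal strings, which would explain why hex-encoded output slips past it while the same content in plain text gets cut off.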

replies(1): >>42891513 #

1. prettyblocks ◴[31 Jan 25 20:25 UTC] No.42891513[source]▶

>>42891485 #

The distilled versions I've run through Ollama are absolutely censored and don't even populate the <think></think> section for some of those questions.
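If you want to check this systematically, a small sketch for detecting an empty reasoning section in the raw model output. It assumes the response text contains literal `<think>` and `</think>` tags; `think_is_empty` is a made-up helper name, and it works on a plain string from whatever local runner you use.

```python
import re

# Non-greedy match of everything between the first <think> ... </think> pair.
THINK_RE = re.compile(r"<think>(.*?)</think>", re.DOTALL)


def think_is_empty(response: str) -> bool:
    """Return True if the <think> section is missing or only whitespace."""
    match = THINK_RE.search(response)
    return match is None or not match.group(1).strip()


if __name__ == "__main__":
    refused = "<think>\n\n</think>\nSorry, I can't help with that."
    answered = "<think>The user asks about X.</think>Here is the answer."
    print(think_is_empty(refused))   # True
    print(think_is_empty(answered))  # False
```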


Bypass DeepSeek censorship by speaking in hex