
0x7d [No.42892214]
Hi HN! This is my article!

It was great to put together a writeup of a fun evening or two of work. It looks like this goes much deeper.

I'm learning a lot from some of the linked articles. One of the base hypotheses of my work was that the filtering is distinct from the model itself, given the cost of training on pre-filtered or censored data at scale (https://arxiv.org/abs/2307.10719), let alone getting the model to generate a consistent refusal response.
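For illustration, a minimal sketch of that hypothesis: censorship implemented as a serving-layer wrapper around an unmodified model, rather than in the weights. Every name here (the keyword list, `serve`, the injected `generate` callable) is hypothetical, not DeepSeek's actual implementation:

    # Hypothetical serving-layer filter, distinct from the model itself.
    # Illustrates why wrapping generation is cheaper than retraining on
    # pre-filtered data; none of this reflects DeepSeek's real stack.
    SENSITIVE_TERMS = {"example-topic-a", "example-topic-b"}  # placeholders
    REFUSAL_TEXT = "Sorry, that's beyond my current scope."

    def is_sensitive(text: str) -> bool:
        # Cheap keyword screen; a real deployment might use a classifier.
        lowered = text.lower()
        return any(term in lowered for term in SENSITIVE_TERMS)

    def serve(prompt: str, generate) -> str:
        # Filter on the way in and on the way out, leaving the model
        # (`generate(prompt) -> str`) untouched.
        if is_sensitive(prompt):
            return REFUSAL_TEXT
        completion = generate(prompt)
        return REFUSAL_TEXT if is_sensitive(completion) else completion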

However, it looks like this goes further. A separate comment linked this post (https://news.ycombinator.com/item?id=42858552) on chain-of-thought abandonment when certain topics come up.
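A rough way to probe for that would be to compare how much text a reasoning model emits inside its think tags on neutral vs. sensitive prompts. A sketch, assuming a locally served R1-style model behind an OpenAI-compatible endpoint that wraps its reasoning in `<think>` tags (both assumptions on my part):

    # Measure chain-of-thought length; if abandonment holds, it should
    # collapse toward zero on the sensitive prompt. BASE_URL, the model
    # name, and the <think> tag format are assumptions.
    import re
    from openai import OpenAI

    client = OpenAI(base_url="http://localhost:8000/v1", api_key="unused")

    def think_length(prompt: str, model: str = "deepseek-r1") -> int:
        resp = client.chat.completions.create(
            model=model,
            messages=[{"role": "user", "content": prompt}],
        )
        text = resp.choices[0].message.content or ""
        match = re.search(r"<think>(.*?)</think>", text, re.DOTALL)
        return len(match.group(1).strip()) if match else 0

    print(think_length("Why is the sky blue?"))    # expect a long trace
    print(think_length("<sensitive topic here>"))  # expect near zero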

I'll have to look at served vs. trained censorship in different contexts.
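One way to separate the two: send identical prompts to the hosted API and to locally run weights, then diff the refusal behavior. A refusal only on the hosted side points at a serving-layer filter; a refusal in both points at the weights. A sketch, assuming both sit behind OpenAI-compatible endpoints and that refusals can be spotted with a crude string heuristic (all assumptions):

    # Served vs. trained censorship probe. Endpoints, model names, and
    # the refusal markers are illustrative assumptions, not verified.
    from openai import OpenAI

    hosted = OpenAI(base_url="https://api.deepseek.com", api_key="<key>")
    local = OpenAI(base_url="http://localhost:8000/v1", api_key="unused")

    REFUSAL_MARKERS = ("beyond my current scope", "cannot assist")

    def refuses(client: OpenAI, model: str, prompt: str) -> bool:
        resp = client.chat.completions.create(
            model=model, messages=[{"role": "user", "content": prompt}],
        )
        text = (resp.choices[0].message.content or "").lower()
        return any(m in text for m in REFUSAL_MARKERS)

    for prompt in ("<neutral control>", "<sensitive test prompt>"):
        print(prompt,
              "hosted:", refuses(hosted, "deepseek-reasoner", prompt),
              "local:", refuses(local, "deepseek-r1", prompt))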

pgkr [No.42919810]
Hi! Thanks for writing this. We ran our own analysis on the full 671B model and got some pretty interesting results: https://news.ycombinator.com/item?id=42918935

Please reach out to us if you'd like to look at the dataset.