(github.com)

534 points BlueFalconHD | 2 comments | 06 Jul 25 19:50 UTC | HN request time: 0.446s | source

I managed to reverse engineer the encryption (refered to as “Obfuscation” in the framework) responsible for managing the safety filters of Apple Intelligence models. I have extracted them into a repository. I encourage you to take a look around.

1. jjani ◴[07 Jul 25 07:10 UTC] No.44487505[source]▶

>>44483485 (OP) #

Did you only extract the English versions or is this as usual another case where big tech only cares to censor in English?

replies(1): >>44487660 #

2. jeroenhd ◴[07 Jul 25 07:36 UTC] No.44487660[source]▶

>>44487505 (TP) #

It also contains some German(-speaking) locales to filter out things like Fuhrer and Führer. But the filters are so scarce and there are magical phrases are so prevalent that I think this is mostly test code at the moment.

↑

I extracted the safety filters from Apple Intelligence models