I managed to reverse engineer the encryption (referred to as “Obfuscation” in the framework) responsible for managing the safety filters of Apple Intelligence models. I have extracted them into a repository. I encourage you to take a look around.
I find it funny that AGI is supposed to be right around the corner, while these supposedly super smart LLMs still need to get their outputs filtered by regexes.
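For anyone unfamiliar, this kind of post-hoc filtering is usually just pattern matching over the model's generated text. A toy sketch of the idea (every pattern and rule here is made up for illustration; this is not Apple's actual implementation):

```python
import re

# Hypothetical filter rules -- invented for this example, not extracted
# from any real framework.
BLOCKED_PATTERNS = [
    re.compile(r"\bforbidden phrase\b", re.IGNORECASE),
]
REPLACEMENTS = {
    re.compile(r"\bbadword\b", re.IGNORECASE): "[redacted]",
}

def filter_output(text):
    """Return filtered text, or None if the output should be blocked outright."""
    # Hard-block rules: reject the entire generation on a match.
    for pattern in BLOCKED_PATTERNS:
        if pattern.search(text):
            return None
    # Soft rules: substitute matched spans in place.
    for pattern, replacement in REPLACEMENTS.items():
        text = pattern.sub(replacement, text)
    return text

print(filter_output("This contains badword here."))      # substituted
print(filter_output("Totally forbidden phrase ahead."))  # blocked -> None
```

The point being: however sophisticated the model, the last line of defense is often a list of regexes run over its output.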
Sure, but models also can't see any truth on their own. They're literally butchered and lobotomized with filters and such. Even high-IQ people struggle to reach certain truths after a lot of reading; how are these models going to find them with so many filters?
What is this truth you speak of? My point is that a generative model will output things that some people don't like. If it's in a product that I make, I don't want it "saying" things that don't align with my beliefs.