https://github.com/BlueFalconHD/apple_generative_model_safet...
https://github.com/BlueFalconHD/apple_generative_model_safet...
Which as a phenomenon is so very telling that no one actually cares what people are really saying. Everyone, including the platforms knows what that means. It's all performative.
At what point do the new words become the actual words? Are there many instances of people using unalive IRL?
I'm imagining a new exploit: After someone says something totally innocent, people gang up in the comments to act like a terrible vicious slur has been said, and then the moderation system (with an LLM involved somewhere) "learns" that an arbitrary term is heinous eand indirectly bans any discussion of that topic.
If the bigots start using "thank you" as some code word, should we stop saying it, lest we pollute our non-bigoted discussions?
bigots drink coffee too, maybe we should stop drinking it, because something-something...
And that symbol was 100% associated with the Nazis in the West in the 20th century. Nobody used it at the time before the war for anything else, except some tiny fringe.
If it was some mainstream symbol or idiom, merely co-adopted, we'd probably still be using it too.
If the Nazis used the cross for example,people wouldn't stop using the sign of the cross.