
46 points petethomas | 1 comment
HPsquared ◴[] No.44397360[source]
How can anything be good without the awareness of evil? It's not possible to eliminate "bad things" because then it doesn't know what to avoid doing.

EDIT: "Waluigi effect"

replies(6): >>44397568 #>>44397709 #>>44397777 #>>44397941 #>>44398976 #>>44401411 #
dghlsakjg ◴[] No.44397777[source]
The LLM wasn't just aware of antisemitism, it advocated for it. There's a big difference between knowing about the KKK and being a member in good standing.

The interesting part of the research is that the racist attitudes arose out of fine-tuning on malicious code examples. It's like going to a security workshop and having the malicious code examples be the impetus to join the KKK.

replies(2): >>44397820 #>>44397824 #
1. rob_c ◴[] No.44397820[source]
According to the same article, it also advocated for the extermination of the "white race", i.e. it didn't have a problem with killing off groups as a concept...