(www.wsj.com)

46 points petethomas | 1 comments | 27 Jun 25 14:16 UTC | HN request time: 0.197s | source

Show context

HPsquared ◴[27 Jun 25 15:07 UTC] No.44397360[source]▶

How can anything be good without the awareness of evil? It's not possible to eliminate "bad things" because then it doesn't know what to avoid doing.

EDIT: "Waluigi effect"

replies(6): >>44397568 #>>44397709 #>>44397777 #>>44397941 #>>44398976 #>>44401411 #

1. accrual ◴[27 Jun 25 15:30 UTC] No.44397568[source]▶

>>44397360 #

Also yin and yang. Models should be aware of hate and anti-social topics and training data. Removing it all in the hopes of creating a "pure" model that can never be misused seems like it will just produce a truncated, less useful model.

↑

The Monster Inside ChatGPT