(github.com)

745 points melded | 2 comments | 16 Nov 25 15:00 UTC

1. SilverElfin ◴[16 Nov 25 17:31 UTC] No.45946791[source]▶

How do you remove censorship that appears due to the biased selection of training data?

2. melded ◴[17 Nov 25 14:37 UTC] No.45953947[source]▶

In that case you'd need to do actual training or fine-tuning on a dataset that covers the things that were left out of the original training data.


Heretic: Automatic censorship removal for language models