Heretic: Automatic censorship removal for language models (github.com)
745 points | melded | 1 comment | 16 Nov 25 15:00 UTC
krackers | 17 Nov 25 01:23 UTC | No. 45950075 | >>45945587 (OP)
https://www.lesswrong.com/posts/jGuXSZgv6qfdhMCuJ/refusal-in...
provides more detailed information on the theory behind abliteration.