/top/
/new/
/best/
/ask/
/show/
/job/
^
slacker news
login
about
←back to thread
Refusal in LLMs is mediated by a single direction
(www.lesswrong.com)
110 points
veryluckyxyz
| 1 comments |
03 May 24 00:55 UTC
|
HN request time: 0.41s
|
source
1.
luke-stanley
◴[
03 May 24 19:46 UTC
]
No.
40251554
[source]
▶
>>40242939 (OP)
#
The Classifier-Free Guidance (CFG) feature in llama.cpp likely acts as a built-in way to do something like using the "reverse-prompt" / "cfg-negative-prompt" flags in "main".
ID:
GO
↑