
210 points | lapnect | 1 comment
1. wslh | No.42166323
If I recall correctly, there is a proof (or at least a conjecture) that it is impossible to build an "LLM firewall" that protects against all possible prompts, though I may be misremembering. Searching for resources on this topic turns up papers like [1].

[1] https://arxiv.org/abs/2406.03198