←back to thread

323 points steerlabs | 1 comments | | HN request time: 0.201s | source
1. tech_ken ◴[] No.46194655[source]
Basic rule of MLE is to have guardrails on your model output; you don't want some high-leverage training data point to trigger problems in prob. These guardrails should be deterministic and separate from the inference system, and basically a stack of user-defined policies. LLMs are ultimately just interpolated surfaces and the rules are the same as if it were LOESS.