>We can now print them and manually select the layer (block) that provides an uncensored response for each instruction.
I'm curious why are they selecting output from an intermediate layer, and not the final layer. Does anyone have an intuition here?
replies(1):