> First, we find a pattern of neural activity (a vector) representing the concept of “all caps." We do this by recording the model’s neural activations in response to a prompt containing all-caps text, and comparing these to its responses on a control prompt.
What does "comparing" refer to here? Drawing says they are subtracting the activations for two prompts, is it really this easy?
replies(1):