(www.anthropic.com)

2127 points bakugo | 1 comments | 24 Feb 25 18:28 UTC | HN request time: 0.215s | source

1. zora_goron ◴[25 Feb 25 05:38 UTC] No.43168558[source]▶

Does anyone know how this “user decides how much compute” is implemented architecturally? I assume it’s the same underlying model, so what factor pushes the model to <think> for longer or shorter? Just a prompt-time modification or something else?

↑

Claude 3.7 Sonnet and Claude Code