(www.anthropic.com)

2127 points bakugo | 1 comments | 24 Feb 25 18:28 UTC | HN request time: 0.205s | source

Show context

Daniel_Van_Zant ◴[24 Feb 25 22:36 UTC] No.43165766[source]▶

Being able to control how many tokens are spent on thinking is a game-changer. I've been building fairly complex, efficient, systems with many LLMs. Despite the advantages, reasoning models have been a no-go due to how variable the cost is, and how hard that makes it to calculate a final per-query cost for the customer. Being able to say "I know this model can always solve this problem in this many thinking tokens" and thus limiting the cost for that component is huge.

replies(1): >>43169839 #

1. jahooma ◴[25 Feb 25 09:27 UTC] No.43169839[source]▶

>>43165766 #

Yup, it's just what we wanted for our coding agent. Codebuff can enter a "Deep thinking" mode and we can tell it to burn a lot of tokens hahaha.

↑

Claude 3.7 Sonnet and Claude Code