←back to thread

71 points kristianp | 5 comments | | HN request time: 1.199s | source
Show context
extr ◴[] No.44611097[source]
Have been using this for awhile, I'm on the $100/mo Max plan and have been running $600-800/mo in terms of usage, and I'm hardly pushing it to the limits (missing lots of billing windows).

It makes me wonder what Anthropic's true margins are. I could believe they are overcharging via the API, Sonnet is $3/$15/Mtok and Opus at an ABSURD $15/$75/Mtok. But to break even for me, that would mean that they're overcharging by a factor 5x-10x, which doesn't seem possible. Is the music going to stop for Claude Code the same way it did for Cursor? I have to imagine every incentive in the world is pushing them to lower inference cost rather than introduce stricter limits, and unlike Cursor they can actually can reach into their stack and do this. But I'm not sure they're capable of miracles.

Regardless, I'm bullish Anthropic. Sonnet and Opus don't benchmark as well as O3/Grok4 at pure coding, and aren't as cheap as Kimi K2 for theoretically similar perf, but as any user knows they are top tier at instruction following, highly reliable and predictable, and have a certain intangible theory of mind that is unique to Anthropic.

replies(5): >>44611164 #>>44611202 #>>44611353 #>>44611401 #>>44611566 #
1. swyx ◴[] No.44611401[source]
any dropoff in usage limits for you recently? https://techcrunch.com/2025/07/17/anthropic-tightens-usage-l...

cant help but think that you guys yelling about it so loudly from the rooftops is really really not helping your case lol

replies(3): >>44611453 #>>44611650 #>>44611945 #
2. core-utility ◴[] No.44611453[source]
I started on the Pro plan a week ago and was already contemplating jumping to Max. When I hit a limit yesterday I upgraded to Max and hit a limit again before seeing the news of the changed usage limit.

For what it's worth, everything seems fixed today.

3. ghuntley ◴[] No.44611650[source]
Exactly, swyx. Any flat rate pricing plan is effectively a bet against the future. It's a grab for engineers that's subsidised. Now, the problem is that GPUs are expensive; they are a costly resource to use. Inferencing is expensive.

So what happens is inevitable:

- Wild promises of unlimited usage and consumers feeling tricked when the impossible is impossible to deliver (Cursor pricing changes).

- Quasi-unlimited usage with rate-caps, but the models get quantised to all hell? [search Twitter for folks reporting Claude feels dumber around/near outages].

- Engineers sharing tools and techniques on how to squeeze pounds out of a flat-rate plan (original post), which results in more power users doing that, which puts more pressure on margins.

In goose meme format, "What are the margins?"

https://x.com/GeoffreyHuntley/status/1945636266009399414

replies(1): >>44612258 #
4. extr ◴[] No.44611945[source]
People have been saying this since it first came out. I don’t doubt there are occasional bugs/service disruptions but personally I really doubt Anthropic is silently decreasing the limits.
5. kristianp ◴[] No.44612258[source]
> Any flat rate pricing plan is effectively a bet against the future

How quickly we forget Moore's law, or at least what has replaced Moore's law.