Most active commenters

Are OpenAI and Anthropic losing money on inference?

(martinalderson.com)

Show context

simonw ◴[28 Aug 25 16:20 UTC] No.45054022[source]▶

https://www.axios.com/2025/08/15/sam-altman-gpt5-launch-chat... quotes Sam Altman saying:

> Most of what we're building out at this point is the inference [...] We're profitable on inference. If we didn't pay for training, we'd be a very profitable company.

replies(6): >>45054061 #>>45054069 #>>45054101 #>>45054102 #>>45054593 #>>45054858 #

dcre ◴[28 Aug 25 16:23 UTC] No.45054061[source]▶

>>45054022 #

ICYMI, Amodei said the same in much greater detail:

"If you consider each model to be a company, the model that was trained in 2023 was profitable. You paid $100 million, and then it made $200 million of revenue. There's some cost to inference with the model, but let's just assume, in this cartoonish cartoon example, that even if you add those two up, you're kind of in a good state. So, if every model was a company, the model, in this example, is actually profitable.

What's going on is that at the same time as you're reaping the benefits from one company, you're founding another company that's much more expensive and requires much more upfront R&D investment. And so the way that it's going to shake out is this will keep going up until the numbers go very large and the models can't get larger, and then it'll be a large, very profitable business, or, at some point, the models will stop getting better, right? The march to AGI will be halted for some reason, and then perhaps it'll be some overhang. So, there'll be a one-time, 'Oh man, we spent a lot of money and we didn't get anything for it.' And then the business returns to whatever scale it was at."

https://cheekypint.substack.com/p/a-cheeky-pint-with-anthrop...

replies(9): >>45054612 #>>45054646 #>>45054678 #>>45054731 #>>45054753 #>>45054819 #>>45055347 #>>45055378 #>>45055855 #

1. meshugaas ◴[28 Aug 25 17:27 UTC] No.45054753[source]▶

>>45054061 #

The "model as company" metaphor makes no sense. It should actually be models are products, like a shoe. Nike spends money developing a shoe, then building it, then they sell it, and ideally those R&D costs are made up in shoe sales. But you still have to run the whole company outside of that.

Also, in Nike's case, as they grow they get better at making more shoes for cheaper. LLM model providers tell us that every new model (shoe) costs multiples more than the last one to develop. If they make 2x revenue on training, like he's said, to be profitable they have to either double prices or double users every year, or stop making new models.

replies(6): >>45055103 #>>45055124 #>>45055405 #>>45055669 #>>45055686 #>>45055797 #

2. renjimen ◴[28 Aug 25 18:02 UTC] No.45055103[source]▶

>>45054753 (TP) #

But new models to date have cost more than the previous ones to create, often by an order of magnitude, so the shoe metaphor falls apart.

A better metaphor would be oil and gas production, where existing oil and gas fields are either already finished (i.e. model is no longer SOTA -- no longer making a return on investment) or currently producing (SOTA inference -- making a return on investment). The key similarity with AI is new oil and gas fields are increasingly expensive to bring online because they are harder to make economical than the first ones we stumbled across bubbling up in the desert, and that's even with technological innovation. That is to say, the low hanging fruit is long gone.

replies(2): >>45055795 #>>45056658 #

3. pegasus ◴[28 Aug 25 18:04 UTC] No.45055124[source]▶

>>45054753 (TP) #

If you're going to use shoes as the metaphor, a model would be more like a shoe factory. A shoe would be a LLM answer, i.e. inference. In which case it totally makes sense to consider each factory as an autonomous economic unit, like a company.

4. true_religion ◴[28 Aug 25 18:31 UTC] No.45055405[source]▶

>>45054753 (TP) #

It's model as a company because people are using the VC mentality, and also explaining competition.

Model as a product is the reality, but each model competes with previous models and is only successful if it's both more cost effective, and also more effective in general at its tasks. By the time you get to model Z, you'll never use model A for any task as the model lineage cannibalizes sales of itself.

5. vonneumannstan ◴[28 Aug 25 18:54 UTC] No.45055669[source]▶

>>45054753 (TP) #

>Also, in Nike's case, as they grow they get better at making more shoes for cheaper.

This is clearly the case for models as well. Training and serving inference for GPT4 level models is probably > 100x cheaper than they used to be. Nike has been making Jordan 1's for 40+ years! OpenAI would be incredibly profitable if they could live off the profit from improved inference efficiency on a GPT4 level model!

replies(1): >>45055786 #

6. skybrian ◴[28 Aug 25 18:55 UTC] No.45055686[source]▶

>>45054753 (TP) #

Analogies don't prove anything, but they're still useful for suggesting possibilities for thinking about a problem.

If you don't like "model as company," how about "model as making a movie?" Any given movie could be profitable or not. It's not necessarily the case that movie budgets always get bigger or that an increased budget is what you need to attract an audience.

7. Avshalom ◴[28 Aug 25 19:05 UTC] No.45055786[source]▶

>>45055669 #

>>This is clearly the case ... probably

>>OpenAI would be incredibly profitable if they could live off the profit from improved inference efficiency on a GPT4 level model!

If gpt4 was basically free money at this point it's real weird that their first instinct was to cut it off after gpt5

replies(2): >>45056030 #>>45057555 #

8. meshugaas ◴[28 Aug 25 19:06 UTC] No.45055795[source]▶

>>45055103 #

exactly: it’s like making shoes if you’re really bad at making shoes :)

9. Szpadel ◴[28 Aug 25 19:06 UTC] No.45055797[source]▶

>>45054753 (TP) #

I believe better analogy is CPU development on next process node.

each node is much more expensive to design for, but when you finally have it you basically print money.

and of course you always have to develop next more powerful and power efficient CPU to keep competitive

10. dcre ◴[28 Aug 25 19:26 UTC] No.45056030{3}[source]▶

>>45055786 #

I think the idea here is that gpt-5-mini is the cheap gpt-4 quality model they want to serve and make money on.

11. runako ◴[28 Aug 25 20:25 UTC] No.45056658[source]▶

>>45055103 #

> new models to date have cost more than the previous ones to create

This largely was the case in software in the '80s-'10s (when versions largely disappeared) and still is the case in hardware. iPhone 17 will certainly cost far more to develop than did iPhone 10 or 5. iPhone 5 cost far more than 3G, etc.

replies(1): >>45057040 #

12. Romario77 ◴[28 Aug 25 21:06 UTC] No.45057040{3}[source]▶

>>45056658 #

I don't think it's the case if you take inflation into account.

You could see here: https://www.reddit.com/r/dataisbeautiful/comments/16dr1kb/oc...

new ones are generally cheaper if adjusted for inflation. This is a sale price, but assuming that margins stay the same it should reflect the manufacturing price. And from what I remember about apple earnings their margins increased over time, so it means the new phones are even cheaper. Which kind of makes sense.

replies(1): >>45057335 #

13. runako ◴[28 Aug 25 21:38 UTC] No.45057335{4}[source]▶

>>45057040 #

I should have addressed this. This thread is about the capital costs of getting to the first sale, so that's model training for an LLM vs all the R&D in an iPhone.

Recent iPhones use Apple's own custom silicon for a number of components, and are generally vastly more complex. The estimates I have seen for iPhone 1 development range from $150 million to $2.5 billion. Even adjusting for inflation, a current iPhone generation costs more than the older versions.

And it absolutely makes sense for Apple to spend more in total to develop successive generations, because they have less overall product risk and larger scale to recoup.

14. steveklabnik ◴[28 Aug 25 22:01 UTC] No.45057555{3}[source]▶

>>45055786 #

> If gpt4 was basically free money at this point it's real weird that their first instinct was to cut it off after gpt5

People find the UX of choosing a model very confusing, the idea with 5 is that it would route things appropriately and so eliminate this confusion. That was the motivation for removing 4. But people were upset enough that they decided to bring it back for a while, at least.

replies(1): >>45059472 #

15. solarkraft ◴[29 Aug 25 02:39 UTC] No.45059472{4}[source]▶

>>45057555 #

They picked the worst possible time to make the change if money wasn’t involved (which is why I assumed GPT-5 must be massively cheaper to run). The backlash from being forced to use it cost a fair bit of the model’s reputation.

replies(1): >>45059733 #

16. steveklabnik ◴[29 Aug 25 03:13 UTC] No.45059733{5}[source]▶

>>45059472 #

Yeah it didnt work out for them, for sure.

↑