507 points martinald | 11 comments
    _sword ◴[] No.45055003[source]
    I've done the modeling on this a few times and I always get to a place where inference can run at 50%+ gross margins, depending mostly on GPU depreciation and how good the host is at optimizing utilization. The challenge for the margins is whether or not you consider model training costs as part of the calculation. If model training isn't capitalized + amortized, margins are great. If they are amortized and need to be considered... yikes
    replies(7): >>45055030 #>>45055275 #>>45055536 #>>45055820 #>>45055835 #>>45056242 #>>45056523 #
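A toy version of that kind of model (every number here is an illustrative assumption, not anyone's actual figures) shows how margins depend on depreciation, utilization, and whether training is amortized in:

```python
# Toy inference gross-margin model. All inputs are illustrative assumptions.
def gross_margin(revenue_per_gpu_hour, gpu_cost, depreciation_years,
                 utilization, power_cost_per_hour,
                 amortized_training_per_gpu_hour=0.0):
    """Gross margin per effective (utilized) GPU-hour of inference."""
    hours = depreciation_years * 365 * 24
    depreciation_per_hour = gpu_cost / hours
    # Idle hours still incur depreciation and power, so divide by utilization.
    cost = (depreciation_per_hour + power_cost_per_hour) / utilization
    cost += amortized_training_per_gpu_hour
    return (revenue_per_gpu_hour - cost) / revenue_per_gpu_hour

# Training ignored: ~55% gross margin with these made-up numbers.
print(gross_margin(5.00, gpu_cost=30_000, depreciation_years=4,
                   utilization=0.6, power_cost_per_hour=0.50))

# Fold in an amortized training charge per GPU-hour: margin nearly vanishes.
print(gross_margin(5.00, gpu_cost=30_000, depreciation_years=4,
                   utilization=0.6, power_cost_per_hour=0.50,
                   amortized_training_per_gpu_hour=2.50))
```

The sign of the result flips entirely on that last parameter, which is the whole "capitalize vs. amortize" debate in one line.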
    BlindEyeHalo ◴[] No.45055275[source]
    Why wouldn't you factor in training? It is not like you can train once and then have the model run for years. You need to constantly improve to keep up with the competition. The lifespan of a model is just a few months at this point.
    replies(7): >>45055303 #>>45055495 #>>45055624 #>>45055631 #>>45056110 #>>45056973 #>>45057517 #
    1. jacurtis ◴[] No.45057517[source]
In a recent episode of the Hard Fork podcast, the hosts discussed an on-the-record conversation they had with Sam Altman of OpenAI. They asked him about profitability, and he claimed they are losing money mostly because of the cost of training, but that as the models advance, they will need to train less and less. Once you take training out of the equation, he claimed they were profitable based on the cost of serving the trained foundation models to users at current prices.

Now, when he said that, his CFO corrected him: they aren't profitable, but "it's close".

Take that with a grain of salt, but that's a conversation with one of the big AI companies that is only a few weeks old. I suspect it's pretty accurate that pricing is currently reasonable if you ignore training. But training is very expensive, and it's the reason most AI companies are losing money right now.

    replies(4): >>45057639 #>>45057962 #>>45060581 #>>45061058 #
    2. pas ◴[] No.45057639[source]
    > most AI companies are losing money right now

which is completely "normal" at this point, """right"""? If you have billions of VC money chasing returns, there's no time to sit around; it's all in, and the hype train doesn't wait for bootstrapped profitability. And of course, with these gargantuan valuations and mandatory YoY growth numbers, there is no way they aren't fudging the unit-economics numbers too. (Biases are hard to beat, especially if there's not much conscious effort to do so.)

    replies(1): >>45058223 #
    3. dgfitz ◴[] No.45057962[source]
    > But as the model advances, they will train less and less.

    They sure have a lot of training to do between now and whenever that happens. Rolling back from 5 to whatever was before it is their own admission of this fact.

    replies(1): >>45058471 #
    4. brianwawok ◴[] No.45058223[source]
Does the cost of goods come down 10x or not? For Uber, say, it didn't, so we went from the great $6 VC-funded product to the mediocre $24 ride we have today. I'm not sure I'm going to use Copilot at $1 per request, or even $0.25. That starts to approach an overseas consultant in price and ability.
    5. mindwok ◴[] No.45058471[source]
    I think that actually proves the opposite. People wanted an old model, not a new one, indicating that for that user base they could have just... not trained a new model.
    replies(3): >>45058933 #>>45060288 #>>45060618 #
    6. jazzyjackson ◴[] No.45058933{3}[source]
    for their user base, sure

    for their investors, however, they are promising a revolution

    7. hnfsfdsd ◴[] No.45060288{3}[source]
If people want old models, they can go to any of the competitors': DeepSeek, Claude, open-source models, etc. That's not good news for OpenAI.
    8. anothernewdude ◴[] No.45060581[source]
    Unfortunately for those companies, their APIs are a commodity, and are very fungible. So they'll need to keep training or be replaced with whichever competitor will. This is an exercise in attrition.
    replies(1): >>45062777 #
    9. PeterStuer ◴[] No.45060618{3}[source]
That is true for a very specific class of use cases. If they turned up the sycophancy on the new model, those people would not call for the old one.

    The reasoning here is off. It is like saying new game development is nearly over as some people keep playing old games.

My feeling: we've barely scratched the surface on the mileage we can get out of even today's frontier models, and we are just at the beginning of a huge runway of improved models and architectures. Watch this space.

    10. diamond559 ◴[] No.45061058[source]
    You lost me at "Sam Altman says".
    11. LadyCailin ◴[] No.45062777[source]
I wonder if we're reaching a point of diminishing returns with training, at least when just scaling the data set. There's a finite amount of information (that can be obtained reasonably) to train on, and I think we're already at a sizable chunk of it, not to mention the cost of naively scaling up. My guess is that the ultimate winner will be the one that figures out how to improve without massive training costs, through better algorithms, or maybe even just better hardware (i.e. memristors). We know that in the worst case, we should be able to build something with human-level intelligence that runs on about 20 watts, is about the size of a human head, and only needs to ingest a small slice of all available information. And training should only use about 3.5 MWh total, on the same hardware that runs the model.
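The 3.5 MWh figure is just the brain's ~20 W power draw integrated over roughly 20 years of development (the assumed "training" period); a quick sanity check:

```python
# Back-of-the-envelope energy budget for human "training".
# Assumptions: ~20 W continuous brain power, ~20 years of development.
power_w = 20
years = 20
hours = years * 365 * 24          # total hours of "training"
energy_mwh = power_w * hours / 1e6  # watt-hours -> megawatt-hours
print(energy_mwh)  # ~3.5 MWh
```

Compare that with frontier-model training runs, which are commonly estimated in the gigawatt-hour range.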