623 points magicalhippo | 89 comments
Karupan ◴[] No.42619320[source]
I feel this is bigger than the 5x series GPUs. Given the craze around AI/LLMs, this can also potentially eat into Apple’s slice of the enthusiast AI dev segment once the M4 Max/Ultra Mac minis are released. I sure wish I held some Nvidia stock; they seem to have been doing everything right in the last few years!
replies(21): >>42619339 #>>42619433 #>>42619472 #>>42619544 #>>42619769 #>>42620175 #>>42620289 #>>42620359 #>>42620740 #>>42621569 #>>42621821 #>>42622149 #>>42622154 #>>42622259 #>>42622359 #>>42622567 #>>42622577 #>>42622621 #>>42622863 #>>42627093 #>>42627188 #
1. dagmx ◴[] No.42619339[source]
I think the enthusiast side of things is a negligible part of the market.

That said, enthusiasts do help drive a lot of the improvements to the tech stack so if they start using this, it’ll entrench NVIDIA even more.

replies(7): >>42619397 #>>42619404 #>>42619430 #>>42619479 #>>42619510 #>>42619885 #>>42621646 #
2. option ◴[] No.42619397[source]
today’s enthusiast, grad student, hacker is tomorrow’s startup founder, CEO, CTO or 10x contributor in large tech company
replies(1): >>42620210 #
3. computably ◴[] No.42619404[source]
Yeah, it's more about preempting competitors from attracting any ecosystem development than the revenue itself.
4. VikingCoder ◴[] No.42619430[source]
If I were NVidia, I would be throwing everything I could at making entertainment experiences that need one of these to run...

I mean, this is awfully close to being "Her" in a box, right?

replies(2): >>42619453 #>>42620410 #
5. dagmx ◴[] No.42619453[source]
I feel like a lot of people miss that Her was a dystopian future, not an ideal to hit.

Also, it’s $3000. For that you could buy subscriptions to OpenAI etc and have the dystopian partner everywhere you go.

replies(9): >>42619500 #>>42619529 #>>42619627 #>>42619642 #>>42620406 #>>42620519 #>>42621356 #>>42625745 #>>42627519 #
6. qwertox ◴[] No.42619479[source]
You could have said the same about gamers buying expensive hardware in the 00's. It's what made Nvidia big.
replies(2): >>42620002 #>>42620925 #
7. tacticus ◴[] No.42619500{3}[source]
they don't miss that part. they just want to be the evil character.
8. Karupan ◴[] No.42619510[source]
I’m not so sure it’s negligible. My anecdotal experience is that since Apple Silicon chips were found to be “ok” enough to run inference with MLX, more non-technical people in my circle have asked me how they can run LLMs on their macs.

Surely a smaller market than gamers or datacenters, though.

replies(3): >>42619637 #>>42620854 #>>42622080 #
9. t0lo ◴[] No.42619529{3}[source]
The dystopian overton window has shifted, didn't you know, moral ambiguity is a win now? :) Tesla was right.
10. VikingCoder ◴[] No.42619627{3}[source]
We already live in dystopian hell and I'd like to have Scarlett Johansson whispering in my ear, thanks.

Also, I don't particularly want my data to be processed by anyone else.

11. dagmx ◴[] No.42619637[source]
I mean negligible to their bottom line. There may be tons of units bought or not, but the margin on a single datacenter system would buy tens of these.

It’s purely an ecosystem play imho. It benefits the kind of people who will go on to make potentially cool things and will stay loyal.

replies(3): >>42619863 #>>42621450 #>>42623624 #
12. croes ◴[] No.42619642{3}[source]
OpenAI doesn’t make any profit. So either it dies or prices go up. Not to mention the privacy aspect of your own machine and the freedom of choice which models to run
replies(2): >>42620369 #>>42623621 #
13. htrp ◴[] No.42619863{3}[source]
>It’s purely an ecosystem play imho. It benefits the kind of people who will go on to make potentially cool things and will stay loyal.

100%

The people who prototype on a 3k workstation will also be the people who decide how to architect for a 3k GPU buildout for model training.

14. gr3ml1n ◴[] No.42619885[source]
AMD thought the enthusiast side of things was a negligible side of the market.
replies(1): >>42620274 #
15. Cumpiler69 ◴[] No.42620002[source]
There's a lot more gamers than people wanting to play with LLMs at home.
replies(2): >>42620074 #>>42620091 #
16. anonylizard ◴[] No.42620074{3}[source]
There's a titanic market of people wanting some uncensored local LLM/image/video generation model. This market overlaps heavily with gamers today, but will grow exponentially every year.
replies(6): >>42620079 #>>42620868 #>>42621277 #>>42621615 #>>42622007 #>>42622468 #
17. Cumpiler69 ◴[] No.42620079{4}[source]
How big is that market you claim? Local LLM image generation already exists out of the box on the latest Samsung flagship phones, and it's mostly a gimmick that gets old pretty quickly. Hardly comparable to gaming in terms of market size and profitability.

Plus, YouTube and Google Images are already full of AI-generated slop and people are already tired of it. "AI fatigue" amongst the majority of general consumers is a documented thing. Gaming fatigue is not.

replies(2): >>42620282 #>>42622670 #
18. estebarb ◴[] No.42620091{3}[source]
Sure, but those developers will create functionality that will require advanced GPUs, and people will want that functionality. Eventually the OS will expect it and it will become the default everywhere. So it is an important step that will keep Nvidia growing in the following years.
19. Mistletoe ◴[] No.42620210[source]
> tomorrow’s startup founder, CEO, CTO or 10x contributor in large tech company

Do we need more of those? We need plumbers and people that know how to build houses. We are completely full on founders and executives.

replies(2): >>42621629 #>>42633300 #
20. dagmx ◴[] No.42620274[source]
That’s not what I’m saying. I’m saying that the people buying this aren’t going to shift their bottom line in any kind of noticeable way. They’re already sold out of their money makers. This is just an entrenchment opportunity.
21. madwolf ◴[] No.42620282{5}[source]
I think he implied AI-generated porn. Perhaps also other kinds of images that are at odds with morality and/or the law. I'm not sure, but Samsung phones probably don't let you do that.
22. blackoil ◴[] No.42620369{4}[source]
> So either it dies or prices go up.

Or efficiency gains in hardware and software catch up, making the current price point profitable.

replies(1): >>42627105 #
23. int_19h ◴[] No.42620406{3}[source]
This is exactly the scenario where you don't want "the cloud" anywhere.
24. int_19h ◴[] No.42620410[source]
The real interesting stuff will happen when we get multimodal LMs that can do VR output.
25. nostromo ◴[] No.42620519{3}[source]
Fun fact: Her was set in the year 2025.
replies(1): >>42621870 #
26. stuaxo ◴[] No.42620854[source]
It's annoying. I do LLMs for work and have a bit of an interest in them and in doing stuff with GANs etc.

I have a bit of an interest in games too.

If I could get one platform for both, I could justify 2k maybe a bit more.

I can't justify that for just one half: running games on Mac, right now via Linux: no thanks.

And on the PC side, nvidia consumer cards only go to 24gb which is a bit limiting for LLMs, while being very expensive - I only play games every few months.

replies(3): >>42621585 #>>42622473 #>>42622505 #
27. stuaxo ◴[] No.42620868{4}[source]
Apart from the uncensored bit, I'm in this small market.

Do I buy a MacBook with a silly amount of RAM when I only want to mess with images occasionally?

Do I get a big Nvidia card, topping out at 24gb - still small for some LLMs, but I could occasionally play games using it at least?

28. spaceman_2020 ◴[] No.42620925[source]
I keep thinking about stocks that have 100xd, and most seemed like obscure names to me as a layman. But man, Nvidia was a household name to anyone that ever played any game. And still so many of us never bothered buying the stock

Incredible fumble for me personally as an investor

replies(3): >>42621403 #>>42621650 #>>42623817 #
29. otabdeveloper4 ◴[] No.42621277{4}[source]
> There's a titanic market with people wanting some uncensored local LLM/image/video generation model.

No. There's already too much porn on the internet, and AI porn is cringe and will get old very fast.

replies(4): >>42622387 #>>42622503 #>>42623432 #>>42623850 #
30. smt88 ◴[] No.42621356{3}[source]
If Silicon Valley could tell the difference between utopias and dystopias, we wouldn't have companies named Soylent or iRobot, and the recently announced Anduril/Palantir/OpenAI partnership to hasten the creation of either SkyNet or Big Brother wouldn't have happened at all.
replies(1): >>42630110 #
31. motoxpro ◴[] No.42621403{3}[source]
Unless you predicted AI and crypto, it was just really good, not 100x. It went 20x from 2005-2020 but ~500x from 2005-2025.

And if you truly did predict that Nvidia would own those markets and those markets would be massive, you could have also bought Amazon, Google or heck even Bitcoin. Anything you touched in tech really would have made you a millionaire.
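
To put the two multiples quoted above on a comparable footing, they can be converted to annualized rates. This is only a rough sketch: the 20x and ~500x figures and the date ranges are taken from the comment as stated and are not independently verified, and `annualized_growth` is a hypothetical helper name.

```python
def annualized_growth(multiple: float, years: float) -> float:
    """Compound annual growth rate implied by a total return multiple."""
    return multiple ** (1.0 / years) - 1.0


# Figures as quoted in the comment (assumed, not verified).
rate_2005_2020 = annualized_growth(20, 2020 - 2005)
rate_2005_2025 = annualized_growth(500, 2025 - 2005)

# The last five years alone account for the remaining 500 / 20 = 25x.
rate_2020_2025 = annualized_growth(500 / 20, 2025 - 2020)

print(f"2005-2020: {rate_2005_2020:.1%} per year")
print(f"2005-2025: {rate_2005_2025:.1%} per year")
print(f"2020-2025: {rate_2020_2025:.1%} per year")
```

Under those quoted figures, most of the gap comes from the final five years, which is the comment's point about AI and crypto.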

replies(2): >>42623557 #>>42624833 #
32. mrlongroots ◴[] No.42621450{3}[source]
> It’s purely an ecosystem play imho. It benefits the kind of people who will go on to make potentially cool things and will stay loyal.

It will be massive for research labs. Most academics have to jump through a lot of hoops to get to play with not just CUDA, but also GPUDirect/RDMA/Infiniband etc. If you get older/donated hardware, you may have a large cluster but not newer features.

replies(1): >>42622526 #
33. ◴[] No.42621585{3}[source]
34. itsoktocry ◴[] No.42621615{4}[source]
>There's a titantic market

How so?

Only 40% of gamers use a PC, a portion of those use AI in any meaningful way, and a fraction of those want to set up a local AI instance.

Then someone releases an uncensored, cloud based AI and takes your market?

35. davrosthedalek ◴[] No.42621629{3}[source]
We might not, but Nvidia would certainly like it.
36. epolanski ◴[] No.42621646[source]
If this is gonna be widely used by ML engineers, in biopharma, etc. and they land $1000 margins on half a million sales, that's half a billion in gross profit, with potential to grow.
37. ◴[] No.42621650{3}[source]
38. swat535 ◴[] No.42621870{4}[source]
Boring fact: The underlying theme of the movie Her is actually divorce and the destructive impact it has on people, the futuristic AI stuff is just for stuffing!
replies(1): >>42622366 #
39. weregiraffe ◴[] No.42622007{4}[source]
>There's a titanic market

Titanic - so about to hit an iceberg and sink?

40. moralestapia ◴[] No.42622080[source]
Yes, but people already had their Macs for other reasons.

No one goes to an Apple store thinking "I'll get a laptop to do AI inference".

replies(4): >>42622296 #>>42622421 #>>42622639 #>>42623427 #
41. the_other ◴[] No.42622296{3}[source]
I'm currently wondering how likely it is I'll get into deeper LLM usage, and therefore how much Apple Silicon I need (because I'm addicted to macOS). So I'm some way closer to your steel man than you'd expect. But I'm probably a niche within a niche.
42. AnonymousPlanet ◴[] No.42622366{5}[source]
The overall theme of Her was human relationships. It was not about AI and not just about divorce in particular. The AI was just a plot device to include a bodiless person in the equation. Watch it again with this in mind and you will see what I mean.
replies(1): >>42625252 #
43. ceejayoz ◴[] No.42622387{5}[source]
AI porn is currently cringe, just like Eliza for conversations was cringe.

The cutting edge will advance, and convincing bespoke porn of people's crushes/coworkers/bosses/enemies/toddlers will become a thing. With all the mayhem that results.

replies(1): >>42624219 #
44. JohnBooty ◴[] No.42622421{3}[source]
They have, because until now Apple Silicon was the only practical way for many to work with larger models at home because they can be configured with 64-192GB of unified memory. Even the laptops can be configured with up to 128GB of unified memory.

Performance is not amazing (roughly 4060 level, I think?) but in many ways it was the only game in town unless you were willing and able to build a multi-3090/4090 rig.

replies(1): >>42624005 #
45. JohnBooty ◴[] No.42622468{4}[source]
I'm sure a lot of people see "uncensored" and think "porn" but there's a lot of stuff that e.g. Dall-E won't let you do.

Suppose you're a content creator and you need an image of a real person or something copyrighted like a lot of sports logos for your latest YouTube video's thumbnail. That kind of thing.

I'm not getting into how good or bad that is; I'm just saying I think it's a pretty common use case.

46. wkat4242 ◴[] No.42622473{3}[source]
32gb as of last night :)
47. JohnBooty ◴[] No.42622503{5}[source]
I think there are a lot of non-porn uses. I see a lot of YouTube thumbnails that seem AI generated, but feature copyrighted stuff.

(example: a thumbnail for a YT video about a video game, featuring AI-generated art based on that game. because copyright reasons, in my very limited experience Dall-E won't let you do that)

I agree that AI porn doesn't seem a real market driver. With 8 billion people on Earth I know it has its fans I guess, but people barely pay for porn in the first place so I reallllly dunno how many people are paying for AI porn either directly or indirectly.

It's unclear to me if AI generated video will ever really cross the "uncanny valley." Of course, people betting against AI have lost those bets again and again but I don't know.

48. WaxProlix ◴[] No.42622505{3}[source]
The new $2k card from Nvidia will be 32GB but your point stands. AMD is planning a unified chiplet based GPU architecture (AI/data center/workstation/gaming) called UDNA, which might alleviate some of these issues. It's been delayed and delayed though - hence the lackluster GPU offerings from team Red this cycle - so I haven't been getting my hopes up.

Maybe (LP)CAMM2 memory will make model usage just cheap enough that I can have a hosting server for it and do my usual midrange gaming GPU thing before then.

replies(2): >>42626549 #>>42627370 #
49. ckemere ◴[] No.42622526{4}[source]
Academic minimal-bureaucracy purchasing card limit is about $4k, so pricing is convenient*2.
50. kelsey98765431 ◴[] No.42622639{3}[source]
my $5k m3 max 128gb disagrees
replies(1): >>42623970 #
51. TeMPOraL ◴[] No.42622670{5}[source]
> Gaming fatigues is not.

It is. You may know it as the "I prefer to play board games (and feel smugly superior about it) because they're ${more social, require imagination, $whatever}" crowd.

replies(1): >>42623228 #
52. Cumpiler69 ◴[] No.42623228{6}[source]
The market heavily disagrees with you.

"The global gaming market size was valued at approximately USD 221.24 billion in 2024. It is forecasted to reach USD 424.23 billion by 2033, growing at a CAGR of around 6.50% during the forecast period (2025-2033)"

replies(1): >>42623573 #
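
A quick way to sanity-check a forecast like the one quoted above is to compare the stated CAGR with the rate implied by the two endpoint values. The sketch below uses only the numbers from the quote; treating the span as 9 years (2024 to 2033) is an assumption, and the function names are hypothetical.

```python
def implied_cagr(start_value: float, end_value: float, years: float) -> float:
    """Annual growth rate that takes start_value to end_value in `years`."""
    return (end_value / start_value) ** (1.0 / years) - 1.0


def project(start_value: float, rate: float, years: float) -> float:
    """Value after compounding `rate` for `years`."""
    return start_value * (1.0 + rate) ** years


# Figures from the quoted forecast, in billions of USD.
start_2024 = 221.24
end_2033 = 424.23
years = 2033 - 2024  # assumption: 9 compounding periods

rate_from_endpoints = implied_cagr(start_2024, end_2033, years)
value_at_stated_cagr = project(start_2024, 0.065, years)

print(f"CAGR implied by endpoints: {rate_from_endpoints:.2%}")
print(f"2033 value at the stated 6.5%: {value_at_stated_cagr:.2f}B")
```

With these inputs the endpoints imply a somewhat higher rate than the stated 6.5%, so the quoted figures are only approximately consistent with each other; the direction of the forecast is unaffected.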
53. com2kid ◴[] No.42623427{3}[source]
Tons of people do, my next machine will likely be a Mac for 60% this reason and 40% Windows being so user hostile now.
54. Filligree ◴[] No.42623432{5}[source]
> No. There's already too much porn on the internet, and AI porn is cringe and will get old very fast.

I needed an uncensored model in order to, guess what, make an AI draw my niece snowboarding down a waterfall. All the online services refuse on basis that the picture contains -- oh horrors -- a child.

"Uncensored" absolutely does not imply NSFW.

replies(1): >>42623952 #
55. fragmede ◴[] No.42623557{4}[source]
Survivorship bias though. It's hard to name all the companies that failed in the dot-com bust, and even the ones that made it through and failed later are harder to remember than the winners, because they're not around any more. But MCI, Palm, RIM, Nortel, Compaq, Pets.com, Webvan all failed and went to zero. There's an uncountable number of ICOs and NFTs that ended up nowhere. SVB isn't exactly a tech stock but they were strongly connected to it and they failed.
56. com2kid ◴[] No.42623573{7}[source]
Farmville style games underwent similar explosive estimates of growth, up until they collapsed.

Much of the growth in gaming of late has come from exploitive dark patterns, and those dark patterns eventually stop working because users become immune to them.

replies(1): >>42625629 #
57. com2kid ◴[] No.42623621{4}[source]
OpenAI built a 3 billion dollar business in less than 3 years of a commercial offering.
replies(1): >>42627088 #
58. bwfan123 ◴[] No.42623624{3}[source]
Developers developers developers - Ballmer monkey dance - the key to being entrenched is the platform ecosystem.

Also why AWS is giving Trainium credits for free

59. robohoe ◴[] No.42623817{3}[source]
Nvidia joined S&P500 in 2001 so if you've been doing passive index fund investing, you probably got a little bit of it in your funds. So there was some upside to it.
60. Paradigma11 ◴[] No.42623850{5}[source]
I think scams will create far more demand. Spear phishing targets by creating persistent, elaborate online environments is going to be big.
61. otabdeveloper4 ◴[] No.42623952{6}[source]
Yeah, and there's that story about "private window" mode in browsers because you were shopping for birthday gifts that one time. You know what I mean though.
replies(1): >>42624746 #
62. moralestapia ◴[] No.42623970{4}[source]
Doubt it, a year ago useful local LLMs on a Mac (via something like ollama) were barely taking off.

If what you say is true, you were among the first 100 people on the planet who were doing this, which, btw, further supports my argument about how extremely rare that use case is for Mac users.

replies(2): >>42625331 #>>42628423 #
63. moralestapia ◴[] No.42624005{4}[source]
I would bet that people running LLMs on their Macs, today, is <0.1% of their user base.
replies(3): >>42625314 #>>42626764 #>>42627711 #
64. otabdeveloper4 ◴[] No.42624219{6}[source]
It will always be cringe due to how so-called "AI" works. Since it's fundamentally just log-likelihood optimization under the hood, it will always be a statistically most average image. Which means it will always have that characteristic "plastic" and overdone look.
replies(1): >>42626174 #
65. Filligree ◴[] No.42624746{7}[source]
I really don't. Censored models are so censored they're practically useless for anything but landscapes. Half of them refuse to put humans in the pictures at all.
66. adolph ◴[] No.42624833{4}[source]
It is interesting to think about crypto as a stairstep that Nvidia used to get to its current position in AI. It wasn't games > ai, but games > crypto > ai.
67. adolph ◴[] No.42625252{6}[source]
The universal theme of Her was the set of harmonics that define what is something and the thresholds, boundaries, windows onto what is not thatthing but someotherthing, even if the thing perceived is a mirror, not just about human relationships in particular. The relationship was just a plot device to make a work of deep philosophy into a marketable romantic comedy.
68. sroussey ◴[] No.42625314{5}[source]
People buying Macs for LLMs—sure I agree.

Since the current macOS comes with small LLMs built in, that number might be closer to 50%, not 0.1%.

replies(1): >>42627383 #
69. sroussey ◴[] No.42625331{5}[source]
No, I got a MacBook Pro 14” with M2 Max and 64GB for LLMs, and that was two generations back.
70. mrguyorama ◴[] No.42625629{8}[source]
>Farmville style games underwent similar explosive estimates of growth, up until they collapsed.

They did not collapse, they moved to smartphones. The "free"-to-play gacha portion of the gaming market is so successful it is most of the market. "Live service" games are literally traditional game makers trying to grab a tiny slice of that market, because it's infinitely more profitable than making actual games.

>those dark patterns eventually stop working because users become immune to them.

Really? Slot machines have been around for generations and have not become any less effective. Gambling of all forms has relied on the exact same physiological response for millennia. None of this is going away without legislation.

replies(1): >>42626766 #
71. dnissley ◴[] No.42625745{3}[source]
Please name the dystopian elements of Her.
72. ceejayoz ◴[] No.42626174{7}[source]
The current state of the art in AI image generation was unimaginable a few years back. The idea that it'll stay as-is for the next century seems... silly.
replies(1): >>42632388 #
73. FuriouslyAdrift ◴[] No.42626549{4}[source]
Unified architecture is still on track for 2026-ish.
74. justincormack ◴[] No.42626764{5}[source]
Higher than that buying the top end machines though, which are very high margin
75. com2kid ◴[] No.42626766{9}[source]
> Slot machines have been around for generations and have not become any less effective.

Slot machines are not a growth market. The majority of people wised up to them literal generations ago, although enough people remain susceptible to maintain a handful of city economies.

> They did not collapse, they moved to smartphones

Agreed, but the dark patterns being used are different. The previous dark patterns became ineffective. The level of sophistication of psychological trickery in modern f2p games is far beyond anything Farmville ever attempted.

The rise of live service games also does not bode well for infinite growth in the industry as there's only so many hours to go around each day for playing games and even the evilest of player manipulation techniques can only squeeze so much blood from a stone.

The industry is already seeing the failure of new live service games to launch, possibly analogous to what happened in the MMO market when there was a rush of releases after WoW. With the exception of addicts, most people can only spend so many hours a day playing games.

76. croes ◴[] No.42627088{5}[source]
3 billion revenue and 5 billion loss doesn’t sound like a sustainable business model.
replies(2): >>42630094 #>>42632463 #
77. croes ◴[] No.42627105{5}[source]
Training data gets more expensive, and they need constant input; otherwise the AI's knowledge is outdated
78. sliken ◴[] No.42627370{4}[source]
Grace + Hopper, Grace + Blackwell, and the GB10 discussed here are much like the currently shipping AMD MI300A.

I do hope that an AMD Strix Halo ships with 2 LPCAMM2 slots for a total width of 256 bits.

79. moralestapia ◴[] No.42627383{6}[source]
I'm not arguing whether or not Macs are capable of doing it, but whether it is a material force that drives people to buy Macs; it's not.
80. vasco ◴[] No.42627519{3}[source]
One man's dystopia is another man's dream. There's no "missing" in the moral of a movie, you make whatever you want out of it.
81. throwaway48476 ◴[] No.42627711{5}[source]
All macs? Yes. But of 192GB mac configs? Probably >50%
82. kgwgk ◴[] No.42628423{5}[source]
People were running llama.cpp on Mac laptops in March 2023 and Llama2 was released in July 2023. People were buying Macs to run LLMs months before M3 machines became available in November 2023.
83. VikingCoder ◴[] No.42630094{6}[source]
The real question is what the next 3 years look like. If it's another 5 billion burned for 3 billion or less in revenue, that's one thing... But...
replies(1): >>42631889 #
84. VikingCoder ◴[] No.42630110{4}[source]
I mean, we still act like a "wild goose chase" is a bad thing.

We still schedule "bi-weekly" meetings.

We can't agree on which way charge goes in a wire.

Have you seen the y-axis on an economist's chart?

85. croes ◴[] No.42631889{7}[source]
How...
replies(1): >>42632175 #
86. menaerus ◴[] No.42632175{8}[source]
Recent report says there are 1M paying customers. At ~30USD for 12 months this is ~3.6B of revenue which kinda matches their reported figures. So to break even at their ~5B costs assuming that they need no further major investment in infrastructure they only need to increase the paying subscriptions from 1M to 2M. Since there are ~250M people who engaged with OpenAI free tier service 2x projection doesn't sound too surreal.
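
The back-of-envelope estimate above is easy to check by multiplying it out. The sketch below uses only the inputs stated in the comment (1M subscribers, ~30 USD per month, a ~3.6B revenue target); none of them are verified against OpenAI's actual reporting, and the function names are hypothetical.

```python
def annual_subscription_revenue(subscribers: int, monthly_price_usd: float) -> float:
    """Yearly revenue from a flat monthly subscription."""
    return subscribers * monthly_price_usd * 12


def subscribers_needed(target_revenue_usd: float, monthly_price_usd: float) -> float:
    """Subscribers required to reach a yearly revenue target."""
    return target_revenue_usd / (monthly_price_usd * 12)


# Inputs as stated in the comment (assumptions, not verified figures).
stated_revenue = annual_subscription_revenue(1_000_000, 30)
needed_for_3_6b = subscribers_needed(3_600_000_000, 30)

print(f"1M subscribers at $30/month: ${stated_revenue:,.0f} per year")
print(f"Subscribers needed for $3.6B: {needed_for_3_6b:,.0f}")
```

As written, 1M subscribers at 30 USD per month comes to 360M per year, not 3.6B; the 3.6B figure would correspond to roughly 10M subscribers at that price, so the subscriber count in the comment appears to be off by about a factor of ten. The proportional argument (revenue scales with paying users) still holds either way.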
87. otabdeveloper4 ◴[] No.42632388{8}[source]
If you're talking about some sort of non-existent sci-fi future "AI" that isn't just log-likelihood optimization, then most likely such a fantastical thing wouldn't be using NVidia's GPU with CUDA.

This hardware is only good for current-generation "AI".

88. com2kid ◴[] No.42632463{6}[source]
Rumor has it they run queries at a profit, and most of the cost is in training and staff.

If that is true, their path to profitability isn't super rocky. Their path to achieving their current valuation may end up being trickier though!

89. hatboat ◴[] No.42633300{3}[source]
If they're already an "enthusiast, grad student, hacker", are they likely to choose the "plumbers and people that know how to build houses" career track?

True passion for one's career is rare, despite the clichéd platitudes encouraging otherwise. That's something we should encourage and invest in regardless of the field.