
623 points magicalhippo | 3 comments | | HN request time: 0s | source
Karupan No.42619320
I feel this is bigger than the 5x series GPUs. Given the craze around AI/LLMs, this can also potentially eat into Apple’s slice of the enthusiast AI dev segment once the M4 Max/Ultra Mac minis are released. I sure wished I held some Nvidia stocks, they seem to be doing everything right in the last few years!
dagmx No.42619339
I think the enthusiast side of things is a negligible part of the market.

That said, enthusiasts do help drive a lot of the improvements to the tech stack so if they start using this, it’ll entrench NVIDIA even more.

Karupan No.42619510
I’m not so sure it’s negligible. My anecdotal experience is that since Apple Silicon chips were found to be “ok” enough to run inference with MLX, more non-technical people in my circle have asked me how they can run LLMs on their macs.

Surely a smaller market than gamers or datacenters, though.

stuaxo No.42620854
It's annoying. I work with LLMs and have a bit of an interest in them, doing stuff with GANs etc.

I have a bit of an interest in games too.

If I could get one platform for both, I could justify $2k, maybe a bit more.

I can't justify that for just one half. Running games on a Mac, right now via Linux: no thanks.

And on the PC side, Nvidia consumer cards only go up to 24GB, which is a bit limiting for LLMs, while being very expensive. I only play games every few months.
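[As a rough back-of-envelope sketch of why 24GB feels limiting: a model's weight footprint is roughly parameter count times bytes per parameter, plus some overhead for the KV cache and activations. The overhead factor below is an assumption and varies with context length and runtime.]

```python
def model_vram_gb(n_params_billion, bits_per_param, overhead=1.2):
    """Rough VRAM estimate in GB: weights only, scaled by an assumed
    overhead factor for KV cache and activations."""
    weight_bytes = n_params_billion * 1e9 * bits_per_param / 8
    return weight_bytes / 1e9 * overhead

# A 70B-parameter model at 4-bit quantization needs ~42 GB,
# well over a 24GB consumer card; a 13B model at 8-bit (~15.6 GB) fits.
print(round(model_vram_gb(70, 4), 1))
print(round(model_vram_gb(13, 8), 1))
```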

1. WaxProlix No.42622505
The new $2k card from Nvidia will be 32GB, but your point stands. AMD is planning a unified chiplet-based GPU architecture (AI/data center/workstation/gaming) called UDNA, which might alleviate some of these issues. It's been repeatedly delayed, though - hence the lackluster GPU offerings from team Red this cycle - so I haven't been getting my hopes up.

Maybe (LP)CAMM2 memory will make model usage just cheap enough that I can have a hosting server for it and do my usual midrange gaming GPU thing before then.

2. FuriouslyAdrift No.42626549
Unified architecture is still on track for 2026-ish.
3. sliken No.42627370
Grace + Hopper, Grace + Blackwell, and the discussed GB10 are much like the currently shipping AMD MI300A.

I do hope that an AMD Strix Halo machine ships with 2 LPCAMM2 slots for a total width of 256 bits.
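[To sketch what that 256-bit width buys: peak theoretical bandwidth is bytes per transfer times the transfer rate. Two LPCAMM2 modules at 128 bits each give a 256-bit bus; the LPDDR5X-8533 speed grade below is an assumption about what such a machine might ship with.]

```python
def peak_bandwidth_gbps(bus_width_bits, transfer_rate_mts):
    """Peak theoretical memory bandwidth in GB/s:
    (bus width in bytes) x (mega-transfers per second) / 1000."""
    return bus_width_bits / 8 * transfer_rate_mts / 1000

# 256-bit bus of LPDDR5X-8533: ~273 GB/s, versus ~68 GB/s
# for a typical 128-bit DDR5-4266-class desktop configuration.
print(round(peak_bandwidth_gbps(256, 8533)))
```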