Just the first sentence of the article would make me pretty regretful, though:
> I ordered a set of 10 Compute Blades in April 2023 (two years ago), and they just arrived a few weeks ago.
That's rough.
'Worth it any more'? At this size, never. A Pi is a Pi is a Pi!
A few are fine for toying around; beyond that, hah. Price:perf is rough, and it does not improve with multiplication [of units, cost, or complexity].
Unless you can keep your compute at 70% average utilization for 5 years, you will never save money purchasing your hardware compared to renting it.
Faith in the perfect efficiency of the free market only works out over the long term. In the short term we have a lot of habits that serve as heuristics for doing a good job most of the time.
Also no. The guy's a YouTuber.
On the other hand, will this make him 100k+ views? Yes. It's bait - the perfect combo to attract both the AI crowd and the 'homelab' enthusiasts (of which the bulk have yet to find any use for their Raspberry Pi devices)...
Or the oldie-but-goodie paper "Scalability! But at what COST?": https://www.usenix.org/system/files/conference/hotos15/hotos...
Long story short, performance considerations with parallelism go way beyond Amdahl's Law, because supporting scale-out also introduces a bunch of additional work that simply doesn't exist in a single node implementation. (And, for that matter, multithreading also introduces work that doesn't exist for a sequential implementation.) And the real deep down black art secret to computing performance is that the fastest operations are the ones you don't perform.
If your goal is to play with or learn on a cluster of Linux machines, the cost effective way to do it is to buy a desktop consumer CPU, install a hypervisor, and create a lot of VMs. It’s not as satisfying as plugging cables into different Raspberry Pi units and connecting them all together if that’s your thing, but once you’re in the terminal the desktop CPU, RAM, and flexibility of the system will be appreciated.
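For instance, a minimal sketch assuming Multipass as the hypervisor front end (libvirt, Proxmox, or ESXi would do just as well; the node names and sizes below are arbitrary):

    # spin up a four-node "cluster" of VMs on one desktop CPU
    for i in 1 2 3 4; do
      multipass launch --name node$i --cpus 2 --memory 2G --disk 10G
    done
    multipass list          # see all the nodes
    multipass shell node1   # hop into one of them
    # tear everything down when you're done
    multipass delete --all && multipass purge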
Not at all the best, but they were cheap. If I WANTED the best or reliable, I'd actually buy real products.
Also, the Mac Studio is a bit hampered by its low compute power, meaning you really can't use a 100B+ dense model; only an MoE is feasible without getting multi-minute prompt-processing times (assuming 500+ token prompts, etc.).
So for $3000, that's 3000 hours, or 125 days (if you just wastefully leave them on all the time, instead of turning them on when needed).
Say you wanted to play around for a couple of hours; that's like... $3.
(That's assuming there's no bonus for joining / free tier, too.)
Nothing that is not AGPL-licensed, so you and your company haven't taken advantage of it.
I am not sure how this relates to my comment though.
Not that its a problem, I don't see why it would inherently be a negative thing. Dude seems to make some good content across a lot of different mediums. Cheers to Jeff.
All you needed to do was buy 4x used 7900 XTX on eBay and build a four-node Raspberry Pi cluster using the external GPU setup you came up with in one of your previous blog posts [0].
[0] https://www.jeffgeerling.com/blog/2024/use-external-gpu-on-r...
https://www.jeffgeerling.com/projects
And the inference is that he is doing this for clicks, i.e. clickbait. The very title is disingenuous.
Your attack on the poster above you is childish.
2) Hardware optimization (the exact GPU you want may not always be available for some providers)
3) Not subject to price changes
4) Not subject to sudden Terms of Use changes
5) Know exactly who is responsible if something isn't working.
6) Sense of pride and accomplishment + Heating in the winter
https://www.youtube.com/c/JeffGeerling
"978K subscribers 527 videos"
Jeff's had a pattern of embellishing controversies, misrepresenting what people say, and using his platform to create narratives that benefit his content's engagement. This is yet another example of farming outrage to get clicks. I don't understand why people drool over his content so much.
Currently the cloud providers are dumping second-gen Xeon Scalables, and those things are pigs when it comes to power use.
Sound-wise, it's like someone running a hair dryer at full speed all the time, and it can be louder under load.
Maybe I'm missing something.
It’s an overrated, overhyped little computer. Like ok it’s small I guess but why is it the default that everyone wants to build something new on? Because it’s cheap? Whatever happened to buy once, cry once? Why not just build an actual powerful rig? For your NAS? For your firewalls? For security cameras? For your local AI agents?
YouTube is absolutely jam-packed with people pitching home "lab" sorts of AI buildouts that are just catastrophically ill-advised, but it yields content that seems to be a big draw. For instance, Alex Ziskind's content. I worry that people are actually dumping thousands to have poorly performing, ultra-quantized local AIs that will have zero comparative value.
$3,000 is well under many "oopsie billsies" from cloud providers.
And that's outside of the whole "I own it" side of the conversation, where things like latency, control, flexibility, & privacy are all compelling reasons to be willing to spend slightly more.
I still run quite a number of LLM services locally on hardware I bought mid-covid (right around $3k for a dual RTX 3090 + 124GB system RAM machine).
It's not that much more than you'd spend if you're building a gaming machine anyways, and the nifty thing about hardware I own is that it usually doesn't stop working at the 5 year mark. I have desktops from pre-2008 still running in my basement. 5 year amortization might have the cloud win, but the cloud stops winning long before most hardware dies. Just be careful about watts.
Personally - I don't think pi clusters really make much sense. I love them individually for certain things, and with a management plane like k8s, they're useful little devices to have around. But I definitely wouldn't plan to get good performance from 10 of them in a box. Much better off spending roughly the same money for a single large machine unless you're intentionally trying to learn.
What's the margin on unplugging vs just powering off?
I don't need to transcode + I need something I can leave on that draws little power.
I have a powerful rig, but the one time I get to turn it off is when I'd need the media server lol.
There's a lot of scenarios where power usage comes into play.
These clusters don't make much sense to me though.
1) How much worse / more expensive are they than a conventional solution?
2) What kinds of weird esoteric issues pop up, and how do they get solved (e.g. the resizable BAR issue for GPUs attached to the RPi's PCIe slot)
But if you're someone like me who intends to actively use the hardware for real-world purposes, the cloud often simply can't compete on price. At home, I have a mini PC with a 5600G, 32GB of RAM, and a few TB of NVMe storage. The entire thing cost less than $600 a few years ago, and consumes around 20W of power on average.
Even on the cheapest cloud providers available, an equivalent setup would exceed that price in less than half a year. SSD storage in particular is disproportionately expensive on the cloud. For small VMs that don't need much storage, it does make sense, but as soon as you scale up, cloud prices quickly start ballooning.
But when it comes to Vast/RunPod, it can also be annoying and genuinely become more expensive if you end up renting 2x the number of hours because you constantly have to upload and download data and checkpoints, pay continuous storage costs, transfer data to another server because the GPU is no longer available, etc. It's just less of a headache if you have an always-available GPU with a hard drive plugged into the machine, and that's it.
Nobody is really building CPU clusters these days.
> DO NOT TAKE HOME THE FREE 1U SERVER YOU DO NOT WANT THAT ANYWHERE A CLOSET DOOR WILL NOT STOP ITS BANSHEE WAIL TO THE DARK LORD AN UNHOLY CONDUIT TO THE DEPTHS OF INSOMNIA BINDING DARKNESS TO EVEN THE DAY
The economics of spending $3,000 on a video probably work out fine.
TL;DR: just buy one Framework Desktop and it's better than the OP's Pi AI cluster in every single metric, including cost, performance, efficiency, headache, etc.
Somehow I've actually gotten every item I backed shipped at some point (which is unexpected).
Hardware startups are _hard_, and after interacting with a number of them (usually one or two people with a neat idea in an underserved market), it seems like more than half fail before delivering their first retail product. Some at least make it through delivering prototypes/crowdfunded boards, but they're already in complete disarray by the end of the shipping/logistics nightmares.
But it can still be decent for HPC learning, CI testing, or isolated multi-node smaller-app performance.
And regarding efficiency, in CPU-bound tasks, the Pi cluster is slightly more efficient. (Even A76 cores on a 16nm node still do well there, depending on the code being run).
[1] The Framework Desktop is a beast:
https://news.ycombinator.com/item?id=44841262
[2] HP ZBook Ultra:
A lot of people (here, Reddit, elsewhere) speculate about how good/bad a certain platform or idea is. Since I have the means to actually test how good or bad something is, I try to justify the hardware costs for it.
Similar to testing various graphics cards on Pis, I've probably spent a good $10,000 on those projects over the past few years, but now I have a version of every major GPU from the past 3 generations to test on, not only on Pi, but other Arm platforms like Ampere and Snapdragon.
Which is fun, but also educational; I've learned a lot about inference, GPU memory access, cache coherency, the PCIe bus...
So a lot of intangibles, many of which never make it directly into a blog post or video. (Similar story with my time experiments).
Right now the Macs are viable purely because you can get massive amounts of unified memory. Be pretty great when they have the massive matrix FMA performance to complement it.
Was it fast? No. But that wasn't the point. I was learning about distributed computing.
I know for many who run SBCs (RK3588, Pi, etc.), a big part of the appeal is the 1-2W idle, which is almost nothing (and doesn't even need a heatsink if you can stand some throttling from time to time).
Most of the Intel Mini PCs (which are about the same price, with a little more performance) idle at 4-6W, or more.
The desktop equivalent of your 10 T3 Micro instances is about $600 if you buy new. For example, a Lenovo ThinkCentre M75q Gen 2 Tiny 11JN009QGE has an 8x3.2GHz processor with hyperthreading. That's 16 virtual cores compared to the 20 vCPUs of the T3 instances, but with much faster cores. And 16GB of RAM allows you to match the 1GB per instance.
If you don't have anything and feel generous throw in another $200 for a good monitor and keyboard plus mouse. But you can get a used crap monitor for $20. I'd give you one for free just to be rid of it.
That's a total of $800, or 33 days of forgetting to shut down the 10 VMs. Maybe half that if you buy used.
Granted, not everyone has $800 or even $400 to drop on hobby projects, so renting VMs often does make sense.
Another fun fact, the network module of the pi is actually connected to the USB bus, so there's some overhead as well as a throughput limitation.
Fun fact, the Pi does not have a power button, relying on software to shut down cleanly. If you lose access to the machine, it's not possible to avoid corrupted states on the disk.
Despite all of this, if you want to self-host some website, the Raspberry Pi is still an amazingly cost-effective choice; for anywhere between 2 and 20,000 monthly users, one Pi will be over-provisioned. You can even get an absolutely overkill redundant Pi as a failover, but a single Pi can reach 365 days of uptime with no problem, and as long as you don't reboot or lose power or internet, you can achieve more than a couple of nines of reliability.
But if you are thinking of a third, much less a tenth Raspberry Pi, you are probably scaling the wrong way: well before you reach the point where quantity matters (a third machine), it becomes cost-effective to upgrade the quality of your one or two machines.
On the embedded side it's the same story: these are great for prototyping, but you are not going to order 10k and sell them in production. Maybe a small 100-unit test batch? But you will optimize and make your own PCB before a mass batch.
It also means it performs like a 10 year old server CPU, so those 28 threads are not exactly worth a lot. The geekbench results, for whatever value those are worth, are very mediocre in the context of anything remotely modern: https://browser.geekbench.com/processors/intel-xeon-e5-2690-...
Like a modern 12-thread 9600x runs absolute circles around it https://browser.geekbench.com/processors/amd-ryzen-5-9600x
I then used many of his Ansible playbooks in my day-to-day job, which paid my bills and made my career progress.
I don't check YouTube, so I didn't know that he was a "YouTuber"; I do know his other side and how much I have leveraged his content/code in my career.
$50/month for 100W continuous usage isn't totally mad, and that could climb even higher over the rest of the decade.
Rates have gone up enormously because the cost of wildfires is falling on ratepayers, not the utility owners.
Regulated monopolies are pretty great, aren’t they? Heads I win, tails you lose.
It was expensive, but it is not slow for small queries.
Now, if I want to bump the context window to something huge, it does take 10-20 seconds to respond for agent tasks, but it’s only 2-3x slower than paid cloud models, in my experience.
Still a little annoying, and the models aren’t as good, but the gap isn’t nearly as big as you imply, at least for me.
No, you are dismissive because you don't care about the use-cases.
The RPi 4, 400, and 500 are great models. Consider all the advantages together:
i= support for current Debian
ii= stellar community
iii= ease of use (UX), especially for people new to Debian and/or coding and/or Linux
iv= quiet, efficient, low power and passively cooled
v= robust enough to be left running for a long time
There are cheaper, more performant x86 and ARM dev boards and SOCs. But nothing compares to the full set of advantages.
That said, building a $3K A.I. cluster is just a senseless, expensive lark. (^;
> Another fun fact, the network module of the pi is actually connected to the USB bus, so there's some overhead as well as a throughput limitation.
> Fun fact, the Pi does not have a power button, relying on software to shut down cleanly. If you lose access to the machine, it's not possible to avoid corrupted states on the disk.
With all these caveats in mind, a Raspberry Pi seems to be an incredibly poor choice for distributed computing.
Cf. the various Beagle boards, which have mainline Linux and u-boot support right from release, together with real open hardware right down to board layouts you can customise. And when you come to manufacture something more than just a dev board, you can actually get the SoC from your normal distributor and drop it on your board - unlike the strange Broadcom SoCs RPi uses.
I'm quite a lot more positive about rp2040 and rp2350, where they've at least partially broken free of that Broadcom ball-and-chain.
No, I wouldn’t think.
It's really not though. I've been a Pi user and fan since it was first announced, and I have dozens of them, so I'm not hating on RPi here; we did the maths some time back here on HN when something else Pi related came up.
If you go for a Pi 5 with, say, 8GB RAM, by the time you factor in an SSD + HAT + PSU + case + cooler (+ maybe a uSD), you're already in mini-PC price territory and can get something much more capable and feature-complete for about the same price. For a few £ more you get something significantly more capable: better CPU, iGPU, an RTC, proper networking, faster storage, more RAM, better cooling, etc., and you won't be using much more electricity either.
I went this route myself and have figuratively and literally shelved a bunch of Pis by replacing them with a MiniPC.
My conclusion, for my own use, after a decade of RPi use, is that a cheap mini PC is the better option these days for hosting/services/server duty and Pis are better for making/tinkering/GPIO related stuff, even size isn't a winner for the Pi any more with the size of some of the mini-PCs on the market.
That said, I'm of the opinion that power/water/internet should all be state/county/city run. I don't want my utility companies to have profit motives.
My water company just got bought up by a huge water company conglomerate and, you guessed it, immediate rate increases.
Commodity desktop cpus with 32 or 64GB RAM can do all of this in a low-power and quiet way without a lot more expense.
That combo gives you the better part of a gigabyte of L3 cache and an aggregate memory bandwidth of 600 GB/s, while still below 1000W total running at full speed. Plus your NICs are the fancy kind that let you play around with RoCEv2 and such nifty stuff.
It would also be relevant to then learn how to do things properly with SLURM and Warewulf etc., instead of a poor man's solution with Ansible playbooks like in these blog posts.
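For example, a rough sketch of what a minimal SLURM job script looks like (this assumes a cluster already provisioned with SLURM/Warewulf and a hypothetical MPI binary ./hello):

    #!/bin/bash
    #SBATCH --job-name=hello
    #SBATCH --nodes=4
    #SBATCH --ntasks-per-node=4
    #SBATCH --time=00:05:00
    srun ./hello

Submit it with sbatch and watch the queue with squeue; that workflow carries over directly to real HPC sites, unlike ad-hoc playbooks.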
> This post is more than 10 years old, I do not delete posts...
https://www.jeffgeerling.com/articles/religion/abortion-case...
I guess GP is referring to that post.
The homelab group on Reddit is full of people who don't understand any of this - they have full racks in their house that could be replaced with one high-end desktop.
If the goal is a lot of RAM and you don’t care about noise, power, or heat then these can be an okay deal.
Don’t underestimate how far CPUs have come, though. That machine will be slower than AMD’s slowest entry-level CPU. Even an AMD 5800X will double its single core performance and even walk away from it on multithreaded tasks despite only having 8 cores. It will use less electricity and be quiet, too. More expensive, but if this is something you plan to leave running 24/7 the electricity costs over a few years might make the power hungry server more expensive over time.
> After fixing the thermals, the cluster did not throttle, and used around 130W. At full power, I got 325 Gflops
I was sort of surprised to find that the top500 list on their website only goes back to 1993. I was hoping to find some ancient 70’s version of the list where his ridiculous Pi cluster could sneak on. Oh well, might as well take a look… I’ll pull from the sub-lists of
https://www.top500.org/lists/top500/
They give the top 10 immediately.
First list (June 1993):
placement name RPEAK (GFlop/s)
1 CM-5/1024 131.00
10 Y-MP C916/16256 15.24
Last list he wins, I think (June 1996):
1 SR2201/1024 307.20
10 SX-4/32 64.00
First list he’s bumped out of the top 10 (November 1997):
1 ASCI Red 1,830.40
10 T3E 326.40
I think he gets bumped off the full top500 list around 2002-2003. Unfortunately I made the mistake of going by Rpeak here, but they sort by Rmax, and I don’t want to go through the whole list. Apologies for any transcription errors.
Actually, pretty good showing for such a silly cluster. I think I’ve been primed by stuff like “your watch has more compute power than the Apollo guidance computer” or whatever to expect this sort of thing to go way, way back, instead of just to the 90’s.
If your local regulators approved the merger and higher rates, your complaint is with them as much as the utility company.
Not saying that some regulators are not basically rubber stamps or even corrupt.
Like what about the people who maintain the alpha/sparc/parisc linux kernels? Or the designers behind idk tilera or tenstorrent hardware.
If it's for personal use, do whatever... there's nothing wrong with buying a $60,000 sports car if you get a lot of enjoyment out of driving it. (you could also lease if you want to trade up to the "faster model" next year) For business, renting (and managed hosting) makes more sense.
For those like me that don't know the joke:
Two economists are walking down the street. One of them says “Look, there’s a twenty-dollar bill on the sidewalk!” The other economist says “No there’s not. If there was, someone would have picked it up already.”
Fuckin nutty how much juice those things tear through.
A lot of that group is making use of the IO capabilities of these systems to run lots of PCI-E devices & hard drives. There's not exactly a cost-effective modern equivalent for that. If there were cost-effective ways to do something like take a PCI-E 5.0 x2 and turn it into a PCI-E 3.0 x8 that'd be incredible, but there isn't really. So raw PCI-E lane count is significant if you want cheap networking gear or HBAs or whatever, and raw PCI-E lane count is $$$$ if you're buying new.
Also these old systems mean cheap RAM in large, large capacities. Like 128GB RAM to make ZFS or VMs purr is much cheaper to do on these used systems than anything modern.
I do get to see and play with a lot of interesting systems, but for most of them, I only get to go just under surface-level. It's a lot different seeing someone who's reverse engineered every aspect of an IBM PC110, or someone who's restored an entire old mainframe that was in storage for years... or the group of people who built an entire functional telephone exchange with equipment spread over 50 years (including a cell network, a billing system, etc.).
Exactly. This build sounds like the proverbial "1024 chickens" in Seymour Cray's famous analogy. If nothing else, the communications overhead will eat you alive.
I did (as did others), in fact, write in comments and complaints about the rate increases and buyout. That went unheard.
Like, if you have a large media library, you need to push maybe 10MB/s; you don't need 128GB of RAM to do that...
It's mostly just hardware porn - perhaps there are a few legit use cases for the old hardware, but they are exceedingly rare in my estimate.
Yeah, this is a now widely known issue with LLM processing. It can be remediated so that all nodes split the computation, but then you come back to the classic supercomputing problem of node interconnect latency/bandwidth bottlenecks.
It looks to me like many such interconnects emulate Ethernet cards. I wonder if that can be recreated using the M.2 slot rather than using that slot for node-local data, and cost-effectively so (like, cheaper than a bunch of 10GbE cards and short DACs).
Also the other peripherals you consider are irrelevant, since you would need them (or not), in other setups. You can use a pi without a PSU for example. And if you use an SSD, you have to consider that cost in whatever you compare it to.
>I went this route myself and have figuratively and literally shelved a bunch of Pis
>and I have dozens of them,
Reread my post? I meant specifically that Pis are great for the 1 to 2 range. With 3 Pis you should change to something else. So I'm saying they are good at the 100$-200$ budget, but bad anywhere above that.
I’m finally at the point where I can dedicate time to building an AI with a specific use case in mind. I play competitive paintball and would like to utilize AI for a handful of things, specifically hit detection in video streams. Pis were my natural choice simply because of the low cost of entry and wide range of supported products to get a PoV running. I even thought about reaching out to Jeff and asking his input.
This post didn’t change my direction too much, but it did help level set some realistic expectations. So thanks for sharing.
But certainly don’t imitate his choices, his economics aren’t your economics!
For just streaming a 4K Blu-ray you need more than 10MB/s; Ultra HD Blu-ray tops out at 144 Mbit/s. Not to mention if that system is being hit by something else at the same time (backup jobs, etc...).
Is the 128GB of RAM just hardware porn? Eh, maybe, probably. But if you want 8+ bays for a decent sized NAS then you're already quickly into price points at which point these used servers are significantly cheaper, and 128GB of RAM adds very little to the cost so why not.
I don't know anyone who would think this actually.
the common denominator is always capital gain
capitalism is the reason why we haven't been able to go back to the moon and build bases there
From the official website:
> Does Raspberry Pi 5 need active cooling?
> Raspberry Pi 5 is faster and more powerful than prior-generation Raspberry Pis, and like most general-purpose computers, it will perform best with active cooling.
I greatly respect Jeff's work, but he's a professional YouTuber, so his projects will necessarily lean towards clickbait and riding trends (Jeff, I don't mean this as criticism!) He's been a great advocate for doing interesting things with RasPis, but "interesting" != "rational"
It's definitely not suited for production, but there, you won't find old blade servers either (for the power to performance issue).
Plus cloud gaming is always limited in range of games, there are restrictions on how you can use the PC (like no modding and no swapping savegames in or out).
Competition is what creates efficiency. Without it you live in a lie.
You'd be surprised by the number of emails, Instagram DMs, YouTube comments, etc. I get—even after explicitly showing how bad a system is at a certain task—asking if a Pi would be good for X, or if they could run ChatGPT on their laptop...
blanket-blaming capitalism without good reasoning is becoming the new red-flag of "can't think critically"
Did OP really think his fellow humans are so moronic that they just didn't find out you can plug a couple of Raspberry Pis together?
Don’t hate the player, hate the game.
Frontier is right behind it with the same arrangement.
Having honest to god dedicated GPUs on their own data bus with their own memory isn't necessarily the fastest way to roll.
If anything, 2nd-hand AMD gaming rigs make more sense than old servers. I say that as someone with an always-off R720xd at home due to noise and heat. It was fun when I bought it during winter years ago, until summer came.
And suddenly you can start playing with distributed software, even though it's running on a single machine. For resiliency tests you can unplug one machine at a time with a single click. It will annihilate a Pi cluster in Perf/W as well, and you don't have to assemble a complex web of components to make it work. Just a single CPU, motherboard, m.2 SSD, and two sticks of RAM.
Naturally, using a high core count machine without virtualization will get you the best overall perf/W in most benchmarks. What's also important, but often not highlighted in benchmarks, is idle wattage, if you'd like to keep your cluster running and only use it occasionally.
On my 96 GB DDR5-6000 + RTX 5090 box, I see ~20s prefill latency for a 65k prompt and ~40 tok/s decode, even with most experts on the CPU.
A Mac Studio will decode faster than that, but prefill will be 10s of times slower due to much lower raw compute vs a high-end GPU. For long prompts that can make it effectively unusable. That’s what the parent was getting at. You will hit this long before 65k context.
If you have time, could you share numbers for something like:
llama-bench -m <path-to-gpt-oss-120b.gguf> -ngl 999 -fa 1 --mmap 0 -p 65536 -b 4096 -ub 4096
Edit: The only Mac Studio pp65536 datapoint I’ve found is this Reddit thread:
https://old.reddit.com/r/LocalLLaMA/comments/1jq13ik/mac_stu ...
They report ~43.2 minutes prefill latency for a 65k prompt on a 2-bit DeepSeek quant. Gpt-oss-120b should be faster than that, but still very slow.
https://core.coop/my-cooperative/rates-and-regulations/rate-...
Getting some NUC-like machines makes a lot more sense to me. You’ll get 2.5Gb/s Ethernet at the least and way more FLOPS as well.
Like, if you buy that card it can still be processing things for you a decade from now.
Or you can get 3 months of rental time.
---
And yes, there is definitely a point where renting makes more sense because the capital outlay becomes prohibitive, and you're not reasonably capable of consuming the full output of the hardware.
But the cloud is a huge cash cow for a reason... You're paying exorbitant prices to rent compared to the cost of ownership.
private space companies, despite decades of hype and funding, have stagnated by comparison
the fact that SpaceX depends heavily on government contracts just to function is yet another proof: their "innovation" isn't self sustaining, it's underwritten by taxpayer money
are you denying that NASA landed on the Moon?
Elon psyop doesn't work on me, i know who is behind it all, they need a charismatic sales man for the masses, just like Ford, Disney, Reagan and all, masking structural power with a digestible story for the masses
> blanket-blaming capitalism without good reasoning is becoming the new red-flag of "can't think critically"
it's quite the opposite, people unable to take criticism of capitalism, talk about "critical thinking", how is China doing?
Handy: https://700c.dk/?powercalc
My Pi CM4 NAS with a PCIe switch, SATA and USB3 controllers, 6 SATA SSDs, 2 VMs, 2 LXC containers, and a Nextcloud snap pretty much sits at 17 watts most of the time, hitting 20 when a lot is being asked of it, and 26-27W at absolute max with all I/O and CPU cores pegged. €3.85/mo if I pay ESB, but I like to think that it runs fully off the solar and batteries :)
For comparison there are 9,988,224 GPU compute units in El Capitan and only 1,051,392 CPU cores. Roughly one CPU core to push data to 10 GPU CUs.
A lot of businesses are paying obscene money to cloud providers when they could have a pair of racks and the staff to support it.
Unless you're paying attention to the bleeding edge of the server market and its costs (better yet, features and affordability), this sort of mistake is easy to make.
The article is by someone who does this sort of thing for fun, and views/attention, and I'm glad for it... it's fun to watch. But it's sad when this same sort of misunderstanding happens in professional settings, and it happens a lot.
This. Some cloud providers offer VMs with 4GB RAM and 2 virtual cores for less than $4/month. If your goal is to learn how to work with clusters, nothing beats firing up a dozen VMs when it suits your fancy, and shut them down when playtime is over. This is something you can pull off in a couple of minutes with something like an Ansible script.
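A rough sketch of that workflow, using Hetzner's hcloud CLI purely as an example (doctl, aws, or gcloud follow the same pattern; the names and server type here are placeholders):

    # create a dozen small VMs for an evening of cluster practice
    for i in $(seq 1 12); do
      hcloud server create --name lab-$i --type cx22 --image ubuntu-24.04
    done
    # ...play...
    # delete them when playtime is over
    for i in $(seq 1 12); do
      hcloud server delete lab-$i
    done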
And what case are you putting them into? What if you want it rack mounted? What about >1gig networking? What if I want a GPU in there to do whisper for home assistant?
Used gaming rigs are great. But used servers also still have loads of value, too. Compute just isn't one of them.
Zero of any of that is needed. The new Pi "works best" with a cooler, sure, but at standard room temps it will be fine for serving web apps and custom projects and things. You do not need an SSD. You do not need a HAT for anything.
Apparently the Pi 5 8GB is $120 though, WTF.
What personal web site or web app or project can't run just fine on a Pi Zero 2 though? It's a little RAM starved but performance wise it should be sufficient.
Other than second-hand mini PCs, old laptops also make great home servers. They have built in UPS!
YouTube demonstrably wants clickbait titles and thumbnails. They built tooling to automatically A/B test titles and thumbnails for you.
YouTube could fix this and stop it if they wanted, but that might lose them 1% of business, so they never will.
They love that you blame creators for this market dynamic instead of the people who literally create the market dynamic.
Pretty sure most of us aren't running anywhere close to full load 24/7, but whoa, Irish power is expensive. In the central US I pay $0.14/KWh.
The only 100% required thing on there is some sort of power supply, and an SD card, and I suspect a lot of people have a spare USB-C cable and brick lying around. A cooler is only recommended if you're going to be putting it under sustained CPU load, and they're like $10 on Amazon.
The current RPi 5 makes no sense to me in any configuration, given its pricing.
Then look at Apple’s ARM offerings, and AWS Graviton if you need ARM with raw power.
If you need embedded/GPIO you should consider an Arduino, or a clone. If you need GPIOs and Internet connectivity, look at an ESP32. GPIOs, ARM, and wired Ethernet? Consider the STM32H.
Robotics/machine vision applications needing IO and lots of compute power? Consider a regular PC with an embedded processor on serial or USB. Or an Nvidia Jetson if you want to run CUDA stuff.
And take a good hard look at your assumptions, as mini PCs using the Intel N100 CPU are very competitive with modern Pis.
Quickly learned that there is so much more to manage when you split a task up across systems, even when the system (like Cinema 4D) is designed for it.
Which got me thinking about how do these frontier AI models work when you (as a user) run a query. Does your query just go to one big box with lots of GPUs attached and it runs in a similar way, but much faster? Do these AI companies write about how their infra works?
Starting with the Pi 4, they started saying that a cooler isn't required, but that it may thermal throttle without one if you keep the CPU pegged.
I run a K8s "cluster" on a single xcp-ng instance, but you don't even really have to go that far. Docker Machine could easily spin up docker hosts with a single command, but I see that project is dead now. Docker Swarm I think still lets you scale up/down services, no hypervisor required.
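A sketch of the Swarm route, for the curious (single machine, no hypervisor; the service name and image are arbitrary):

    docker swarm init                                   # turn this host into a one-node swarm
    docker service create --name web --replicas 3 -p 8080:80 nginx
    docker service scale web=10                         # "scale out" without touching hardware
    docker service ls
    docker swarm leave --force                          # tear it down when finished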
I think the biggest problem with cluster products is that they just don't work out of the box. Vendors haven't really done the "last 2%" of development required to make them viable - its left to us purchasers to get the final bits in place.
Still, it'll make a fun distributed computing experimental platform some day.
Just like the Inmos Transputer I've got somewhere, sitting in a box, waiting for a power supply ..
I believe the Rasp Pi cluster is one of the cheapest multi-node / MPI machines you can buy. That's useful even if it isn't fast. You need to practice the programming interfaces, not necessarily make a fast computer.
However, NUMA is also a big deal. The various AMD Threadrippers with multi-die memory controllers are better in this regard. Maybe the aging Threadripper 1950X: yes, it's much slower than modern chips, but the NUMA issues are exaggerated (especially poor) on that old architecture.
That exaggerates the effects of good NUMA handling, so you as a programmer can build up more NUMA skills.
Of course, the best plan is to spend $20,000,000++ on your own custom NUMA nodes cluster out of EPYCs or something.
-------
But no. The best supercomputers are the real ones that you should rent some time on. You need a local box to see various issues and learn to practice programming.
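As a sketch, the same MPI workflow carries over whether it's a Pi cluster, a pile of VMs, or rented time on a real machine (this assumes Open MPI and a hypothetical hello.c):

    mpicc -O2 -o hello hello.c
    # local practice box: oversubscribe a few ranks on one machine
    mpirun --oversubscribe -np 8 ./hello
    # small physical cluster: list the nodes in a hostfile and fan out
    mpirun -np 16 --hostfile nodes.txt ./hello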
Particularly with Pi 5, any old brick that might be hanging around has a fair chance at not being able to supply sufficient power.
The best option for DP throughput for hobbyists interested in HPC might be old AMD cards from before they, too, realized that scientific folks would pay up the nose for higher precision.
Not so good, and this is the sort of title you need to bring the punters in on YouTube.
I don't mean to sound too cynical, I appreciate Jeff's videos, just wanted to point out that if you've spent money and time on content you can either ditch it or make a regret video.
Just so long as the thumbnails don't have an arrow on them I'm happy.
If one just wants a cheap desktop box to do desktop things with, then they're a terrible option, price-wise, compared to things like used corpo mini-PCs.
But they're reasonably cost-competitive with other new (not used!) small computers that are tinkerer-friendly, and unlike many similar constructs there's a plethora of community-driven support for doing useful things with the unusual interfaces they expose.
The only problem in practice is that server CPUs don't support S3 suspend, so putting whole thing to sleep after finishing with it doesn't work.
https://www.servethehome.com/lenovo-system-x3650-m5-workhors...
And then, there's the sourcing problem. Components that looked like they were in big supply when the hardware was specced, can end up being in short supply, or worse end of lifed while you're trying to get all the firmware working.
They are essentially for kids to play around with, learning computing by blinking LEDs and integrating with circuit boards. The idea of building a high-performance cluster with Pis is dumb from day one.
It's most fun when you can prove the vendor's datasheet is lying about some pin or some function, but they still don't update it after a decade or more. So everyone integrating the chip who hasn't before hits the exact same speed bump!
The ones that are dead straight with no clickbait are 10/10 (the worst performers), and usually by a massive margin. Even with the same thumbnail.
The sad fact is, if you want your work seen on YouTube, you can't just say "I built a 10 node Raspberry Pi blade cluster and ran HPL and LLMs on it".
Some people are fine with a limited audience. And that's fine too! I don't have to write on my blog at all—I earn negative income from that, since I pay for hosting and a domain, but I hope some people enjoy the content in text form like I do.
And yes, they basically have 1 Tbps+ interconnects and throw tens or hundreds of GPUs at queries. Nvidia was wise to invest so much in their networking side—they have massive bandwidth between machines and shared memory, so they can run massive models with tons of cards, with minimal latency.
It's still not as good as tons of GPU attached to tons of memory on _one_ machine, but it's better than 10, 25, or 40 Gbps networking that most small homelabs would run.
Maybe one of the Fractal Design cases with a bunch of drive bays?
> What if you want it rack mounted?
Companies like Rosewill sell ATX cases that can scratch that itch.
> What about >1gig networking?
What about a PCI Express card? Regular ATX computers are expandable.
> What if I want a GPU in there to do whisper for home assistant?
I mean... We started with a gaming rig, right? Isn't a GPU already implicit?
But single board computers with something external to do your GPIO is often way more compelling.
It was also how I learned to setup a Hadoop cluster, and a Cassandra cluster (this was 10 years ago when these technologies were hot)
Having knowledge of these systems and being able to talk about how I set them up and simulated recovery directly got me jobs that 2x'd and then 3x'd my salary. I would highly recommend all medium-skilled developers set up systems like this and get practicing if you want to get up to the next level.
Where a "kid" may be a 53 years old with 30+ years softdev experience who ultimately got to get to the stuff he wanted to for quite some time, and the "blinking LEDs" are a bunch of servos programmatically controlled based on input from a bunch of sensors. While there are definitely better alternatives based on various narrow metrics, especially when it may come to actual productization, the ease (and cheapness, so you don't think much about that spending) of starting with all those easily available for RPi servo array drive boards and various IO ports array boards and all the available software - it is hard to imagine how it can be more easy/cheaper/available than it already is with all that actual compute power and full-featured Linux environment.
I would throw in the RP2040 for consideration as well, and nRF chips if you need wireless connectivity.
You're describing people using RPis to learn distributed systems, and you conclude that these RPis are wasted because RPis were made for pedagogy?
> I run a K8s "cluster" on a single xcp-ng instance, but you don't even really have to go that far.
That's perfectly fine. You do what works for you, just like everyone else. How would you handle someone else accusing your computer resources of being wasted?
My realization in ordering the Rock-2Fs is I really only need an MMU (that is, an SBC instead of something like an ESP32) when I'm running something with a graphical desktop, which is, outside my workstation, never (except for kiosks, which I use Android tablets for). Or when I want to plug something into a bloated SBC board, which saves me from having to solder a connector on, which is sometimes.
I use one for running a timelapse camera (the camera is USB) while another is a portable mp3 player I can put in a shirt pocket and which has an aux port (though its aux line is noisy). So that's two of the four Rock-2F boards in use... but it took me far less time to think up uses for and deploy 25/25 of Seeed Studio's ESP32-C3 boards I ordered a couple of years ago, and I have used only ~5/25 of the ESP32-C6s I ordered early this year. They're so cheap, and use so much less energy than ARM boards, that it's difficult to justify using the SBCs anymore.
I think they're asking $50 for a base 2GB Pi 4B now -- that's 10 ESP32-C3 boards (with integrated WiFi and BMS, btw!) -- and the Pi 5 is even less competitive, except in what I'd characterize as a very unusual scenario where you need high compute at the edge (where it's both needed AND the latency of computing at the edge is lower than sending it to a central server for processing), OR you need the security of protected memory, OR you have no central server and an ESP32 isn't going to cut it (I'll say, though, that one can run a thermostat with multiple WiFi-connected thermometers, and run a web server interface, just fine).
What if you had a single server with a zillion cores in it? Maybe you could take some 15 year old MPI code and run it locally -- it'd be like a mini supercomputer with an impossibly fast network.
Intel's first quad core was Kentsfield in 2006. It supports VT-x. AMD's first quad core likewise supports AMD-V. The newer virtualization extensions mostly just improve performance a little or do things you probably won't use anyway like SR-IOV.
One day my primary Raspberry Pi broke (turned out to be a PSU issue), and I thought of having an old laptop running 24/7 as a home server. While not very power hungry, it still wants much more energy (plus it has fans). For casual usage (I forgot to mention Pi-hole) it feels like overkill. So, while a Raspberry Pi isn’t the best, it has its niche, and I’m happy to have one (actually, a few).
The EU (and maybe China?) have been regulating standby power consumption, so most of my appliances either have a physical off switch (usually as the only switch) or should have very low standby power draw.
I don't have the equipment to measure this myself.
The thing that matters more than the CPU for idle power consumption is how efficient the system's power supply is under light loads. The variance between them is large and newer power supplies aren't all inherently better at it.
My cursory research indicates that a low-end Ryzen would make sense if you are building the board yourself. Right now, I haven’t found a new Ryzen mini PC under $200. New N100 minis can be had for $150-175, and if you don’t care so much about power, N95 minis are even cheaper.
RockChip, maybe? Little bit pricier but more powerful than Rpi?
If you want to learn physical networking or really need to "see" things happening on physically separate machines just get a free old PC from gumtree or something.
> llama-bench -m ./gpt-oss-120b-MXFP4-00001-of-00002.gguf -ngl 999 -fa 1 --mmap 0 -p 65536 -b 4096 -ub 4096
| model | size | params | backend | threads | n_batch | n_ubatch | fa | mmap | test | t/s |
| ------------------------------ | ---------: | ---------: | ---------- | ------: | ------: | -------: | -: | ---: | --------------: | -------------------: |
| gpt-oss 120B MXFP4 MoE | 59.02 GiB | 116.83 B | Metal,BLAS | 16 | 4096 | 4096 | 1 | 0 | pp65536 | 392.37 ± 43.91 |
| gpt-oss 120B MXFP4 MoE | 59.02 GiB | 116.83 B | Metal,BLAS | 16 | 4096 | 4096 | 1 | 0 | tg128 | 65.47 ± 0.08 |
build: a0e13dcb (6470)
Minimum Delivery Charge (what’s paid monthly, which is largely irrelevant, before annual true-up of NEM charges): $11.69/month
Actual charges, billed annually, per kWh:
Peak NEM charge: $.62277
Off-Peak NEM charges: $.31026
Plus 3-20% extra (depending on the month) in “non-bypassable charges” (I haven’t figured out where these numbers come from), then a 7.5% local utility tax. Those rates do get a little lower in the winter (.30 to .48), and of course the very high rates benefit me when I generate more energy than I consume (which only happens when I’m on vacation). But the marginal all-in costs are just very high.
That’s NEM2 + TOU-EV2A, specifically.
Not sure if it's not properly doing lower power states, or if it's the 10 HDDs spinning. Or even the GPU. But also don't really have anything important running on it that I can't just turn it off.
It's recommended for Pi 5, and if you're running a Pi 4, you should at least use a little heat sink, the 4 and 5 run pretty warm, and under any load they can throttle quite easily. I run mine in a rack, in the UK where it's not very warm compared to other parts of the world, and they get pretty warm even with cooling.
> Also the other peripherals you consider are irrelevant, since you would need them (or not), in other setups
No, they're not irrelevant, because if you buy a Mini-PC you get SSD, RAM, cooling, case, PSU included in the price.
> You can use a pi without a PSU for example
You can wing it with some odd USB charger you have lying around, but my experience over a decade of killing tens of high-quality microSDs in Pis, power throttling, and brownouts is that you should stick to the Pi-spec (5.1V) PSUs. The current rating can typically be lower than rated if you're not connecting peripherals, but a proper USB-spec plug will be 5V, not the 5.1V the Pi wants.
> Reread my post? I meant specifically that Pis are great for the 1 to 2 range
I think you need to re-read mine, I'm not suggesting replacing all of the Pis with a mini-PC, I'm suggesting replacing ONE is cost-effective NOW, when compared to Pi 5.
> So I'm saying they are good at the 100$-200$ budget
Disagree (at least as things stand here in the UK with our current pricing).
Mini-PC with N100, 16GB RAM, 512GB SSD, case, cooling, PSU, better IO, much better performance, etc: £128[0]
Pi 5, bare board, nothing else: £114[1]
These aren't some obtuse websites, they're places I shop all the time, PiHut is an official distributor in the UK, and the Amazon result is the second result for "mini pc".
The thing about the performance gap here is that you _can_ replace 2-3+ Raspberry Pis with a single Mini-PC for the same price as a single Raspberry Pi 5. I've occasionally seen mini PC models on Amazon go on sale for £99 and less.
I'm not talking theoretical or napkin maths, I've literally done it, I replaced a bunch of Pis with a mini PC and now the Pis sit idle because there's still LOTS of headroom on the mini PC to add more, before I need to even consider firing up the Pis again for other stuff.
The Pi, _to me_, in 2025, is a great tool for learning, and building upon, using the GPIO and the excellent resources, but for self-hosting services, it no longer adds up.
By services I mean software tools, services, things actively "doing work", not a personal blog or project that could run on a vape[2].
[0] https://www.amazon.co.uk/BOSGAME-Computers-Windows-Desktop-G... [1] https://thepihut.com/products/raspberry-pi-5?src=raspberrypi... [2] https://news.ycombinator.com/item?id=45252817
You know in k8s you've got worker nodes and control plane nodes? The control planes don't need much horsepower, but they're what you need to be online all to communicate with the cluster. Pis work just fine for that.
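A rough sketch with k3s, which is a common choice on Pis (kubeadm works too; the address and token below are placeholders):

    # on the Pi acting as a control-plane node
    curl -sfL https://get.k3s.io | sh -s - server
    sudo cat /var/lib/rancher/k3s/server/node-token    # token the workers will need
    # on each beefier worker node
    curl -sfL https://get.k3s.io | K3S_URL=https://<pi-address>:6443 K3S_TOKEN=<token> sh -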
For dedicated build boxes that crunch through lots of sources (whole distributions, AOSP) but do run seldomly, getting your hands on lots of Cores and RAM very cheaply can still trump buying newer CPUs with better perf/watt but higher cost.
Have you looked at what they cost? Those cases alone cost as much as a used server. Which comes with a case.
> What about PCI Express card? Regular ATX computers are expandable.
As mentioned higher up, they run out of lane count in a hurry. Especially when you're using things like used Connect-X cards
You also don't need RPis to learn anything about programming, networking, electronics, etc.
But people do it anyways.
I really don't see what point anyone thinks they are making regarding pedagogy. RPis are synonymous with tinkering, regardless of how you cut it. Distributed systems too.
Again, lots of variables there and it really depends on how heavily you intend to use/rely on that sandbox as to what's the better play. Regional pricing also comes into it.
I remember back in the R710 days (circa 2008 and Nehalem/Westmere CPUs) that under like 30% CPU load, most of your power draw came from fans that you couldn't spin down below a certain threshold without a firmware/iDRAC script, as well as, as you mentioned, those PSUs being optimized for high sustained loads and thus being inefficient near idle and at low usage.
IIRC at system idle, the power profile on those was only like 15% CPU (that's combined for both CPUs), with the rest being fans, RAM, the various other vendor stuff (iDRAC, PERC, etc.), and low-load PSU inefficiencies.
Newer hardware has gotten better, but servers are still generally engineered for above 50% sustained loads rather than under, and those fans can still easily pull a dozen-plus watts each even at very low usage (of course, it depends on the exact model). So, point being, splitting hairs over a dozen watts or so between CPUs is a bit silly when your power floor from fans and PSU inefficiencies alone puts you at 80W+ draw anyway, not to mention the other components (NIC, drives, storage controller, OoB, RAM, etc.). Also, this is primarily relevant for surplus servers, but a lot of people building systems at home for the use case relevant to this discussion often turn to or are recommended these servers, so I just wanted to add this food for thought.
I can solve for them with three equations for three unknowns... but since they change the rates quarterly by the time I know what my exact rates were they have changed.
1) Raspberry Pi's competitors have gotten better; that NUC is very cheap.
2) The Pi has gone in a different direction, increasing specs and price; the 3B+ or 4A had much lower specs, price, power consumption, etc...
In conclusion, if you can get an ARM SoC board with specs similar to the 3B+ or 4A (500MB to 2GB RAM), then you can host a blog on Linux for cheap. It should run you in the $50 area. But Raspberry no longer makes these; you might look into the thousands of competitors.
Additionally, if you want something more serious, NUCs become reasonable, though it's hard to tell whether two $50 Pis or one $200 Intel NUC would be better. It depends on the tradeoffs.
I mean: An ATX case can be paid for once, and then be used for decades. (I'm writing this using a modern desktop computer with an ATX case that I bought in 2008.)
PCI Express lanes can be multiplied. There should frankly be more of this going on than there is, but it's still a thing that can be done.
Consumer boards built on the AMD X670E chipset, for instance, have some switching magic built in. There's enough direct CPU-connected lanes for an x16 GPU and a couple of x4 NVMe drives, and the NIC(s) and/or HBA(s) can go downstream of the chipset.
(Yeah, sure: It's limited to an aggregate 64 Gbps at the tail end, but that's not a problem for the things I do at home where my sights are set on 10Gbps networking and an HBA with a bunch of spinny disks. Your needs may differ.)
If I spill something on my own hardware, the max out-of-pocket amount I lose is the amount I spent on that hardware.
If I run up an AWS/GCP/Azure bill accidentally... the max out-of-pocket amount I lose is often literally unbounded. Are there some guardrails you can put around this? Sure. But they're often confusing, misleading, delayed, or riddled with "holes" which they don't catch.
Ex - the literal best AWS offers you is delayed "billing alarms" which need to be manually enabled and configured, and even then don't cover all the services you might incur billing charges for.
It's not that "Oopsies" can't happen locally - it's that even if they do, I have a clear understanding of the potential costs by default, and they're much less intangible than "I left a thing running overnight and I now I owe AWS a new car worth of cash".
The worst case for a misconfigured bit of software locally is that my machine stalls and my services go down (ex - overloaded). The worst case for a misconfigured bit of software in AWS is literal bankruptcy.
Think about that for a minute.
The issue with competing ARM SBCs is the software support; Radxa makes some boards that are more powerful than Pis, but if you read the forums they've had hardware flaws in the designs, and they run old kernels and don't get updated, and of course there isn't the community behind it.
An x86 mini pc is a different beast to a Pi, but then I think a lot of people who were hosting software on a Pi weren't specifically looking for ARM architecture anyway, unless they were, in which case stick with a Pi.
In this case that 'new' is energy-efficient software, down to the individual lines of code and what their energy cost is on certain hardware. Academics are publishing about it in niche corners of the web and some entrepreneurs are doing it, but of course none of this is cool now, so we remain a mockery for our objectives. In time this too will become a real thing, as many are just beginning to feel the ever-rising costs of energy, which is only starting to increase from decisions made years ago. The worst is yet to come, as seen and heard directly from every single expert who has testified in recent years before the Energy and Commerce committee; however, only the outside-the-boxers among us watch such educational content to better prepare for tomorrow.
Electricity powers our world and nearly all take it for granted, time too will change this thinking.
:D
I'm well aware of the costs of power and the logistics of colocation; this is purely about how I'm more willing to spend $100-$200 on a toy than I am $1000-$2000.
[1] https://en.wikipedia.org/wiki/Shannon_hydroelectric_scheme