    337 points throw0101c | 11 comments
    1. mikewarot ◴[] No.44609878[source]
    I'm waiting for the other shoe to drop: someone coming out with an FPGA optimized for reconfigurable computing that cuts the cost of LLM compute by 90% or more.
    replies(6): >>44609898 #>>44609932 #>>44610004 #>>44610118 #>>44610319 #>>44610367 #
    2. pdhborges ◴[] No.44609898[source]
    Where would that improvement come from? The hardware to compute GEMM as fast as possible is already here.
    replies(1): >>44609936 #
    3. trhway ◴[] No.44609932[source]
    We're already seeing it with DeepSeek's and other optimizations - like that law about highways: the wider the highway, the more traffic it attracts. Dropping costs by 90% would open up even more use cases.

    As for white-collar job replacement - we can always evolve up the knowledge/skills/value chain. It is the blue-collar jobs where the bloodbath, with all the robotics, is coming.

    replies(1): >>44617413 #
    4. leakyfilter ◴[] No.44609936[source]
    Raw GEMM computation was never the real bottleneck. Feeding the matmuls - i.e. memory bandwidth - is where it's at, especially on the newer GPUs.
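
    A minimal roofline-style sketch of that point in Python; the peak-throughput, bandwidth, and matrix-size figures below are placeholder assumptions, not vendor specs:

      # Back-of-envelope roofline check: is a single-token decode matmul
      # compute-bound or memory-bound? All figures are illustrative
      # assumptions, not vendor specs.

      peak_flops = 1.0e15          # assumed peak FP16 throughput, FLOP/s
      peak_bw = 3.0e12             # assumed HBM bandwidth, bytes/s
      machine_balance = peak_flops / peak_bw   # FLOP/byte needed to keep the chip busy

      # GEMV during batch-1 decode: a d_out x d_in weight matrix times one vector.
      d_in, d_out, bytes_per_weight = 8192, 8192, 2    # FP16 weights
      flops = 2 * d_in * d_out                         # one multiply-add per weight
      bytes_moved = d_in * d_out * bytes_per_weight    # weight traffic dominates
      intensity = flops / bytes_moved                  # ~1 FLOP per byte

      print(f"machine balance:      {machine_balance:.0f} FLOP/byte")
      print(f"arithmetic intensity: {intensity:.0f} FLOP/byte")
      print("memory-bound" if intensity < machine_balance else "compute-bound")

    Under these assumptions, batch-1 decode sits at roughly 1 FLOP per byte against a machine balance in the hundreds, so the compute units mostly wait on HBM; batching raises the intensity roughly in proportion to batch size, which is why serving stacks batch aggressively.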
    5. crystal_revenge ◴[] No.44610004[source]
    This is where I do wish we had more people working on the theoretical CS side of things in this space.

    Once you recognize that all ML techniques, including LLMs, are fundamentally compression techniques, you should be able to come up with estimates of the minimum feasible size of an LLM based on: the amount of information that can be encoded in a given number of parameters, the relationship between information loss and model performance, and the amount of information contained in the original data set.

    I simultaneously believe LLMs are bigger than they need to be, but suspect they need to be larger than most people think, given that you are trying to store a fantastically large amount of information. Even with lossy compression (which, ironically, is what makes LLMs "generalize"), we're still talking about an enormous corpus of data to represent.
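
    A back-of-envelope sketch of the kind of estimate being suggested; the bits-per-parameter capacity, corpus size, and per-token entropy below are assumptions picked for illustration, not measured values:

      # Rough information-budget estimate: how much can a model of a given size
      # store vs. how much information is in the training corpus?
      # Every number here is an illustrative assumption.

      params = 70e9              # model size, parameters
      bits_per_param = 2.0       # assumed storable information per parameter
      corpus_tokens = 15e12      # assumed training-set size, tokens
      bits_per_token = 6.0       # assumed entropy of text per token after compression

      model_capacity_bits = params * bits_per_param
      corpus_info_bits = corpus_tokens * bits_per_token

      gb = 8e9  # bits per gigabyte
      print(f"model capacity : {model_capacity_bits / gb:,.0f} GB of information")
      print(f"corpus content : {corpus_info_bits / gb:,.0f} GB of information")
      print(f"implied lossy compression: {corpus_info_bits / model_capacity_bits:,.0f}x")

    Under those assumptions the model can retain only a small fraction of the corpus's raw information, consistent with viewing training as aggressive lossy compression; tightening any of the three inputs moves the estimated minimum feasible size accordingly.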

    replies(1): >>44610083 #
    6. sfpotter ◴[] No.44610083[source]
    Getting theoretical results along these lines that can be operationalized meaningfully is... really hard.
    7. charleshn ◴[] No.44610118[source]
    We already have ASICs - see Google's TPUs for some cost estimates.

    HBM is also very expensive.

    8. d-moon ◴[] No.44610319[source]
    I wouldn't wait. FPGAs weren't designed to serve this model architecture. Yes, they are very power efficient, but the layout/place-and-route overhead, the memory requirements (very few FPGAs on the market have HBM), slower clock speeds, and an all-around unpleasant developer experience make them a hard sell.
    9. wagwang ◴[] No.44610367[source]
    My chip friends constantly complain that Qualcomm owns a bunch of FPGA-related patents that stifle any meaningful FPGA progress.
    replies(1): >>44614315 #
    10. jillesvangurp ◴[] No.44614315[source]
    They'll expire eventually.
    11. mad0 ◴[] No.44617413[source]
    > As for white-collar job replacement - we can always evolve up the knowledge/skills/value chain.

    I'm not so sure about this one. I partially agree with the statement, but less-able colleagues might have trouble with this :( Ultimately there will be less for a plain human being to do.