The End of Moore's Law for AI? Gemini Flash Offers a Warning

(sutro.sh)

113 points sethkim | 1 comments | 03 Jul 25 17:34 UTC | HN request time: 0.22s | source

Show context

fusionadvocate ◴[03 Jul 25 19:22 UTC] No.44458352[source]▶

What is holding back AI is this business necessity that models must perform everything. Nobody can push for a smaller model that learns a few simple tasks and then build upon that, similar to the best known intelligent machine: the human.

If these corporations had to build a car they would make the largest possible engine, because "MORE ENGINE MORE SPEED", just like they think that bigger models means bigger intelligence, but forget to add steering, or even a chassi.

replies(3): >>44458546 #>>44458573 #>>44458576 #

furyofantares ◴[03 Jul 25 19:49 UTC] No.44458576[source]▶

>>44458352 #

This is extremely theorycrafted but I see this as an excellent thing driving AI forward, not holding it back.

I suspect a large part of the reason we've had many decades of exponential improvements in compute is the general purpose nature of computers. It's a narrow set of technologies that are universally applicable and each time they get better/cheaper they find more demand, so we've put an exponentially increasing amount of economical force behind it to match. There needed to be "plenty of room at the bottom" in terms of physics and plenty of room at the top in terms of software eating the world, but if we'd built special purpose hardware for each application I don't think we'd have seen such incredible sustained growth.

I see neural networks and even LLMs as being potentially similar. They're general purpose, a small set of technologies that are broadly applicable and, as long as we can keep making them better/faster/cheaper, they will find more demand, and so benefit from concentrated economic investment.

replies(1): >>44459174 #

fnord123 ◴[03 Jul 25 21:04 UTC] No.44459174[source]▶

>>44458576 #

They aren't arguing against LLMs They are arguing against their toaster's LLM to make the perfect toast from being trained on the tax policies of the Chang Dynasty.

replies(2): >>44459540 #>>44474361 #

1. int_19h ◴[05 Jul 25 17:53 UTC] No.44474361[source]▶

>>44459174 #

Thing is, we keep finding out again and again that having a very broad training mix in the baseline model makes it better across the board, including in those specialized tasks when you fine-tune it.

As I understand it, the general ability to reason is what the models get out of "being trained on the tax policies of the Chang Dynasty", and we haven't really figured out a better way to do so than to throw most everything at them. And even if all you do is make toast, you still need some intelligence.

↑