Most active commenters
  • sensanaty(3)
  • superfrank(3)

←back to thread

1479 points sandslash | 21 comments | | HN request time: 1.854s | source | bottom
Show context
hgl ◴[] No.44315520[source]
It’s fascinating to think about what true GUI for LLM could be like.

It immediately makes me think a LLM that can generate a customized GUI for the topic at hand where you can interact with in a non-linear way.

replies(8): >>44315537 #>>44315540 #>>44315566 #>>44315572 #>>44315981 #>>44316784 #>>44316877 #>>44319465 #
1. karpathy ◴[] No.44315566[source]
Fun demo of an early idea was posted by Oriol just yesterday :)

https://x.com/OriolVinyalsML/status/1935005985070084197

replies(9): >>44315985 #>>44316123 #>>44316264 #>>44316327 #>>44316476 #>>44320938 #>>44321641 #>>44322664 #>>44325125 #
2. aprilthird2021 ◴[] No.44315985[source]
This is crazy cool, even if not necessarily the best use case for this idea
3. hackernewds ◴[] No.44316123[source]
it's impressive but it seems like a crappier UX? that none of the patterns can really be memorized
4. suddenlybananas ◴[] No.44316264[source]
Having different documents come up every time you go into the documents directory seems hellishly terrible.
replies(1): >>44316580 #
5. sensanaty ◴[] No.44316327[source]
[flagged]
replies(2): >>44316378 #>>44321619 #
6. danielbln ◴[] No.44316378[source]
Maybe we can collect all of this salt and operate a Thorium reactor with it, this in turn can then power AI.
replies(1): >>44316397 #
7. sensanaty ◴[] No.44316397{3}[source]
We'll need to boil a few more lakes before we get to that stage I'm afraid, who needs water when you can have your AI hallucinate some for you after all?
replies(1): >>44317965 #
8. superfrank ◴[] No.44316476[source]
On one hand, I'm incredibly impressed by the technology behind that demo. On the other hand, I can't think of many things that would piss me off more than a non-deterministic operating system.

I like my tools to be predictable. Google search trying to predict that I want the image or shopping tag based on my query already drives me crazy. If my entire operating system did that, I'm pretty sure I'd throw my computer out a window.

replies(1): >>44317434 #
9. falcor84 ◴[] No.44316580[source]
It's a brand of terribleness I've somewhat gotten used to, opening Google Drive every time, when it takes me to the "Suggested" tab. I can't recall a single time when it had the document I care about anywhere close to the top.

There's still nothing that beats the UX of Norton Commander.

10. iLoveOncall ◴[] No.44317434[source]
> incredibly impressed by the technology behind that demo

An LLM generating some HTML?

replies(1): >>44319895 #
11. TeMPOraL ◴[] No.44317965{4}[source]
Who needs water when all these hot takes come from sources so dense, they're about to collapse into black holes.
replies(1): >>44318595 #
12. sensanaty ◴[] No.44318595{5}[source]
Is me not wanting the UI of my OS to shift with every mouse click a hot take? If me wanting to have the consistent "When I click here, X happens" behavior instead of the "I click here and I'm Feeling Lucky happens" behavior is equal to me being dense, so be it I guess.
replies(1): >>44320639 #
13. superfrank ◴[] No.44319895{3}[source]
At a speed that feels completely seamless to navigate through. Yeah, I'm pretty impressed by that.
replies(1): >>44329689 #
14. TeMPOraL ◴[] No.44320639{6}[source]
No. But you interpreting and evaluating the demo in question as suggesting the things you described - frankly, yes. It takes a deep gravity well to miss a point this clear from this close.

It's a tech demo. It shows you it's possible to do these things live, in real time (and to back Karpathy's point about tech spread patterns, it's accessible to you and me right now). It's not saying it's a good idea - but there are obvious seeds of good ideas there. For one, it shows you a vision of an OS or software you can trivially extend yourself on the fly. "I wish it did X", bam, it does. And no one says it has to be non-deterministic each time you press some button. It can just fill what's missing and make additions permanent, fully deterministic after creation.

15. spamfilter247 ◴[] No.44320938[source]
My takeaway from the demo is less that "it's different each time", but more a "it can be different for different users and their styles of operating" - a poweruser can now see a different Settings UI than a basic user, and it can be generated realtime based on the persona context of the user.

Example use case (chosen specifically for tech): An IDE UI that starts basic, and exposes functionality over time as the human developer's skills grow.

16. dang ◴[] No.44321619[source]
"Please don't fulminate."

"Don't be curmudgeonly. Thoughtful criticism is fine, but please don't be rigidly or generically negative."

"Please don't post shallow dismissals, especially of other people's work. A good critical comment teaches us something."

"Please respond to the strongest plausible interpretation of what someone says, not a weaker one that's easier to criticize."

https://news.ycombinator.com/newsguidelines.html

17. superconduct123 ◴[] No.44321641[source]
That looks both cool and infuriating
18. throwaway314155 ◴[] No.44322664[source]
I would bet good money that many of the functions they chose not to drill down into (such as settings -> volume) do nothing at all or cause an error.

It's a fronted generator. It's fast. That's cool. But is being pitched as a functioning OS generator and I can't help but think it isn't given the failure rates for those sorts of tasks. Further, the success rates for HTML generation probably _are_ good enough for a Holmes-esque (perhaps too harsh) rugpull (again, too harsh) demo.

A cool glimpse into what the future might look like in any case.

19. asterisk_ ◴[] No.44325125[source]
I feel like one quickly hits a similar partial observability problem as with e.g. light sensors. How often do you wave around annoyed because the light turned off.

To get _truly_ self driving UIs you need to read the mind of your users. It's some heavy tailed distribution all the way down. Interesting research problem on its own.

We already have adaptive UIs (profiles in VSC anyone? Vim, Emacs?) they're mostly under-utilized because takes time to setup + most people are not better at designing their own workflow relative to the sane default.

20. iLoveOncall ◴[] No.44329689{4}[source]
Read the code that is actually being generated. It's only the content of the page, which itself is loaded progressively.

It takes 2 seconds to generate an extremely basic 300 characters page of content. Again, what is impressive here?

It's not fast, it gives the illusion of being fast.

replies(1): >>44334635 #
21. superfrank ◴[] No.44334635{5}[source]
I know what it's doing and I'm impressed. If you understand what it's doing and aren't impressed, that's cool too. I think we just see things differently and I doubt either of us will convince the other one to change their mind on this