←back to thread

317 points laserduck | 8 comments | | HN request time: 0s | source | bottom
Show context
aubanel ◴[] No.42158417[source]
I know nothing about chip design. But saying "Applying AI to field X won't work, because X is complex, and LLMs currently have subhuman performance at this" always sounds dubious.

VCs are not investing in the current LLM-based systems to improve X, they're investing in a future where LLM based systems will be 100x more performant.

Writing is complex, LLMs once had subhuman performance, and yet. Digital art. Music (see suno.AI) There is a pattern here.

replies(7): >>42158545 #>>42158550 #>>42158576 #>>42159935 #>>42160061 #>>42165587 #>>42169569 #
duped ◴[] No.42159935[source]
AI still has subhuman performance for art. It feels like the venn diagram of people who are bullish on LLMs and people who don't understand logistic curves is a circle.
replies(1): >>42161059 #
1. jjk166 ◴[] No.42161059[source]
You ask 100,000 humans each to make a photo realistic rendering of a alpaca playing basketball on the moon in 90 seconds, an LLM is going to outperform every single one of them.
replies(2): >>42163079 #>>42165589 #
2. astrange ◴[] No.42163079[source]
Diffusion models aren't actually LLMs, they're a different architecture. Which makes it even weirder we invented them at the same time.

Also, they might not be able to do it. eg most models can't generate "horse riding an astronaut" or "upside-down car".

replies(1): >>42167611 #
3. duped ◴[] No.42165589[source]
That's not a meaningful benchmark for valuing art or creating art
replies(1): >>42185998 #
4. sincerely ◴[] No.42167611[source]
To be fair, most humans can't draw any better than stick figures.
replies(1): >>42171006 #
5. foldr ◴[] No.42171006{3}[source]
This is true, but humans are much better at including specified elements in an image with specified spatial relationships. A description like a "A porpoise seated at a desk writing a letter" will reliably produce (terrible) drawings consisting of parts corresponding to the porpoise, parts corresponding to the desk, and parts corresponding to the letter, with the arrangement of the parts roughly corresponding to the description.
replies(1): >>42186231 #
6. jjk166 ◴[] No.42185998[source]
What meaningful benchmark would you use? Art by it's nature is subjectively experienced - what one person considers great, meaningful, soul-moving art, another may consider terrible, meaningless, and empty. Both opinions are equally valid.

But if you're using AI to create art, you're typically not trying to move someone's soul. You're trying to create a work that depicts something in a particular style with a particular fidelity with a certain amount of resource consumption. That is the only metric by which it makes any sense to evaluate the machine designed to do that specific task.

7. jjk166 ◴[] No.42186231{4}[source]
Humans being better at one specific aspect of a task is not equivalent to humans being overall better at the task.

I just entered your prompt into an AI image generator and in under a second it gave me an image[0] of what looks to me like an anthropomorphic dolphin sitting at a desk writing a letter in a little study. I then had to google what the difference between a porpoise and a dolphin was because I genuinely thought porpoises looked much more like manatees. While I could nitpick the AI's work for making the porpoise's snout a little too long, had I drawn it the porpoise would have been a vaguely marine looking blob with no anatomy detailed enough to recognize let alone criticize. I am quite confident that if you asked for a large number of images based on that prompt from humans, it would easily rank among the best, and it's unlikely you'd get any which were markedly better. The fact it can generate this image nearly instantaneously though is astounding. If your goal was to get one masterpiece hanging in the Louvre, this particular tool would not suffice, but if your goal was to illustrate children's books, this tool could do in hours what would have taken a team of humans months. That is superhuman performance.

[0] https://api.deepai.org/job-view-file/e0b80ca6-d934-42e4-9a7e...

(Sorry if the link doesn't remain good for long)

replies(1): >>42189407 #
8. foldr ◴[] No.42189407{5}[source]
An AI image generator will sometimes do a good job on this sort of prompt, but it fails in different ways to the ways that humans fail.

Whether humans or AI are better at the task overall is probably too vague a question to answer, depending a lot on how you weight different desirables.