Most active commenters
  • bubblyworld(8)
  • throwaway290(5)

←back to thread

422 points simedw | 22 comments | | HN request time: 1.534s | source | bottom
Show context
bubblyworld ◴[] No.44433602[source]
Classic that the first example is for parsing the goddamn recipe from the goddamn recipe site. Instant thumbs up from me haha, looks like a neat little project.
replies(3): >>44435722 #>>44436466 #>>44438277 #
1. lpribis ◴[] No.44438277[source]
Another great example of LLM hype train re-inventing something that already existed [1] (and was actually thought out) but making it worse and non-deterministic in the worst ways possible.

https://schema.org/Recipe

replies(6): >>44438799 #>>44439573 #>>44440529 #>>44440626 #>>44440664 #>>44440708 #
2. ◴[] No.44438799[source]
3. ◴[] No.44439573[source]
4. soap- ◴[] No.44440529[source]
And that would be great, if anyone used it.

LLMs are specifically good at a task like this because they can extract content from any webpage, regardless of it supports whatever standard that no one implements

5. komali2 ◴[] No.44440626[source]
That's a cool schema, but the LLM solution is necessary because recipe website makers will never use the schema because they want you to have to read through garbage, with some misguided belief that this helps their SEO or something. Or maybe they get more money if you scroll through more ads?
replies(2): >>44440719 #>>44442528 #
6. VMG ◴[] No.44440664[source]
The LLM thing actually works. Who cares if it's deterministic. Maybe the same people who come up with arcane schemas that nobody ever uses?
7. bubblyworld ◴[] No.44440708[source]
Can we stop with the unprovoked dissing of anyone using LLMs for anything? Or at least start your own thread for it. It's an unpleasant, incredibly boring/predictable standard for discourse (more so than the LLMs themselves lol).
replies(2): >>44440863 #>>44442494 #
8. bubblyworld ◴[] No.44440719[source]
I'm genuinely a bit confused by the recipe blog business model. Like there's got to be one, right? People don't usually spew the same story about their grandma hundreds of times on a real blog.

Just hitting keywords for search? Many of them don't even have ads so I feel like that can't be it. Maybe referrals?

replies(1): >>44440918 #
9. alt187 ◴[] No.44440863[source]
It's in fact very provoked. The LLM just changes the instructions of the recipe and creates new ones. That's an unpleasant standard of user experience.
replies(1): >>44441952 #
10. Revisional_Sin ◴[] No.44440918{3}[source]
SEO. Longer articles get ranked higher.
replies(1): >>44441956 #
11. bubblyworld ◴[] No.44441952{3}[source]
That is a terrible reason to be a dick to someone. Especially someone who has created free software that you have no obligation to use.
12. bubblyworld ◴[] No.44441956{4}[source]
Makes sense, thanks, but how do you actually make money from that without tons of ads? I realise this is a super naive question haha
replies(2): >>44442193 #>>44442355 #
13. gpm ◴[] No.44442355{5}[source]
> without tons of ads

This is a requirement? I literally only browse the web with an ad blocker but I always assumed those sites had tons of ads.

replies(1): >>44443725 #
14. throwaway290 ◴[] No.44442494[source]
When they stop training these LLM on stolen content?
replies(1): >>44442789 #
15. throwaway290 ◴[] No.44442528[source]
How do they make money then? Or do you think they are doing some sort of public service and you are entitled?
replies(1): >>44445346 #
16. bubblyworld ◴[] No.44442789{3}[source]
Then go join a thread where someone is actually talking about that issue, which is very valid, and make a meaningful contribution to the conversation. Jumping on people for no reason other than they have touched a language model is just rude.
replies(1): >>44442964 #
17. throwaway290 ◴[] No.44442964{4}[source]
No you go. It is allowed to diss a thing in the thread about that thing if we think that thing is bad. People here dissed Dropbox when it was launched... and this is no Dropbox.

There are things that are not allowed. But here someone made a good point without any personal attacks. You silencing people is probably the least appropriate thing in this thread.

> make a meaningful contribution

What was not a meaningful contribution, mentioning the relevant schema? Saying using the LLM is bad for example because it is trained on our content without permission and payment? or that this steals from people who provide you content for free and make money from ads?

replies(1): >>44443040 #
18. bubblyworld ◴[] No.44443040{5}[source]
I'm not interested in continuing this conversation.
replies(1): >>44443092 #
19. throwaway290 ◴[] No.44443092{6}[source]
I wish you didn't start it... I see it's you who posted the top level comment but it doesn't mean you "own" the thread.
20. bubblyworld ◴[] No.44443725{6}[source]
Lol, that's funny - good point, I completely forgot I had an ad blocker running 24/7. I don't think I've browsed the raw internet in more than a decade...
21. anExcitedBeast ◴[] No.44445346{3}[source]
They make long articles to maximize ad exposure and SEO. It's good faith --they're doing what they have to to make money with the underlying tech ecosystem-- but it's not a good outcome.

LLMs are shifting that ecosystem (at least temporarily) and new revenue models will emerge. It'll take time to figure out. But we shouldn't artificially support a bad system just because it's the existing system.

Transitions are always awkward. In the meantime, I'm inclined to give people rope to experiment.

replies(1): >>44451552 #
22. throwaway290 ◴[] No.44451552{4}[source]
It seems that your answer is a long euphemism for "I don't give a damn how they make money, they better find a new way I guess".