←back to thread

422 points simedw | 4 comments | | HN request time: 1.19s | source
Show context
bubblyworld ◴[] No.44433602[source]
Classic that the first example is for parsing the goddamn recipe from the goddamn recipe site. Instant thumbs up from me haha, looks like a neat little project.
replies(3): >>44435722 #>>44436466 #>>44438277 #
lpribis ◴[] No.44438277[source]
Another great example of LLM hype train re-inventing something that already existed [1] (and was actually thought out) but making it worse and non-deterministic in the worst ways possible.

https://schema.org/Recipe

replies(6): >>44438799 #>>44439573 #>>44440529 #>>44440626 #>>44440664 #>>44440708 #
komali2 ◴[] No.44440626[source]
That's a cool schema, but the LLM solution is necessary because recipe website makers will never use the schema because they want you to have to read through garbage, with some misguided belief that this helps their SEO or something. Or maybe they get more money if you scroll through more ads?
replies(2): >>44440719 #>>44442528 #
bubblyworld ◴[] No.44440719[source]
I'm genuinely a bit confused by the recipe blog business model. Like there's got to be one, right? People don't usually spew the same story about their grandma hundreds of times on a real blog.

Just hitting keywords for search? Many of them don't even have ads so I feel like that can't be it. Maybe referrals?

replies(1): >>44440918 #
1. Revisional_Sin ◴[] No.44440918[source]
SEO. Longer articles get ranked higher.
replies(1): >>44441956 #
2. bubblyworld ◴[] No.44441956[source]
Makes sense, thanks, but how do you actually make money from that without tons of ads? I realise this is a super naive question haha
replies(2): >>44442193 #>>44442355 #
3. gpm ◴[] No.44442355[source]
> without tons of ads

This is a requirement? I literally only browse the web with an ad blocker but I always assumed those sites had tons of ads.

replies(1): >>44443725 #
4. bubblyworld ◴[] No.44443725{3}[source]
Lol, that's funny - good point, I completely forgot I had an ad blocker running 24/7. I don't think I've browsed the raw internet in more than a decade...