
422 points | simedw | 1 comment
bubblyworld ◴[] No.44433602[source]
Classic that the first example is for parsing the goddamn recipe from the goddamn recipe site. Instant thumbs up from me haha, looks like a neat little project.
replies(3): >>44435722 #>>44436466 #>>44438277 #
andrepd ◴[] No.44435722[source]
Which it apparently does by completely changing the recipe in random places, including ingredients and the amounts thereof. It is _indeed_ a very good microcosm of what LLMs are, just not in the way these comments think.
replies(3): >>44435998 #>>44436175 #>>44436268 #
simedw ◴[] No.44436175[source]
It was actually a bit worse than that: the LLM never got the full recipe due to some truncation logic I had added. So it regurgitated the recipe from training, and apparently it couldn't do both that and convert units at the same time with the lite model (it worked with flash).

I should have caught that, and there are probably other bugs waiting to be found, too. That said, it's still a great recipe.
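
For the curious, the failure mode was roughly this: the page text was cut to a fixed character budget before being handed to the model, and since recipe pages tend to put the ingredients and steps well below the intro prose, a head-of-page cut drops exactly that part. A minimal sketch of the idea (not the actual code; MAX_CHARS and the prompt wording are made up for illustration):

    # Simplified sketch of the truncation bug, not the real implementation.
    MAX_CHARS = 4000  # hypothetical budget for the smaller model

    def build_prompt(page_text: str) -> str:
        # Naive truncation keeps only the start of the page; the recipe body,
        # which usually comes after headers, nav and intro text, gets dropped,
        # so the model falls back on recipes it memorised during training.
        truncated = page_text[:MAX_CHARS]
        return f"Convert this page into a clean recipe view:\n\n{truncated}"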

replies(1): >>44437152 #
andrepd[dead post] ◴[] No.44437152[source]
[flagged]
0x696C6961 ◴[] No.44438859[source]
What is the point?
replies(2): >>44439718 #>>44444323 #
plonq ◴[] No.44439718[source]
I’m someone else, but for me the point is that a serious bug resulted in _incorrect data_, making it impossible to trust the output.
replies(1): >>44440736 #
bubblyworld ◴[] No.44440736[source]
Assuming you are responding in good faith - the author politely acknowledged the bug (despite the snark in the comment they responded to), explained what happened, and fixed it. I'm not sure what more I could expect here. Bugs are inevitable; I think it's how they are handled that drives trust for me.