Show HN: Spegel, a Terminal Browser That Uses LLMs to Rewrite Webpages

1. insane_dreamer ◴[01 Jul 25 14:44 UTC] No.44434388[source]▶

Interesting, but why round-trip through an LLM just to convert HTML to Markdown?

2. markstos ◴[01 Jul 25 14:51 UTC] No.44434463[source]▶

Because the modern web isn't reliably HTML, it's "web apps" with heavy use of JavaScript and API calls. To first display the HTML that you see in your browser, you need a user agent that runs JavaScript and makes all the backend calls that Chrome would make to put together some HTML.

Some websites may still return some static upfront that could be usefully understood without JavaScript processing, but a lot don't.

That's not to say you need an LLM, there are projects like Puppeteer that are like headless browsers that can return the rendered HTML, which can then be sent through an HTML to Markdown filter. That would be less computationally intensive.

replies(1): >>44435180 #

3. insane_dreamer ◴[01 Jul 25 15:54 UTC] No.44435180[source]▶

>>44434463 #

> That's not to say you need an LLM, ... then be sent through an HTML to Markdown filter. That would be less computationally intensive.

which was exactly my point

4. crent ◴[01 Jul 25 16:02 UTC] No.44435272[source]▶

>>44434388 (TP) #

Because this isn't just converting HTML to markdown. I'd recommend taking another look at the website and particularly read the recipe example as it demonstrates the goal of the project pretty well.