←back to thread

422 points simedw | 5 comments | | HN request time: 0.531s | source
1. deepdarkforest ◴[] No.44434122[source]
The main problem with these approaches is that most sites now are useless without JS or having access to the accessibility tree. Projects like browser-use or other DOM based approaches at least see the DOM(and screenshots).

I wonder if you could turn this into a chrome extension that at least filters and parses the DOM

replies(2): >>44434185 #>>44434192 #
2. jadbox ◴[] No.44434185[source]
I actually made a CLI tool recently that uses Puppeteer to render the page including JS, summarizes key info and actions, and enables simple form filling all from a CLI menu. I built it for my own use-cases (checking and paying power bills from CLI), but I'd love to get feedback on the core concept: https://github.com/jadbox/solomonagent
replies(1): >>44435329 #
3. willsmith72 ◴[] No.44434192[source]
True for stuff requiring interaction, but to help their LCP/SEO lots of sites these days render plain html first. It's not "usable" but for viewing it's pretty good
4. andoando ◴[] No.44435329[source]
Dude I love this. I've been thinking of doing this exactly this, but for as a screen reader for accessibility reasons.
replies(1): >>44437447 #
5. jadbox ◴[] No.44437447{3}[source]
Thanks, it's alpha at the moment- next feature is complex forms and bug fixing broken actions (downloading). Do give it a spin! Welcome to contribute or drop feedback on the repo :)