422 points simedw | 2 comments
deepdarkforest ◴[] No.44434122[source]
The main problem with these approaches is that most sites are now useless without JS or access to the accessibility tree. Projects like browser-use and other DOM-based approaches at least see the DOM (and screenshots).

I wonder if you could turn this into a Chrome extension that at least filters and parses the DOM.

replies(2): >>44434185 #>>44434192 #
jadbox ◴[] No.44434185[source]
I actually made a CLI tool recently that uses Puppeteer to render the page, including JS, summarizes key info and actions, and enables simple form filling, all from a CLI menu. I built it for my own use cases (checking and paying power bills from the CLI), but I'd love to get feedback on the core concept: https://github.com/jadbox/solomonagent
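
For readers wondering what "summarizes key info and actions" might look like, here is a rough sketch of one possible approach. It is not taken from solomonagent. It assumes the page has already been rendered to an HTML string (for example by a headless browser), and the names `ActionExtractor` and `summarize_actions` are hypothetical. It uses only Python's standard `html.parser`, so it cannot execute JS itself.

```python
from html.parser import HTMLParser


class ActionExtractor(HTMLParser):
    """Collects links, buttons, and form fields from already-rendered HTML.

    Hypothetical helper for illustration; not part of any real tool.
    """

    def __init__(self):
        super().__init__()
        self.actions = []   # list of (kind, label) tuples, in document order
        self._open = None   # (tag, href, text_parts) for the <a>/<button> being read

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag in ("a", "button"):
            # Start buffering the visible text of this link or button.
            self._open = (tag, attrs.get("href"), [])
        elif tag in ("input", "select", "textarea"):
            # Hidden inputs are not something a user can act on.
            if attrs.get("type") != "hidden":
                label = attrs.get("name") or attrs.get("id") or ""
                self.actions.append(("field", label))

    def handle_data(self, data):
        if self._open is not None:
            self._open[2].append(data)

    def handle_endtag(self, tag):
        if self._open is None or tag != self._open[0]:
            return
        _, href, parts = self._open
        text = " ".join("".join(parts).split())  # collapse whitespace
        if tag == "a":
            label = f"{text} -> {href}" if href else text
            self.actions.append(("link", label))
        else:
            self.actions.append(("button", text))
        self._open = None


def summarize_actions(html):
    """Return the interactive elements found in an HTML string."""
    parser = ActionExtractor()
    parser.feed(html)
    parser.close()
    return parser.actions
```

A CLI menu could then number the returned tuples and let the user pick one. Nested interactive elements and ARIA roles are ignored here, so a real tool would likely need the accessibility tree instead.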
replies(1): >>44435329 #
1. andoando ◴[] No.44435329[source]
Dude, I love this. I've been thinking of doing exactly this, but as a screen reader for accessibility reasons.
replies(1): >>44437447 #
2. jadbox ◴[] No.44437447[source]
Thanks, it's alpha at the moment. The next features are complex forms and fixing broken actions (downloading). Do give it a spin! You're welcome to contribute or drop feedback on the repo :)