←back to thread

Claude in Chrome

(claude.com)
278 points ianrahman | 7 comments | | HN request time: 0.403s | source | bottom
1. prescriptivist ◴[] No.46341919[source]
I used this in earnest yesterday on my Zillow saved listings. I prompted it to analyze the listings (I've got about 70 or so saved) and summarize the most recent price drops for each one and it mostly failed at the task. It gave the impression that it paginated through all the listings, but I don't think it actually did. I think the mechanism by which it works, which is to click links and take screenshots and analyze them must be some kind of token efficiency trade-off (as opposed to consuming the DOM) and it seems not great at the task.

As a reformed AI skeptic I see the promise in a tool like this, but this is light years behind other Anthropic products in terms of efficacy. Will be interesting to see how it plays out though.

replies(5): >>46342156 #>>46342529 #>>46343346 #>>46343689 #>>46346706 #
2. jetbalsa ◴[] No.46342156[source]
would be interesting to see if this works in playwright using your existing browser's remote control APIs (Using claude code via the playwright mcp)
replies(1): >>46342309 #
3. baby_souffle ◴[] No.46342309[source]
I've had extensive luck doing just that. Spend some time doing the initial work to see how the page works and then give the llm examples of the HTML that should be clicked for next page or the css classes that indicate the details you're after and then ask for a playwright to yaml tool.

Been doing this for a few months now to keep an eye on the prices for local grocery stores. I had to introduce random jitter so Ali Express wouldn't block me from trying to dump my decade+ of order history.

4. fouc ◴[] No.46342529[source]
sometimes I find that it helps if my prompt directly names the tools that I want the LLM to use, i.e. I'll tell it "do a WebFetch of so and so" etc.
5. csomar ◴[] No.46343346[source]
LLMs struggle with time (or don't really have a concept with time). So unless that is addressed, they'll always suck in these tasks as you need synchronization. This is why text/cli was a much better UX to work with. std in/out is the best way to go but someone has to release something to keep pumping numbers.
6. jstummbillig ◴[] No.46343689[source]
> light years behind

So... give it another 3 month? (I assume we are talking AI light years)

7. jazzyjackson ◴[] No.46346706[source]
What an asinine strategy to feed screenshots (does it scroll down and render the whole page?)

I had good luck treating HTML as XML and having Claude write xpath queries to grab useful data without ingesting the whole damn DOM