
439 points david927 | 1 comments

What are you working on? Any new ideas which you're thinking about?
dataviz1000 ◴[] No.44416926[source]
I built an IPC/RPC shim for a Chrome extension so I can send strongly-typed messages between isolated JS contexts that otherwise expose wildly inconsistent messaging APIs.

I discovered that VSCode has a very nice solution, so I pulled the core VSCode libraries into a Chrome extension. Their dependency injection, IPC/RPC, and eventing bridge the gap between all of these isolated JS contexts and expose a single, strongly-typed messaging API. My IPC/RPC shim sits on top of each native environment and its communication mechanism.
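To make "a single, strongly-typed messaging API" concrete, here is a minimal sketch of the general idea. It is not the VSCode IPC code or the code in the extension: the `Messages` catalogue, `Transport`, `createLoopbackTransport`, and `TypedBus` are all hypothetical names, and the loopback transport delivers synchronously, whereas real mechanisms such as ports or `postMessage` are asynchronous.

```typescript
// Hypothetical message catalogue: each message type maps to its payload shape.
interface Messages {
  "tab/opened": { tabId: number; url: string };
  "video/play": { selector: string };
}

// Wire format shared by every context.
interface Envelope {
  type: keyof Messages;
  payload: Messages[keyof Messages];
}

// The smallest surface a native messaging mechanism would need to provide.
interface Transport {
  post(message: Envelope): void;
  subscribe(listener: (message: Envelope) => void): void;
}

// In-memory stand-in for a real transport (assumption: synchronous delivery).
function createLoopbackTransport(): Transport {
  const listeners: Array<(message: Envelope) => void> = [];
  return {
    post: (message) => listeners.forEach((listener) => listener(message)),
    subscribe: (listener) => {
      listeners.push(listener);
    },
  };
}

class TypedBus {
  private transport: Transport;
  private handlers = new Map<string, Array<(payload: unknown) => void>>();

  constructor(transport: Transport) {
    this.transport = transport;
    // Route every incoming envelope to the handlers registered for its type.
    transport.subscribe((message) => {
      for (const handler of this.handlers.get(message.type) ?? []) {
        handler(message.payload);
      }
    });
  }

  // The compiler rejects a payload that does not match the message type.
  send<K extends keyof Messages>(type: K, payload: Messages[K]): void {
    this.transport.post({ type, payload });
  }

  // Handlers receive a payload already narrowed to the right shape.
  on<K extends keyof Messages>(type: K, handler: (payload: Messages[K]) => void): void {
    const list = this.handlers.get(type) ?? [];
    list.push(handler as (payload: unknown) => void);
    this.handlers.set(type, list);
  }
}
```

Under these assumptions, each context (content script, service worker, side panel) would wrap its own native mechanism in a `Transport`, and everything above that layer only ever sees `send` and `on`.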

Yesterday, Microsoft released the source code for the Copilot chat. Since my Chrome extension is built on the same core libraries, I can apparently drop the VSCode chat UI into the side panel without much friction, although I might keep using Microsoft's FluentUI chat that the extension currently implements.

Because Copilot chat has a lot of code that runs in Node inside Electron, I'm now working on porting all of the agent capabilities from the Copilot chat for browser automation, including the code for intent, prompt creation, tools, disambiguation, chunking, embedding, etc. I'm 4 to 6 weeks away from feature parity with Playwright for automation from a Chrome extension side panel that can do most of the inference locally using Hugging Face Transformers.js. Heuristics are exposed as tools too: if the intent is to play a video, all that is required is a tool that collects all the video tags and related elements with metadata. No need to spend $10 in tokens to figure out which video element to play.
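A heuristic tool of that kind could look roughly like the sketch below. It is only an illustration, not the extension's code: `VideoLike`, `VideoCandidate`, `describeVideos`, and `pickVideoToPlay` are hypothetical names, and "prefer the largest visible video" is an assumed heuristic. `VideoLike` lists only standard `HTMLVideoElement` properties, so in a content script the input could be `document.querySelectorAll("video")`.

```typescript
// Structural subset of HTMLVideoElement that the tool reads.
interface VideoLike {
  currentSrc: string;
  duration: number; // NaN until the media metadata has loaded
  paused: boolean;
  muted: boolean;
  clientWidth: number;
  clientHeight: number;
}

// Plain, serializable metadata that can be handed to the agent as tool output.
interface VideoCandidate {
  index: number;
  src: string;
  durationSeconds: number | null;
  paused: boolean;
  muted: boolean;
  area: number;
}

// Collect metadata for every video element without calling a model.
function describeVideos(videos: ArrayLike<VideoLike>): VideoCandidate[] {
  return Array.from(videos, (video, index) => ({
    index,
    src: video.currentSrc,
    durationSeconds: Number.isFinite(video.duration) ? video.duration : null,
    paused: video.paused,
    muted: video.muted,
    area: video.clientWidth * video.clientHeight,
  }));
}

// Assumed heuristic: ignore zero-sized elements and prefer the largest one.
function pickVideoToPlay(candidates: VideoCandidate[]): VideoCandidate | undefined {
  return candidates
    .filter((candidate) => candidate.area > 0)
    .sort((a, b) => b.area - a.area)[0];
}
```

The agent then only needs to choose among a handful of structured candidates, or skip the model entirely when the heuristic already yields a single obvious match.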

Yeah, I think I'm 4 to 6 weeks away from having a Copilot chat in a browser doing agent automation.

If you want to see where I'm at today, https://github.com/adam-s/doomberg-terminal.

replies(1): >>44418311 #
kordlessagain ◴[] No.44418311[source]
> AI-Powered News Intelligence

When I did Grub the crawler back in the day, that's what I was shooting for!

If you want a jumpstart on the Playwright stuff: https://github.com/kordless/gnosis-wraith. Runs on Google Cloud Run. The UI is still in progress but you can test it here: https://wraith.nuts.services. Login uses tokens sent by email.

The extension stuff is the way to go, IMHO! You can capture any page, even automatically.

replies(1): >>44419666 #
1. dataviz1000 ◴[] No.44419666[source]
That is awesome! Thank you for sharing!