Show HN: Autotab – Programmable AI browser for turning web tasks into APIs

1. pacifi30 ◴[20 Nov 24 22:43 UTC] No.42198908[source]▶

>>42197741 (OP) #

Pretty slick. I recorded a session for ordering from a restaurant website, and it did repeat the entire workflow. It had some issues when a modal popped up, but all in all well done! We have been trying to robotify the task of ordering from restaurants for our clients, and it seems like your solution could work well for us. I am guessing that you want your users to use the Autotab browser, so what is the API used for?

replies(2): >>42198946 #>>42198960 #

2. jonasnelle ◴[20 Nov 24 22:49 UTC] No.42198946[source]▶

>>42198908 (TP) #

Thanks! We think of the browser as an authoring tool where you create, test and refine skills.

After you've done that, the API is great for cases where you want to incorporate Autotab into a larger data flow or product.

For instance, say Company A has taught Autotab to migrate their customers' data - so their customers just see a sync button in the Company A product, which kicks off an Autotab run via API. Same for restaurant booking, if you'd want that to happen programmatically.
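To make the "sync button kicks off a run via API" idea concrete, here is a minimal sketch of what the backend behind such a button might build. The endpoint URL, the `inputs` field, and the bearer-token header are all assumptions for illustration, not Autotab's actual API; the request is only constructed, never sent.

```python
import json
import urllib.request

# Hypothetical endpoint and payload shape, for illustration only.
# The real paths and field names are whatever the vendor's API reference says.
RUN_URL = "https://api.example.com/v1/skills/{skill_id}/runs"


def build_run_request(skill_id: str, inputs: dict, api_key: str) -> urllib.request.Request:
    """Build (but do not send) a POST request that would start a skill run."""
    body = json.dumps({"inputs": inputs}).encode("utf-8")
    return urllib.request.Request(
        RUN_URL.format(skill_id=skill_id),
        data=body,
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


# What a "sync" button handler might prepare for one customer.
req = build_run_request("sync-customer-data", {"customer_id": "c_123"}, "test-key")
```

Sending it would be a matter of passing `req` to `urllib.request.urlopen`, once the URL and payload match the real API.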

replies(2): >>42199080 #>>42199368 #

3. jonasnelle ◴[20 Nov 24 22:50 UTC] No.42198960[source]▶

>>42198908 (TP) #

Also for the modal popup - this is the kind of issue that goes away in run mode because Autotab will escalate to bigger models to self-heal.

If the modal pops up frequently, you can also record a click to dismiss it and make that click optional, so Autotab knows to move on when the modal does not appear.
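The optional-click idea can be sketched as a tiny step runner. This is not how Autotab is implemented; `Step`, `run_steps`, and the convention that an action returns `True` when its target was found are all made up for illustration.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Step:
    name: str
    action: Callable[[], bool]  # returns True if the target was found and acted on
    optional: bool = False


def run_steps(steps: List[Step]) -> List[str]:
    """Run steps in order; skip optional steps whose target is missing."""
    log = []
    for step in steps:
        if step.action():
            log.append(f"{step.name}: done")
        elif step.optional:
            # Target (e.g. the modal) did not appear, so move on.
            log.append(f"{step.name}: skipped")
        else:
            raise RuntimeError(f"required step failed: {step.name}")
    return log


# Simulate a run where the modal does not appear.
steps = [
    Step("dismiss modal", lambda: False, optional=True),
    Step("click order", lambda: True),
]
log = run_steps(steps)
```

A required step that fails still raises, which is where something like escalation or self-healing would take over in a real system.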

4. pacifi30 ◴[20 Nov 24 23:06 UTC] No.42199080[source]▶

>>42198946 #

Understood! How does it work if we have several different restaurants to order from? Do I need to record each ordering session and create skills for each restaurant, or can it infer the steps on its own given the task of ordering from a restaurant? Secondly, are there any docs or samples showing how to integrate this with your API?

replies(1): >>42199147 #

5. jonasnelle ◴[20 Nov 24 23:17 UTC] No.42199147{3}[source]▶

>>42199080 #

Depends on how different the flows are for different restaurants. If they're just different names but use the same booking system, you'd typically use an input and have Autotab find the correct restaurant first. If they're totally different booking systems, you can try the instruct (open-ended agentic) step, but my guess is that will be too slow and unreliable for now, so you'd probably want to record different skills for each.

Docs are here with sample code: https://docs.autotab.com/api-reference
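On the caller's side, the split between "same booking system, different restaurant" and "different booking system" might look like the sketch below: one recorded skill per booking system, with the restaurant name passed as an input. The skill names, the `booking_system` labels, and the returned dict shape are hypothetical.

```python
from typing import Dict, List, NamedTuple


class Restaurant(NamedTuple):
    name: str
    booking_system: str


# One recorded skill per booking system (hypothetical names).
SKILL_BY_SYSTEM: Dict[str, str] = {
    "system_a": "order_on_system_a",
    "system_b": "order_on_system_b",
}


def plan_run(restaurant: Restaurant, items: List[str]) -> dict:
    """Pick the skill for the restaurant's booking system and fill in its inputs."""
    skill = SKILL_BY_SYSTEM.get(restaurant.booking_system)
    if skill is None:
        # No recording exists for this system yet; a new skill is needed.
        raise KeyError(f"no recorded skill for {restaurant.booking_system!r}")
    return {"skill": skill, "inputs": {"restaurant_name": restaurant.name, "items": items}}


# Two restaurants on the same system share one skill and differ only in inputs.
plan_1 = plan_run(Restaurant("Luigi's", "system_a"), ["margherita"])
plan_2 = plan_run(Restaurant("Noodle Bar", "system_a"), ["ramen"])
```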

6. handfuloflight ◴[20 Nov 24 23:43 UTC] No.42199368[source]▶

>>42198946 #

Is the API also charged based on runtime? And I'm assuming that workflow happens in the cloud? What if it's behind a login? What if that login requires 2FA?

replies(1): >>42199541 #

7. alexirobbins ◴[21 Nov 24 00:05 UTC] No.42199541{3}[source]▶

>>42199368 #

Yep exactly. Authentication is primarily handled with session data, so passwords never leave your device, but we also support setting secrets.

Here is more info on auth and security: https://docs.autotab.com/manual/security

For 2FA, different users take different approaches: everything from teaching Autotab to pull auth codes from their email, to setting intervention requests at the top of their skills, to enterprise integrations that we support with SSO and dedicated machine accounts.
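For the "pull auth codes from email" approach, the parsing half is simple if you do it yourself. A rough sketch, assuming the code is a standalone 6-digit number in a plain-text email body; fetching the email is out of scope here, and `extract_code` is an illustrative helper, not part of any Autotab API.

```python
import re
from typing import Optional

# Exactly six digits, not embedded in a longer run of digits.
CODE_PATTERN = re.compile(r"(?<!\d)(\d{6})(?!\d)")


def extract_code(email_body: str) -> Optional[str]:
    """Return the first standalone 6-digit code in the text, or None."""
    match = CODE_PATTERN.search(email_body)
    return match.group(1) if match else None


code = extract_code("Your verification code is 482913. It expires in 10 minutes.")
```

The lookarounds keep longer numbers such as order IDs from being mistaken for a code.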