working on cursor for desktop. why rely on AI agent that’s self-contained when it’s limited, can’t access the browser, can’t open apps or click around.
i simply want mine to be able to fill in forms in preview with a passport image as context. also to be able to do recurring tasks as if i was the desktop user. e.g., i’m going to bed keep working on this spreadsheet.
it’s working and built but very slow and buggy atm. uses multimodal LLMS and OCR but lots more optimizations needed. need to make it a lot faster. can demo it and need help if anyone is interested.