
1303 points serjester | 1 comments
nickandbro ◴[] No.42953976[source]
I think very soon a new model will destroy whatever startups and services are built around document ingestion: a model that can take in a PDF page as an image and transcribe it to text with near-perfect accuracy.
replies(2): >>42954513 #>>42955074 #
depr ◴[] No.42954513[source]
I think Azure Document Intelligence, Google Document AI, and Amazon Textract are among the best services, if not the best, though, and they already offer these models.
replies(1): >>42959514 #
1. nnurmanov ◴[] No.42959514[source]
I have not tested Azure Document Intelligence or Google Document AI, but AWS Textract, LlamaParse, Unstructured, and Omni made it onto my shortlist. I have not tested Docling either, as I could not install it on my Windows laptop.