
357 points ingve | 1 comment
andrethegiant No.43974436
Cloudflare’s ai.toMarkdown() function, available in Workers AI, can handle PDFs pretty easily. Judging from speed alone, it seems they’re parsing the actual content rather than shoving it into OCR or an LLM.

Shameless plug: I use this under the hood when you prefix any PDF URL with https://pure.md/ to convert to raw text.
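
A minimal sketch of the URL prefixing described above, assuming the PDF URL is simply appended to https://pure.md/ with its scheme intact. The helper names `toPureMdUrl` and `fetchPdfAsText` are hypothetical, and the response format is assumed to be plain text as the comment suggests:

```typescript
// Prefix taken from the comment above; everything else here is illustrative.
const PURE_MD_PREFIX = "https://pure.md/";

// Hypothetical helper: builds the prefixed URL by plain concatenation.
function toPureMdUrl(pdfUrl: string): string {
  // Throws a TypeError if pdfUrl is not an absolute URL.
  new URL(pdfUrl);
  return PURE_MD_PREFIX + pdfUrl;
}

// Hypothetical helper: fetches the converted text (assumes a global fetch).
async function fetchPdfAsText(pdfUrl: string): Promise<string> {
  const response = await fetch(toPureMdUrl(pdfUrl));
  if (!response.ok) {
    throw new Error(`pure.md request failed: ${response.status}`);
  }
  return response.text();
}
```

Usage would look like `await fetchPdfAsText("https://example.com/paper.pdf")`, where the example.com URL is a placeholder.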

replies(4): >>43974514 >>43974535 >>43974602 >>43975027
1. bambax No.43975027
It doesn't seem to handle multi-column PDFs well?