PDF to Text, a challenging problem
(www.marginalia.nu)
357 points
ingve
| 1 comment |
13 May 25 15:01 UTC
1.
constantinum
13 May 25 17:45 UTC
No.
43975633
>>43973721 (OP)
PDF parsing is hell indeed, with all sorts of edge cases that break business workflows. More on that here:
https://unstract.com/blog/pdf-hell-and-practical-rag-applica...