/top/
/new/
/best/
/ask/
/show/
/job/
^
slacker news
login
about
←back to thread
DeepSeek OCR
(github.com)
990 points
pierre
| 1 comments |
20 Oct 25 06:26 UTC
|
HN request time: 0s
|
source
Show context
yoran
◴[
20 Oct 25 07:22 UTC
]
No.
45640836
[source]
▶
>>45640594 (OP)
#
How does an LLM approach to OCR compare to say Azure AI Document Intelligence (
https://learn.microsoft.com/en-us/azure/ai-services/document...
) or Google's Vision API (
https://cloud.google.com/vision?hl=en
)?
replies(7):
>>45640943
#
>>45640992
#
>>45642214
#
>>45643557
#
>>45644126
#
>>45647313
#
>>45667751
#
1.
make3
◴[
20 Oct 25 13:17 UTC
]
No.
45643557
[source]
▶
>>45640836
#
aren't all of these multimodal LLM approaches, just open vs closed ones
ID:
GO
↑