←back to thread

DeepSeek OCR

(github.com)
990 points pierre | 8 comments | | HN request time: 0.447s | source | bottom
1. simonw ◴[] No.45646783[source]
I figured out how to get this running on the NVIDIA Spark (ARM64, which makes PyTorch a little bit trickier than usual) by running Claude Code as root in a new Docker container and having it figure it out. Notes here: https://simonwillison.net/2025/Oct/20/deepseek-ocr-claude-co...

Here's a result I got https://github.com/simonw/research/blob/main/deepseek-ocr-nv... - against this image: https://static.simonwillison.net/static/2025/ft.jpeg

replies(4): >>45647059 #>>45647109 #>>45649327 #>>45649447 #
2. jjcm ◴[] No.45647059[source]
Looks like this did really solid, with the exception of the paragraph directly below the quote. It hallucinated some filler there and bridge it with the next column.

Thanks for running the test quickly!

replies(1): >>45649140 #
3. throwaway314155 ◴[] No.45647109[source]
> by running Claude Code as root in a new Docker container

How do you get the "as root" part of that to work?

(sorry if it's explained in your article)

replies(1): >>45647538 #
4. simonw ◴[] No.45647538[source]
Run it on a root account and do:

  IS_SANDBOX=1 claude --dangerously-skip-permissions
replies(1): >>45650580 #
5. djmips ◴[] No.45649140[source]
By my eye it just bridge. I didn't see any filler. It went from "Code is a language" - above the quote and then to "in a garden by name." which was the top of the next column but missing the chicken subject.
6. arkmm ◴[] No.45649327[source]
Wow, this deserves its own submission.
7. CaptainOfCoit ◴[] No.45649447[source]
It missed the initial "A" in the text which I sort of understand, seems not a lot of news articles were put in the dataset. But more interestingly, it missed the entire "Hallucination is a risk and...", the article "theme" next to the author name also the final email.
8. throwaway314155 ◴[] No.45650580{3}[source]
Thanks!!