←back to thread

DeepSeek OCR

(github.com)
990 points pierre | 2 comments | | HN request time: 0.014s | source
Show context
simonw ◴[] No.45646783[source]
I figured out how to get this running on the NVIDIA Spark (ARM64, which makes PyTorch a little bit trickier than usual) by running Claude Code as root in a new Docker container and having it figure it out. Notes here: https://simonwillison.net/2025/Oct/20/deepseek-ocr-claude-co...

Here's a result I got https://github.com/simonw/research/blob/main/deepseek-ocr-nv... - against this image: https://static.simonwillison.net/static/2025/ft.jpeg

replies(4): >>45647059 #>>45647109 #>>45649327 #>>45649447 #
1. jjcm ◴[] No.45647059[source]
Looks like this did really solid, with the exception of the paragraph directly below the quote. It hallucinated some filler there and bridge it with the next column.

Thanks for running the test quickly!

replies(1): >>45649140 #
2. djmips ◴[] No.45649140[source]
By my eye it just bridge. I didn't see any filler. It went from "Code is a language" - above the quote and then to "in a garden by name." which was the top of the next column but missing the chicken subject.