
DeepSeek OCR

(github.com)
990 points by pierre | 2 comments
1. singularity2001 No.45641792
Instead of downloading a specific OCR model, how would one fare by just downloading the best current multi-modal foundation model? And which one would that be at under 30 GB?
replies(1): >>45648640
2. prats226 No.45648640
Then you can just download a fine-tuned version of the same multi-modal foundation model that's trained on documents?