←back to thread

DeepSeek OCR

(github.com)
990 points pierre | 1 comments | | HN request time: 0s | source
Show context
singularity2001 ◴[] No.45641792[source]
Instead of downloading a specific OCR model how would one fare just downloading the currently best multi-modal foundation model? And what would that be at less than 30 GB?
replies(1): >>45648640 #
1. prats226 ◴[] No.45648640[source]
Then you can just download finetuned version of same multi-modal foundation model that's trained on documents?