←back to thread

357 points ingve | 3 comments | | HN request time: 0.66s | source
1. EmilStenstrom ◴[] No.43974754[source]
I think using Gemma3 in vision mode could be a good use-case for converting PDF to text. It’s downloadable and runnable on a local computer, with decent memory requirements depending on which size you pick. Did anyone try it?
replies(2): >>43975132 #>>43976271 #
2. CaptainFever ◴[] No.43975132[source]
Kind of unrelated, but Gemma 3's weights are unfree, so perhaps LLaVA (https://ollama.com/library/llava) would be a good alternative.
3. ljlolel ◴[] No.43976271[source]
Mistral OCR has the best in class document understanding. https://mistral.ai/news/mistral-ocr