←back to thread

DeepSeek OCR

(github.com)
990 points pierre | 1 comments | | HN request time: 0.236s | source
1. allanren ◴[] No.45645346[source]
It says the conversation can reduce size with large compression, which basic make the image blur but still contaim import information.

This is indeed amazing. It's actually how human try to understand and remember things. BY VISUAL! And when memory fade out, the image are getting blurred.

Not sure if those close source multimodal models are already using this method.