←back to thread

DeepSeek OCR

(github.com)
990 points pierre | 1 comments | | HN request time: 0.2s | source
Show context
piker ◴[] No.45640676[source]
This looks really cool for prototyping and playing around.

It seems to me though if one is building a modern application that needs to get image segmentation and/or text recognition right there are better APIs available than natural language? It seems like a lot of effort to make a production-scale CV application to weigh it down with all of an LLM’s shortcomings. Not a field I’m familiar with but I would assume that this doesn’t produce state of the art results—that would change the analysis.

replies(2): >>45640692 #>>45642239 #
1. CheeseFromLidl ◴[] No.45642239[source]
As a hobby photographer, I organise everything for speedy retrieval but this would be amazing to search my collection.