←back to thread

1303 points serjester | 1 comments | | HN request time: 0.359s | source
1. kym6464 ◴[] No.42961025[source]
RE: the loss of bounding box information

You can recover word-level bounding boxes and confidence scores by using a traditional OCR engine such as AWS Textract and matching the results to Gemini’s output – see https://docless.app for a demo (disclaimer: I am the founder)