That's a valid problem you are solving.
I had similar usecase that I solved using PDF[dot]co
What it does:
- Extracts specific data from PDFs based on your custom schema - Returns clean, structured JSON that's ready to use - Works with just a PDF link + your schema definition
Just run npm install documind to get started.