←back to thread

293 points lapnect | 5 comments | | HN request time: 0.415s | source
1. bbor ◴[] No.42154472[source]
Looks awesome! Been doing a lot of OCR recently, and love the addition to the space. The reigning champion in the PDF -> Markdown space (AFAIK) is Facebook's Nougat[1], and I'm excited to hook this up to DSPy and see which works better for philosophy books. This repo links the Zerox[2] project by some startup, which also looks awesome, and certainly more smoothly advertised than Nougat. Would love corrections/advice from any actual experts passing by this comment section :)

That said, I have a few questions if OP/anyone knows the answers:

1. What is Together.ai, and is this model OSS? Their website sells them as a hosting service, and the "Custom Models" page[3] seems to be about custom finetuning, not, like, training new proprietary models in-house. They might have a HuggingFace profile but it's hard to tell if it's them https://huggingface.co/TogetherAI

2. The GitHub says "hosted demo", but the hosting part is just the tiny (clean!) WebGUI, yes? It's implied that this functionality is and will always be available only through API calls?

P.S. The header links are broken on my desktop browser -- no onClick triggered

[1] https://facebookresearch.github.io/nougat/

[2] https://github.com/getomni-ai/zerox

[3] https://www.together.ai/products#custom-models

replies(4): >>42154592 #>>42154679 #>>42154719 #>>42154807 #
2. magicalhippo ◴[] No.42154592[source]
Yeah was hoping for something I could self-host, both for privacy and cost.
3. gexla ◴[] No.42154679[source]
My guess is together.ai is at least partially sponsoring the demo.
4. jurnalanas ◴[] No.42154719[source]
the project author is Devrel from Together.ai. This is a fantastic way to advertise a dev tool, though.
5. rajansheth ◴[] No.42154807[source]
together.ai serves 100+ open-source models including multi-modal Llama 3.2 with an OpenAI compatible API