I vibecoded a similar app. Here’s the open source link, if folks want to build their own:
replies(1):
Real metrics from my M1 Max: 4.5hr video file transcribed in 3 minutes 32
seconds. Works completely offline.
First 5 HN users who click the button on the page get it free. Literally promo code straight to the app sore
Key differences vs Rev/Otter:
- No 2-hour file limits (handles any length)
- Timecodes stay accurate on long files (no drift from chunking)
- Supports MP3, WAV, MP4, MOV, M4A, FLAC
- Exports to SRT, VTT, JSON, PDF, DOCX, CSV, Markdown
Built for macOS. Happy to answer questions!
You haven’t shared any architectural details. What model? What size? How can anyone be sure that what you’re building is truly offline?