Also look at Vibe:
It even supports speaker differentiation/recognition and is open source on mac/windows/linux;
replies(1):
Real metrics from my M1 Max: 4.5hr video file transcribed in 3 minutes 32
seconds. Works completely offline.
First 5 HN users who click the button on the page get it free. Literally promo code straight to the app sore
Key differences vs Rev/Otter:
- No 2-hour file limits (handles any length)
- Timecodes stay accurate on long files (no drift from chunking)
- Supports MP3, WAV, MP4, MOV, M4A, FLAC
- Exports to SRT, VTT, JSON, PDF, DOCX, CSV, Markdown
Built for macOS. Happy to answer questions!
It even supports speaker differentiation/recognition and is open source on mac/windows/linux;