←back to thread

137 points rezivor | 2 comments | | HN request time: 0.401s | source

Hey HN! Built this because I was tired of waiting hours for transcription services and didn't want to upload sensitive recordings to the cloud.

  Real metrics from my M1 Max: 4.5hr video file transcribed in 3 minutes 32
  seconds. Works completely offline.

   First 5 HN users who click the button on the page get it free. Literally promo code straight to the app sore  


  Key differences vs Rev/Otter:
  - No 2-hour file limits (handles any length)
  - Timecodes stay accurate on long files (no drift from chunking)
  - Supports MP3, WAV, MP4, MOV, M4A, FLAC
  - Exports to SRT, VTT, JSON, PDF, DOCX, CSV, Markdown

  Built for macOS. Happy to answer questions!
1. der_philipp ◴[] No.45604549[source]
Also look at Vibe:

It even supports speaker differentiation/recognition and is open source on mac/windows/linux;

https://github.com/thewh1teagle/vibe

replies(1): >>45605146 #
2. der_philipp ◴[] No.45605146[source]
It uses whisper, but also directly calls other tools and puts everything under one nice Gui