←back to thread

137 points rezivor | 5 comments | | HN request time: 0.304s | source

Hey HN! Built this because I was tired of waiting hours for transcription services and didn't want to upload sensitive recordings to the cloud.

  Real metrics from my M1 Max: 4.5hr video file transcribed in 3 minutes 32
  seconds. Works completely offline.

   First 5 HN users who click the button on the page get it free. Literally promo code straight to the app sore  


  Key differences vs Rev/Otter:
  - No 2-hour file limits (handles any length)
  - Timecodes stay accurate on long files (no drift from chunking)
  - Supports MP3, WAV, MP4, MOV, M4A, FLAC
  - Exports to SRT, VTT, JSON, PDF, DOCX, CSV, Markdown

  Built for macOS. Happy to answer questions!
1. CrazyCatDog ◴[] No.45592654[source]
Question: can it discern (and label) different speakers? If so, could you kindly share the limit on speakers per video?
replies(3): >>45593345 #>>45593508 #>>45594867 #
2. oidar ◴[] No.45593345[source]
You are looking for speaker diarization. No one is doing this well currently on device (in macOS land at least).
replies(1): >>45612481 #
3. rezivor ◴[] No.45593508[source]
No, not yet! That will definitely be included in the next update next month. Thank you for reminding me of peoples unique need for this use case
4. CharlesW ◴[] No.45594867[source]
MacWhisper Pro supports this, if your need for this is time-sensitive. https://macwhisper.helpscoutdocs.com/article/32-automatic-sp...
5. shloky ◴[] No.45612481[source]
Or in the cloud tbh