←back to thread

137 points rezivor | 5 comments | | HN request time: 0.954s | source

Hey HN! Built this because I was tired of waiting hours for transcription services and didn't want to upload sensitive recordings to the cloud.

  Real metrics from my M1 Max: 4.5hr video file transcribed in 3 minutes 32
  seconds. Works completely offline.

   First 5 HN users who click the button on the page get it free. Literally promo code straight to the app sore  


  Key differences vs Rev/Otter:
  - No 2-hour file limits (handles any length)
  - Timecodes stay accurate on long files (no drift from chunking)
  - Supports MP3, WAV, MP4, MOV, M4A, FLAC
  - Exports to SRT, VTT, JSON, PDF, DOCX, CSV, Markdown

  Built for macOS. Happy to answer questions!
1. Telemakhos ◴[] No.45591769[source]
What languages does this support? Does it support switching between multiple languages in one video?

For example, could it support a video that included spoken Latin, ancient Greek, German, and Italian?

replies(2): >>45591989 #>>45598131 #
2. rezivor ◴[] No.45591989[source]
eng der fr de es it pt ru zh ko ar and ja
replies(1): >>45592592 #
3. Telemakhos ◴[] No.45592592[source]
So, can it handle multiple languages in one video, or do you need to segment the different languages using LID first? This has been a thorny issue for people working in multilingual audio (there are at least two or three of us).
replies(1): >>45592721 #
4. rezivor ◴[] No.45592721{3}[source]
I haven't test that specific edge case, I'm sorry. I tested 2 langue's having a normal conversation and that worked fine- "Auto or English" handle multiple lan the best
5. runxel ◴[] No.45598131[source]
So weird that this is nowhere stated on the website at all. Was literally the first and only thing I was interested in. So bad.