I vibecoded a similar app. Here’s the open source link, if folks want to build their own:
 replies(1): 
  Real metrics from my M1 Max: a 4.5-hour video file transcribed in 3 minutes
  32 seconds (roughly 76x real time). Works completely offline.
  First 5 HN users who click the button on the page get it free, literally a promo code straight to the App Store.
  Key differences vs Rev/Otter:
  - No 2-hour file limits (handles any length)
  - Timecodes stay accurate on long files (no drift from chunking)
  - Supports MP3, WAV, MP4, MOV, M4A, FLAC
  - Exports to SRT, VTT, JSON, PDF, DOCX, CSV, Markdown
  Built for macOS. Happy to answer questions!