Real metrics from my M1 Max: 4.5hr video file transcribed in 3 minutes 32
seconds. Works completely offline.
First 5 HN users who click the button on the page get it free. Literally promo code straight to the app sore
Key differences vs Rev/Otter:
- No 2-hour file limits (handles any length)
- Timecodes stay accurate on long files (no drift from chunking)
- Supports MP3, WAV, MP4, MOV, M4A, FLAC
- Exports to SRT, VTT, JSON, PDF, DOCX, CSV, Markdown
Built for macOS. Happy to answer questions!
(Also, the text is completely illegible on your site.)
For example, could it support a video that included spoken Latin, ancient Greek, German, and Italian?
I will include better version support (probably to os 13).
I haven't tried a 4+ hour video with MacWhisper but I presume that would work the same.
Also the Whisper model doesn't really have a context window, it already segments the audio with a certain amount of overlap between the chunks, I really have a hard time understanding what you are trying to say here.
https://scriberpro.cc/about/ Are you trolling people with this page's design? Unreadable colors AND a wobble effect? :D
You haven’t shared any architectural details. What model? What size? How can anyone be sure that what you’re building is truly offline?
The first thought I had when it loaded was, "Did we forget how to make webpages?"
Sorry. I'm sure the software is great, but yeah.
Cool product, but it would be better if you stopped spreading misinformation to support it.
The elapsed-time timestamps didn't correlate well with other data sources. I figured it was a mistake on my end, and just brushed it off.
Green, yellow, red or whatever hue is fine, as long as it's dark or light enough. Colorblind and non-colorblind people can see how dark or light a color is (luminance), but they might not agree on the hue. That's why WCAG contrast checks require luminance contrast and not hue contrast.
It's best to use a contrast checker because it's not always intuitive how dark or light a color is e.g. yellow and lime are almost as light as white.
I have change the bg color
Thanks.
I'm using the browser built in transcription service plus downloading a model and running it via webgpu. No login. At the end of your meeting, you get a zip file with the audio, transcript and summary.
I don’t see this sort of thing, has the page changed? Edit: the comments here…
The drop shadow on the pages does make it deeply unpleasant to read.
I have launched apps focused on a new feature in the latest OS and regretted it. The # of people who have the latest OS is much smaller than the full install base for much longer than I thought. As a result, my marketing conversion was unnaturally low - people who liked the app idea but couldn't install because they had the wrong OS. This causes two problems: potential users I activated but couldn't convert and this signal gets internalized by the App Store, pushing down future impressions.
Now I always have a fallback implementation of the feature so I can target the prior OS. Both Mac and iOS.
That was back in 2023. I assume things work better now.
It even supports speaker differentiation/recognition and is open source on mac/windows/linux;
Thanks for sharing.
Looking forward to the "Speaker Detection" feature release. ;)