
48 points LorenDB | 1 comment
ipsum2 ◴[] No.44380330[source]
I've been using Nvidia's Parakeet model; it's been better than Whisper v3 large and smaller. It only supports English.

https://huggingface.co/nvidia/parakeet-tdt-0.6b-v2

replies(3): >>44380380 #>>44381533 #>>44384824 #
1. PeterStuer ◴[] No.44384824[source]
In my side-by-side testing of Whisper and Parakeet on Euro-English meeting recordings, Whisper produced the better transcripts, but Parakeet was faster.

I'm sticking with Whisper as it is fast enough for my use case.
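
For anyone wanting to repeat this kind of side-by-side test, one common way to put a number on "better result" is word error rate (WER) against a hand-corrected reference transcript. Below is a minimal sketch, not the setup used above: the function name and sample strings are made up for illustration, it does not call either model, and real comparisons usually also normalize punctuation and numbers first.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the reference length.

    Hypothetical helper for illustration; only lowercases and splits on
    whitespace, so punctuation differences count as errors.
    """
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference transcript is empty")

    # prev[j] holds the edit distance between the reference words seen so
    # far (before the current one) and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]  # deleting all i reference words matches an empty hypothesis
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1      # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr.append(min(substitution, deletion, insertion))
        prev = curr
    return prev[-1] / len(ref)


if __name__ == "__main__":
    # Made-up sample strings standing in for one reference and two model outputs.
    reference = "we will ship the release on friday"
    transcripts = {
        "model_a": "we will ship the release on friday",
        "model_b": "we will sip the release friday",
    }
    for name, text in transcripts.items():
        print(name, round(word_error_rate(reference, text), 3))
```

Lower is better; run it over the same recordings for both models and time each run separately if speed matters for your use case.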