←back to thread

49 points LorenDB | 1 comments | | HN request time: 0.203s | source
Show context
ipsum2 ◴[] No.44380330[source]
I've been using Nvidia's parakeet model, it's been better than Whisper v3 large and smaller. Only supports English.

https://huggingface.co/nvidia/parakeet-tdt-0.6b-v2

replies(3): >>44380380 #>>44381533 #>>44384824 #
nico ◴[] No.44380380[source]
Does it need a newer GPU? Or can it run on just CPU?

Would it run on a raspberry pi?

replies(4): >>44380459 #>>44380499 #>>44380936 #>>44382292 #
1. GaggiX ◴[] No.44380459[source]
Look up for faster whisper or distilled whisper models, smaller models run quite nicely but perform poorly outside of English, if you are interested in a different language it's better to finetune it (HuggingFace has a huge amount of finetuned Whisper models).