Hey, this looks really nice and worked like a breeze for French!
Question: out of the processing steps you mention - transcription, quality filtering, segment selection, and (I guess) wrong-word selection) are there any truly manual steps? Those would be the ones that prevent you from building this for just about any language that has good transcription available, right?