(app.fluentsubs.com)

456 points ph4evers | 2 comments | 01 Apr 25 05:46 UTC | HN request time: 0s | source

I've been working on a little side project that combines Duolingo-like listening comprehension exercises with real content .

Every video is transcribed to get much better transcripts than the closed captions. I filter on high quality transcripts, and afterwards a LLM selects only plausible segments for the exercises. This seems to work well for quality control and seems to be reliable enough for these short exercises.

Would love your thoughts!

1. palata ◴[01 Apr 25 14:01 UTC] No.43546898[source]▶

>>43543235 (OP) #

This is cool!

I'm curious now: how do you transcribe the videos? And how do you align the transcript with the video (in terms of timing)? Are there libraries doing that?

replies(1): >>43550034 #

2. ph4evers ◴[01 Apr 25 18:33 UTC] No.43550034[source]▶

>>43546898 (TP) #

I'm using AssemblyAI and Deepgram. AssemblyAI for the large languages and Deepgram for the smaller ones.

↑

Show HN: Duolingo-style exercises but with real-world content like the news