←back to thread

456 points ph4evers | 2 comments | | HN request time: 0.401s | source

I've been working on a little side project that combines Duolingo-like listening comprehension exercises with real content .

Every video is transcribed to get much better transcripts than the closed captions. I filter on high quality transcripts, and afterwards a LLM selects only plausible segments for the exercises. This seems to work well for quality control and seems to be reliable enough for these short exercises.

Would love your thoughts!

1. hk__2 ◴[] No.43544800[source]
I tried an exercise with Italian, but for some reason one of the words is not in the list to drag and drop ("qualcuno"), so I’m stuck: https://app.fluentsubs.com/exercises/cm8y1r2cv004m8v1pr775ko...

Edit: also tried in French, and it shows some words in red (I guess that means "invalid" -- please don’t convey information with color only) although they are correct: https://app.fluentsubs.com/exercises/cm8y1o6d5002s8v1p2h0m2f...

replies(1): >>43549751 #
2. ph4evers ◴[] No.43549751[source]
Thank you for trying and the feedback!

I'm working on improving the feedback. It is a bit confusing since some words are very similar so you have no idea what went wrong.

I checked the Italian video. But I don't fully understand: https://imgur.com/a/YcF3dnb . It doesn't pick qualcuno as a filler word. Is it still broken?