←back to thread

456 points ph4evers | 3 comments | | HN request time: 0.338s | source

I've been working on a little side project that combines Duolingo-like listening comprehension exercises with real content .

Every video is transcribed to get much better transcripts than the closed captions. I filter on high quality transcripts, and afterwards a LLM selects only plausible segments for the exercises. This seems to work well for quality control and seems to be reliable enough for these short exercises.

Would love your thoughts!

1. philipjoubert ◴[] No.43543848[source]
This is great - I've actually started building something similar myself a few months ago.

Requests:

- Split Spanish between Spain and Latin America

- Add difficulty levels (consider speaking speed and vocabulary used)

- Ability to select which topics I want the videos to be about (e.g. science, celebrity gossip, AI)

replies(2): >>43545233 #>>43549583 #
2. nbcesar ◴[] No.43545233[source]
+1 to splitting Spanish. Even better is picking a Spanish speaking country and listening to news from that specific country.
3. ph4evers ◴[] No.43549583[source]
Thanks!

> Split Spanish between Spain and Latin America Will do!

> Add difficulty levels (consider speaking speed and vocabulary used)

I'm working on splitting it up in easy/normal videos. That should be do-able to assess.

> Ability to select which topics I want the videos to be about (e.g. science, celebrity gossip, AI)

I'm thinking about creating a browser plugin where you can tick a box to automatically import it into Fluentsubs. Or create an exercise from an existing video. It will take minutes before it is fully transcribed but it can be a nice way to prep your own content without people blaming me that I serve biased content.

I'm not sure though if people are willing to install browser plugins. I'm always a bit weiry with plugins that are invasive on websites like YouTube.