←back to thread

1901 points l2silver | 1 comments | | HN request time: 2.519s | source

Maybe you've created your own AR program for wearables that shows the definition of a word when you highlight it IRL, or you've built a personal calendar app for your family to display on a monitor in the kitchen. Whatever it is, I'd love to hear it.
1. nonoesp ◴[] No.35739425[source]
I wrote a live video/audio marker creator with OpenAI's Whisper.

A Stream Deck XL button runs a Python script that creates a timestamp for ongoing video and audio recordings which I use for live stream and podcast chapters while I'm streaming or recording.

Different buttons on the Stream Deck create markers with different labels, e.g., Introduction, Break, Marker, etc.

But as it's hard to name markers for every section and you have to go back to the recordings to infer what each section was about, I added a dynamic marker script that trims audio around a marker with ffmpeg, locally transcribes it with Whisper, and appends the transcript to the marker, so I can easily guess what the marker title should be without having to scroll and watch the video.