666 points georgemandis | 1 comment
pbbakkum ◴[] No.44382153[source]
This is great, thank you for sharing. I work on these APIs at OpenAI. It's a surprise to me that it still works reasonably well at 2x or 3x speed, but on the other hand, for phone channels we get 8 kHz audio that is upsampled to 24 kHz for the model, and it still works well. Note that there's probably a measurable decrease in transcription accuracy that worsens as you deviate from 1x speed. Also, we really need to support bigger/longer file uploads :)
replies(2): >>44382203 #>>44384158 #
1. nerder92 ◴[] No.44382203[source]
Quick feedback: it would be cool to research this internally and maybe find a sweet spot in the speed multiplier where the loss is minimal. This preprocessing is quite cheap and could eventually bring down the API price.
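
To make the "sweet spot" idea concrete, here is a rough sketch of how one might choose a multiplier once accuracy has been measured at several speeds. Everything in it is an assumption for illustration: the function names `pick_speed` and `billed_minutes` are hypothetical, the error rates are placeholders and not measurements, and it assumes cost scales with audio duration.

```python
from typing import Mapping


def pick_speed(wer_by_speed: Mapping[float, float], max_wer_increase: float) -> float:
    """Return the fastest speed whose word error rate (WER) stays within
    max_wer_increase of the 1x baseline. Hypothetical helper."""
    baseline = wer_by_speed[1.0]
    acceptable = [
        speed
        for speed, wer in wer_by_speed.items()
        if wer - baseline <= max_wer_increase
    ]
    # 1.0 always qualifies when max_wer_increase >= 0, so the list is non-empty.
    return max(acceptable)


def billed_minutes(duration_minutes: float, speed: float) -> float:
    """Audio length after speeding up, assuming billing is per minute of audio."""
    return duration_minutes / speed


# Placeholder numbers for illustration only, NOT real measurements.
example_wer = {1.0: 0.050, 1.5: 0.055, 2.0: 0.062, 3.0: 0.110}

best = pick_speed(example_wer, max_wer_increase=0.02)
print(f"chosen speed: {best}x")
print(f"60 min of audio becomes {billed_minutes(60, best)} min")
```

With real per-speed error rates from an evaluation set in place of the placeholders, the same selection rule would show how much accuracy a given cost reduction actually gives up.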