←back to thread

612 points meetpateltech | 1 comments | | HN request time: 0s | source
Show context
Ninjinka ◴[] No.42951897[source]
Pricing is CRAZY.

Audio input is $0.70 per million tokens on 2.0 Flash, $0.075 for 2.0 Flash-Lite and 1.5 Flash.

For gpt-4o-mini-audio-preview, it's $10 per million tokens of audio input.

replies(2): >>42952141 #>>42952542 #
sunaookami ◴[] No.42952141[source]
Sadly: "Gemini can only infer responses to English-language speech."

https://ai.google.dev/gemini-api/docs/audio?lang=rest#techni...

replies(1): >>42958141 #
1. mbrock ◴[] No.42958141[source]
I don't know what they mean by this but the obvious interpretation is not true. It understands other languages, it even does really well with low representation languages, in my case Latvian.