←back to thread

652 points toebee | 3 comments | | HN request time: 0s | source
Show context
Versipelle ◴[] No.43754941[source]
This is really impressive; we're getting close to a dream of mine: the ability to generate proper audiobooks from EPUBs. Not just a robotic single voice for everything, but different, consistent voices for each protagonist, with the LLM analyzing the text to guess which voice to use and add an appropriate tone, much like a voice actor would do.

I've tried "EPUB to audiobook" tools, but they are really miles behind what a real narrator accomplishes and make the audiobook impossible to engage with

replies(2): >>43755849 #>>43755991 #
azinman2 ◴[] No.43755991[source]
Wouldn’t it be more desirable to hear an actual human on an audiobook? Ideally the author?
replies(6): >>43756515 #>>43756799 #>>43757678 #>>43758680 #>>43759650 #>>43773492 #
1. senordevnyc ◴[] No.43756515[source]
Honestly, I’d say that’s true only for the author. Anyone else is just going to be interpreting the words to understand how to best convey the character / emotion / situation / etc., just like an AI will have to do. If an AI can do that more effectively than a human, why not?

The author could be better, because they at least have other info beyond the text to rely on, they can go off-script or add little details, etc.

replies(1): >>43756762 #
2. DrSiemer ◴[] No.43756762[source]
As somebody who has listened to hundreds of audiobooks, I can tell you authors are generally not the best choice to voice their own work. They may know every intent, but they are writers, not actors.

The most skilled readers will make you want to read books _just because they narrated them_. They add a unique quality to the story, that you do not get from reading yourself or from watching a video adaptation.

Currently I'm in The Age of Madness, read by Steven Pacey. He's fantastic. The late Roy Dotrice is worth a mention as well, for voicing Game of Thrones and claiming the Guinness world record for most distinct voices (224) in one series.

It will be awesome if we can create readings automatically, but it will be a while before TTS can compete with the best readers out there.

replies(1): >>43757590 #
3. azinman2 ◴[] No.43757590[source]
I’d suggest even if the TTS sounded good, I’d still rather a human because:

1. It’s a job that seems worthwhile to support, especially as it’s “practice” that only adds to a lifetime of work and improves their central skill set

2. A voice actor will bring their own flare, just like any actor does to their job

3. They (should) prepare for the book, understanding what it’s about in its entirety, and bring that context to the reading