←back to thread

242 points alphabetting | 4 comments | | HN request time: 0s | source
Show context
wenbin ◴[] No.41872782[source]
NotebookLM is contributing to fake podcasts across the internet, with over 1,300 and counting:

https://github.com/ListenNotes/ai-generated-fake-podcasts/bl...

Google is taking a different approach this time, moving quickly. While NotebookLM is indeed a remarkable tool for personal productivity and learning, it also opens the door for spammers to mass-produce content that isn't meant for human consumption.

Amidst all the praise for this project, I’d like to offer a different perspective. I hope the NotebookLM team sees this and recognizes the seriousness of the spam issue, which will only grow if left unaddressed. If you know someone on the team, please bring this to their attention - Could you please provide a tool or some plain-English guidelines to help detect audio generated by NotebookLM? Is there a watermark or any other identifiable marker that can be used?

Just recently, a Hacker News post highlighted how nearly all Google image results for "baby peacock" are AI-generated: https://news.ycombinator.com/item?id=41767648

It won't be long before we see a similar trend with low-quality, AI-generated fake podcasts flooding the internet.

replies(14): >>41872802 #>>41872821 #>>41872878 #>>41872954 #>>41873067 #>>41873074 #>>41873152 #>>41873269 #>>41873297 #>>41873476 #>>41874055 #>>41874427 #>>41874680 #>>41875008 #
jsheard ◴[] No.41872821[source]
> it also opens the door for spammers to mass-produce content that isn't meant for human consumption.

What's new? Every novel class of genAI product has brought a tidal wave of slop, spam and/or scams to the medium it generates. If anyone working on a product like this doesn't anticipate it being used to mass produce vapid white-noise "content" on an industrial scale then they haven't been paying attention.

replies(1): >>41872875 #
wenbin ◴[] No.41872875[source]
This is definitely not a new issue.

What I’m aiming for is to ensure that the NotebookLM team is aware of the impact and actively considering it. Hopefully, they are already working on tools or mechanisms to address the problem—ideally before their colleagues at YouTube and Google Search come asking for help to fight NotebookLM-generated spams :)

It's certainly easier for the creators of genAI to build detection tools than for outsiders to do so. AI audio detection is a hard problem - https://www.npr.org/2024/04/05/1241446778/deepfake-audio-det...

replies(2): >>41873156 #>>41875011 #
1. criddell ◴[] No.41873156[source]
> What I’m aiming for is to ensure that the NotebookLM team is aware of the impact and actively considering it.

What is the impact? Have any of them attracted an audience of any meaningful size? If a month from now there are 1.3 million generated podcasts, what do you anticipate the fallout to be?

replies(1): >>41875383 #
2. sgdfhijfgsdfgds ◴[] No.41875383[source]
> If a month from now there are 1.3 million generated podcasts, what do you anticipate the fallout to be?

Is this a rhetorical question? Because the answer for podcast indexing and search services is surely pretty obvious.

replies(1): >>41875734 #
3. criddell ◴[] No.41875734[source]
Why is it a problem? There's even more material for those services now and for their customers, the value these services can provide is even higher.
replies(1): >>41876317 #
4. ungreased0675 ◴[] No.41876317{3}[source]
Wouldn’t the value be lower if podcasts end up the way product review blogs have? Endless spam that causes people to append “Reddit” to their searches in hopes of finding something human generated.