Most active commenters
  • thomassmith65(5)
  • xnx(5)

←back to thread

990 points smitop | 22 comments | | HN request time: 1.442s | source | bottom
Show context
ranger_danger ◴[] No.44330199[source]
I'm surprised they don't just inject the ads directly into the video stream, I think that would solve their issue overnight (not that I want any ads personally). You could also rate-limit it to the playback speed to prevent pre-downloading the stream easily. But now that everything uses HLS/DASH, it's easy to inject different content right in the middle of the stream without re-encoding anything.
replies(15): >>44330295 #>>44330306 #>>44330327 #>>44330366 #>>44332987 #>>44333096 #>>44333102 #>>44333133 #>>44333320 #>>44333605 #>>44333700 #>>44333858 #>>44334367 #>>44335037 #>>44335453 #
noman-land ◴[] No.44333096[source]
There exists crowdsourced adblocking based on timestamps (SponsorBlock, Tubular). Soon we will have realtime on-device content-aware AI adblocking. They will ever win.
replies(2): >>44333119 #>>44333839 #
1. thomassmith65 ◴[] No.44333119[source]
Once we get content-aware AI adblocking, every video and podcast will turn into a product placement.
replies(3): >>44333293 #>>44333362 #>>44333371 #
2. ekianjo ◴[] No.44333293[source]
They are already doing product placement everywhere..'
replies(1): >>44333425 #
3. xnx ◴[] No.44333362[source]
I use content aware ad blocking to remove inserted and native ads from podcasts. The next level adblocking will be rewriting content that is overly commercial.
replies(3): >>44333369 #>>44333416 #>>44333638 #
4. toomuchtodo ◴[] No.44333369[source]
LLM ad blockers as content processors are next.
5. hsbauauvhabzb ◴[] No.44333371[source]
It’s already a race to the bottom, blocking tech improves and so does marketing. The latter will pump out as much as you’re scientifically proven to accept before switching off.
6. noahjk ◴[] No.44333416[source]
Any info on how you do that?
replies(1): >>44334156 #
7. thomassmith65 ◴[] No.44333425[source]
Few shows are relentless about it.

In the future, everything will be like that vapid chicken wing podcast (the one where they bring on an interesting, talented person and then waste half an hour interviewing her about sriracha)

So Ira Glass will be narrating This American Life while simultaneously reviewing different varieties of Doritos, etc.

...or the producers of The Rest is History will add the Planters Peanut Man as a third host

...or Marques Brownlee will review every product in relation to how well it works with Bose headphones

replies(4): >>44333556 #>>44333614 #>>44333646 #>>44334904 #
8. squigz ◴[] No.44333556{3}[source]
No, the future will not be like that.
replies(1): >>44333669 #
9. sodality2 ◴[] No.44333614{3}[source]
> Few shows are relentless about it.

My favorite relentless one is Tracker (Amazon Prime), who spend approximately 30% of screen time dedicated to showing off a GMC pickup and Airstream, but the most egregious was one dialog line:

> As Colter enters and gives them hugs, Velma remembers that they got a gift for him. Reenie hands him the gift - very conspicuously packaged in an Amazon box with its trademark logo and blue tape - and says, "I've gotta say, next-day delivery is pretty sweet. Thank you, Amazon Prime!"

10. nickthegreek ◴[] No.44333638[source]
got any links to set this up?
replies(1): >>44339609 #
11. nickthegreek ◴[] No.44333646{3}[source]
that is not what they do on hot ones. sean is an intelligent interviewer and their team goes above and beyond to find interesting lore in people’s past to showcase. guests are routinely impressed.
replies(1): >>44333714 #
12. thomassmith65 ◴[] No.44333669{4}[source]
I've seen the future, and it kills 99.99% of germs, bacteria and viruses...

...it powers through tough grease and grime

...with no harsh smells!

The future is Fantastik®.

13. thomassmith65 ◴[] No.44333714{4}[source]
If a person enjoys a show that is also a brand of hot sauce, it's not for me to say they shouldn't. It's just not my thing; I have too many hangups.
14. coppsilgold ◴[] No.44334156{3}[source]
I imagine you can do it by AI-transcribing the podcast while preserving timestamp metadata for each symbol. Use LLM to identify undesirable segments (ask it to output json or something) and then cut them out from the audio with ffmpeg.

Then you would need to set up a server that would do all this and serve as a 'mirror' to your podcasts without the ads.

replies(2): >>44334188 #>>44334221 #
15. noahjk ◴[] No.44334188{4}[source]
I actually found a project which does almost exactly what you've described:

https://github.com/jdrbc/podly_pure_podcasts

16. xnx ◴[] No.44334221{4}[source]
You almost exactly described my process: podcast-dl > whisper > Gemini > ffmpeg > ftp > cheap web host
replies(2): >>44334244 #>>44338226 #
17. thomassmith65 ◴[] No.44334244{5}[source]
If you've gone through that much effort, you might as well turn it into a subscription service. It would be resource intensive, but some people would gladly pay through their nose to rid their podcasts of ads.
replies(1): >>44338314 #
18. noman-land ◴[] No.44334904{3}[source]
Even though I hate advertising I think Hot Ones is one of the few efforts to do a good job with this.

1. Interviewing a guest while they are eating insanely spicy food is an extremely novel idea and the guest's reactions and answers end up being really interesting and unexpected as a result. It humanizes famous people in a way I've never seen before because you can't just bluff your way through it.

2. The hot sauce vendors are often small companies or indie makers (at least they used to be). This is way different than reviewing 11 bags of Superman Transformers 3D Doritos Walmart Product Placement for the next blockbuster.

3. Hot sauce is interesting! Nearly every culture on every continent has hot sauce. They are made from a huge variety of interesting and unusual ingredients but are also simple and can be made at home. Hot sauce hasn't been explored in this way in popular culture.

19. walthamstow ◴[] No.44338226{5}[source]
What's your prompt for Gemini like, does it include examples of ads? Assume you're using Flash for cost?

I also have a setup like this, I transcribe with Whisper and send it to OpenAI 4o-mini to detect ads then clip those segments with pydub, but my prompt must be lacking because the success rate on detecting ads is maybe 60%

replies(1): >>44339606 #
20. xnx ◴[] No.44338314{6}[source]
I'd definitely like to make it easier to use and spread it more widely, but I can't directly distribute the edited (copyrighted) podcast files. Might share transcript markers of the text right before and after ad segments, which is like a slightly more complicated version of what SponsorBlock does.
21. xnx ◴[] No.44339606{6}[source]
My Gemini Flash 2.0 prompt: "Below is the transcript of a podcast preceded by a line number. Reply with the line numbers that are likely to be from advertisements, promotions, commercials, sponsorships, or ending credits."

I think it's better than 60%, but I should definitely set up some evals.

I split the text by sentence, but was considering having the LLM try and put into paragraph (that might conceptually chunk commercial sentences together), but what I've got has been good enough for me.

I wanted to switch to Flash 2.5, but it looks like they increased the price a lot.

I think I could do a fair bit of ad identification just with text heuristics: "This podcast is sponsored/supported by...", etc.

22. xnx ◴[] No.44339609{3}[source]
Not yet. It's an extremely crude collection of scripts and code, but I should still put it out there soon.