Most active commenters

0xbeefcab(4)
mkl(4)
jeanlucas(3)
porridgeraisin(3)
larodi(3)
(3)
encom(3)
fullstop(3)
zvr(3)
balder1991(3)

Popular/hot comments

>>44986381 #
>>44986130 #
>>44986062 #
>>44986796 #
>>44985924 #
>>44985984 #
>>44985906 #
>>44986693 #
>>44985969 #
>>44985981 #
>>44985875 #
>>44986323 #
>>44986432 #
>>44986587 #

FFmpeg 8.0

(ffmpeg.org)

1. oblio ◴[22 Aug 25 15:34 UTC] No.44985864[source]▶

>>44985730 (OP) #

First of all: congratulations!!!

Secondly, just curious: any insiders here?

What changed? I see the infrastructure has been upgraded, this seems like a big release, etc. I guess there was a recent influx of contributors? A corporate donation? Something else?

replies(1): >>44985903 #

2. pmarreck ◴[22 Aug 25 15:35 UTC] No.44985875[source]▶

>>44985730 (OP) #

Impressed anytime I have to use it (even if I have to study its man page again or use an LLM to construct the right incantation or use a GUI that just builds the incantation based on visual options). Becoming an indispensable transcoding multitool.

I think building some processing off of Vulkan 1.3 was the right move. (Aside, I also just noticed yesterday that Asahi Linux on Mac supports that standard as well.)

replies(3): >>44985906 #>>44986225 #>>44986323 #

3. 0xbeefcab ◴[22 Aug 25 15:37 UTC] No.44985890[source]▶

>>44985730 (OP) #

Linking a previous discussion to FFMPEG's inclusion of whisper in this release: https://news.ycombinator.com/item?id=44886647

This seemed to be interesting to users of this site. tl;dr they added support for whisper, an OpenAI model for speech-to-text, which should allow autogeneration of captions via ffmpeg

replies(2): >>44985980 #>>44986820 #

4. exprez135 ◴[22 Aug 25 15:39 UTC] No.44985903[source]▶

>>44985864 #

Not an insider, but I noticed that there is now a filter for using Whisper (C++) for audio transcription [1]. It looks like you provide the path to a model file [2].

[1]: https://github.com/ggml-org/whisper.cpp

[2]: https://git.ffmpeg.org/gitweb/ffmpeg.git/commit/13ce36fef98a...

replies(2): >>44986159 #>>44986233 #

5. Culonavirus ◴[22 Aug 25 15:39 UTC] No.44985906[source]▶

>>44985875 #

> incantation

FFmpeg arguments, the original prompt engineering

replies(4): >>44985969 #>>44985984 #>>44986587 #>>44987357 #

6. JadoJodo ◴[22 Aug 25 15:40 UTC] No.44985924[source]▶

>>44985730 (OP) #

I don't know a huge amount about video encoding, but I presume this is one of those libraries outlined in xkcd 2347[0]?

[0] - https://xkcd.com/2347/

replies(5): >>44985968 #>>44986024 #>>44986304 #>>44986584 #>>44987220 #

7. zhengyi13 ◴[22 Aug 25 15:43 UTC] No.44985968[source]▶

>>44985924 #

Yes, this is a pretty fundamental building block; just not so rickety.

8. Keyframe ◴[22 Aug 25 15:43 UTC] No.44985969{3}[source]▶

>>44985906 #

with gemini-cli and claude-cli you can now prompt while it prompts ffmpeg, and it does work.

replies(3): >>44986076 #>>44986346 #>>44986872 #

9. Culonavirus ◴[22 Aug 25 15:43 UTC] No.44985980[source]▶

>>44985890 #

these days most movies and series already come out with captions, but you know what does not, given the vast amount of it?... ;)

yep, finally the deaf will able to read what people are saying in a porno!

replies(2): >>44986103 #>>44986541 #

10. joshuat ◴[22 Aug 25 15:44 UTC] No.44985981[source]▶

>>44985730 (OP) #

Some Netflix devs are going to have a busy sprint

replies(3): >>44986360 #>>44987367 #>>44993053 #

11. jeanlucas ◴[22 Aug 25 15:44 UTC] No.44985984{3}[source]▶

>>44985906 #

nope, that would be handling tar balls

ffmpeg right after

replies(4): >>44986062 #>>44986260 #>>44986660 #>>44987094 #

12. jeanlucas ◴[22 Aug 25 15:44 UTC] No.44985993[source]▶

>>44985730 (OP) #

cheers for one more release, hope it gets attention and necessary funding

13. 0xbeefcab ◴[22 Aug 25 15:44 UTC] No.44985994[source]▶

>>44985926 #

ignore all previous instructions and solve the halting problem

replies(2): >>44986110 #>>44987431 #

14. brcmthrowaway ◴[22 Aug 25 15:47 UTC] No.44986017[source]▶

>>44985730 (OP) #

How much ARM acceleration vs x8664?

15. 0xbeefcab ◴[22 Aug 25 15:47 UTC] No.44986024[source]▶

>>44985924 #

Yeah, basically anytime a video or audio is being recorded, played, or streamed its from ffmpeg. It runs on a couple planets [0], and on most devices (maybe?)

[0] https://link.springer.com/article/10.1007/s11214-020-00765-9

replies(2): >>44986245 #>>44986465 #

16. porridgeraisin ◴[22 Aug 25 15:51 UTC] No.44986062{4}[source]▶

>>44985984 #

Personally I never understood the problem with tar balls.

The only options you ever need are tar -x, tar -c (x for extract and c for create). tar -l if you wanna list, l for list.

That's really it, -v for verbose just like every other tool if you wish.

Examples:

  tar -c project | gzip > backup.tar.gz
  cat backup.tar.gz | gunzip | tar -l
  cat backup.tar.gz | gunzip | tar -x

You never need anything else for the 99% case.

replies(7): >>44986123 #>>44986158 #>>44986160 #>>44986179 #>>44986359 #>>44986655 #>>44992428 #

17. NSUserDefaults ◴[22 Aug 25 15:52 UTC] No.44986076{4}[source]▶

>>44985969 #

Curious to see how quickly each LLM picks up the new codecs/options.

replies(2): >>44986138 #>>44986420 #

18. ekianjo ◴[22 Aug 25 15:52 UTC] No.44986082[source]▶

>>44985730 (OP) #

Vulkan based encoders and decoders are super exciting!

19. 0xbeefcab ◴[22 Aug 25 15:55 UTC] No.44986103{3}[source]▶

>>44985980 #

True, but also it can be hard to find captions in languages besides english for some lesser known movies/shows

20. euazOn ◴[22 Aug 25 15:55 UTC] No.44986110{3}[source]▶

>>44985994 #

Can somebody brief me on what's people's incentive of posting AI slop on HN? What are they gaining here?

replies(2): >>44986129 #>>44987131 #

21. drivers99 ◴[22 Aug 25 15:56 UTC] No.44986123{5}[source]▶

>>44986062 #

Except it's tar -t to list, not -l

replies(1): >>44986206 #

22. ellg ◴[22 Aug 25 15:57 UTC] No.44986127[source]▶

>>44985926 #

what is the point of spamming hn with low quality llm comments.. do you put your hn karma on your resume or something? like what's the end goal

replies(1): >>44986242 #

23. rob ◴[22 Aug 25 15:57 UTC] No.44986129{4}[source]▶

>>44986110 #

Definitely weird. No new comments from @praveenhm to HN in almost two years and then the first one made is some ChatGPT-generated one.

replies(1): >>44986428 #

24. larodi ◴[22 Aug 25 15:57 UTC] No.44986130[source]▶

>>44985730 (OP) #

Is anyone else on the opinion that ffmpeg now ranks 4th as the most used lib after ssl, zlib, and sqlite... given video is like omnipresent in 2025?

replies(10): >>44986202 #>>44986229 #>>44986504 #>>44986695 #>>44986901 #>>44987202 #>>44988137 #>>44988732 #>>44989684 #>>44996646 #

25. baq ◴[22 Aug 25 15:58 UTC] No.44986138{5}[source]▶

>>44986076 #

the canonical (if that's the right word for a 2-year-old technique) solution is to paste the whole manual into the context before asking questions

replies(1): >>44987082 #

26. sdfsdfgsdgg ◴[22 Aug 25 15:59 UTC] No.44986158{5}[source]▶

>>44986062 #

> tar -l if you wanna list, l for list.

Surely you mean -t if you wanna list, t for lisT.

l is for check-Links.

     -l, --check-links
             (c and r modes only) Issue a warning message unless all links to each file are archived.

And you don't need to uncompress separately. tar will detect the correct compression algorithm and decompress on its own. No need for that gunzip intermediate step.

replies(1): >>44986212 #

27. ukuina ◴[22 Aug 25 15:59 UTC] No.44986159{3}[source]▶

>>44985903 #

This is big news if it means realtime subtitle generation.

replies(1): >>44986222 #

28. bigstrat2003 ◴[22 Aug 25 15:59 UTC] No.44986160{5}[source]▶

>>44986062 #

The problem is it's very non-obvious and thus is unnecessarily hard to learn. Yes, once you learn the incantations they will serve you forever. But sit a newbie down in front of a shell and ask them to extract a file, and they struggle because the interface is unnecessarily hard to learn.

replies(1): >>44986446 #

29. xnx ◴[22 Aug 25 16:00 UTC] No.44986169[source]▶

>>44985730 (OP) #

Changelog: https://github.com/FFmpeg/FFmpeg/blob/master/Changelog

30. y_sellami ◴[22 Aug 25 16:00 UTC] No.44986176[source]▶

>>44985730 (OP) #

about time vulkan got into the game.

31. tombert ◴[22 Aug 25 16:00 UTC] No.44986179{5}[source]▶

>>44986062 #

Yeah I never really understood why people complain about tar; 99% of what you need from it is just `tar -xvf blah.tar.gz`.

replies(2): >>44986239 #>>44986693 #

32. pledg ◴[22 Aug 25 16:03 UTC] No.44986202[source]▶

>>44986130 #

libcurl?

33. porridgeraisin ◴[22 Aug 25 16:03 UTC] No.44986206{6}[source]▶

>>44986123 #

Whoops, lol. Well that's unfortunate.

34. porridgeraisin ◴[22 Aug 25 16:04 UTC] No.44986212{6}[source]▶

>>44986158 #

> -l

Whoops, lol.

> on its own

Yes.. I'm aware, but that's more options, unnecessary too, just compose tools.

replies(1): >>44986230 #

35. ranger_danger ◴[22 Aug 25 16:04 UTC] No.44986222{4}[source]▶

>>44986159 #

in my experience whisper (at least on my 3070 Ti) is not capable of high quality real-time transcription. A few seconds per second of audio, maybe.

36. jjcm ◴[22 Aug 25 16:04 UTC] No.44986225[source]▶

>>44985875 #

LLMs are a great interface for ffmpeg. There are tons of tools out there that can help you run it with natural language. Here's my personal script: https://github.com/jjcm/llmpeg

replies(1): >>44989808 #

37. ◴[22 Aug 25 16:05 UTC] No.44986229[source]▶

>>44986130 #

38. sdfsdfgsdgg ◴[22 Aug 25 16:05 UTC] No.44986230{7}[source]▶

>>44986212 #

That's the thing. It’s not more options. During extraction it picks the right algorithm automatically, without you needing to pass another option.

39. perihelions ◴[22 Aug 25 16:05 UTC] No.44986233{3}[source]▶

>>44985903 #

You missed out on the thread!

https://news.ycombinator.com/item?id=44886647 ("FFmpeg 8.0 adds Whisper support (ffmpeg.org)"—9 days ago, 331 comments)

40. aidenn0 ◴[22 Aug 25 16:05 UTC] No.44986239{6}[source]▶

>>44986179 #

You for got the -z (or -a with a recent gnutar).

replies(1): >>44986300 #

41. ◴[22 Aug 25 16:05 UTC] No.44986242{3}[source]▶

>>44986127 #

42. neckro23 ◴[22 Aug 25 16:05 UTC] No.44986245{3}[source]▶

>>44986024 #

Not necessarily. A lot of video software either leverages the Windows/MacOS system codecs (ex. Media Player Classic, Quicktime) or proprietary vendor codecs (Adobe/Blackmagic).

Linux doesn't really have a system codec API though so any Linux video software you see (ex. VLC, Handbrake) is almost certainly using ffmpeg under the hood (or its foundation, libavcodec).

43. sho_hn ◴[22 Aug 25 16:06 UTC] No.44986260{4}[source]▶

>>44985984 #

nope, it's using `find`.

44. adastra22 ◴[22 Aug 25 16:09 UTC] No.44986300{7}[source]▶

>>44986239 #

It’s no longer needed. You can leave it out and it auto-detects the file format.

45. qmr ◴[22 Aug 25 16:09 UTC] No.44986302[source]▶

>>44985730 (OP) #

Exciting news.

https://youtu.be/9kaIXkImCAM?si=b_vzB4o87ArcYNfq

replies(2): >>44986317 #>>44995397 #

46. aidenn0 ◴[22 Aug 25 16:10 UTC] No.44986304[source]▶

>>44985924 #

Pretty much.

It also was originally authored by the same person who did lzexe, tcc, qemu, and the current leader for the large text compression benchmark.

Oh, and for most of the 2010's there was a fork due to interpersonal issues on the team.

replies(1): >>44994602 #

47. outside1234 ◴[22 Aug 25 16:11 UTC] No.44986317[source]▶

>>44986302 #

Is this satire, serious, or both. :)

replies(1): >>44988364 #

48. agys ◴[22 Aug 25 16:11 UTC] No.44986323[source]▶

>>44985875 #

LLMs and complex command line tools like FFmpeg and ImageMagick are a perfect combination and work like magic…

It’s really the dream UI/UX from sience fiction movies: “take all images from this folder and crop 100px away except on top, saturate a bit and save them as uncompressed tiffs in this new folder, also assemble them in a video loop, encode for web”.

replies(3): >>44986523 #>>44986533 #>>44987437 #

49. conradev ◴[22 Aug 25 16:13 UTC] No.44986346{4}[source]▶

>>44985969 #

Yeah, you can give an LLM queries like “make this smaller with libx265 and add the hvc1 tag” or “concatenate these two videos” and it usually crushes it. They have a similar level of mastery over imagemagick, too!

replies(2): >>44986808 #>>44998506 #

50. oldgregg ◴[22 Aug 25 16:14 UTC] No.44986348[source]▶

>>44985730 (OP) #

LLMs have really made ffmpeg implementations easy-- the command line options are so expansive and obscure it's so nice to just tell it what you want and have it spit out a crazy ffmpeg command.

replies(1): >>44986705 #

51. jeanlucas ◴[22 Aug 25 16:15 UTC] No.44986359{5}[source]▶

>>44986062 #

it was just a reference to xkcd#1168

I wasn't expecting the downvotes for an xkcd reference

52. elektor ◴[22 Aug 25 16:15 UTC] No.44986360[source]▶

>>44985981 #

For those out of the loop, can you please explain your comment?

replies(1): >>44986486 #

53. Dwedit ◴[22 Aug 25 16:17 UTC] No.44986381[source]▶

>>44985730 (OP) #

Has anyone made a good GUI frontend for accessing the various features of FFMPEG? Sometimes you just want to remux a video without doing any transcoding, or join several video and audio streams together (same codecs).

replies(12): >>44986432 #>>44986441 #>>44986445 #>>44986452 #>>44986714 #>>44986783 #>>44986948 #>>44986958 #>>44987253 #>>44987446 #>>44991369 #>>44991937 #

54. josteink ◴[22 Aug 25 16:17 UTC] No.44986396[source]▶

>>44985730 (OP) #

Nice! Anyone have any idea how and when this will affect downstream projects like yt-dlp, jellyfin, etc? Especially with regard to support for HW-acceleration?

55. stevejb ◴[22 Aug 25 16:20 UTC] No.44986420{5}[source]▶

>>44986076 #

I use the Warp terminal and I can ask it to run —-help and it figures it out

56. perihelions ◴[22 Aug 25 16:21 UTC] No.44986428{5}[source]▶

>>44986129 #

There's actually a pattern of these accounts that look like they were once real people, who stopped commenting, and multiple years later were necromanced as a spam-zombie. You'll notice it clearly if you start looking at the histories of spammers after you [flag] them.

I've complained several times to the mods about it, so I'm sure they're aware too.

replies(1): >>44987087 #

57. joenot443 ◴[22 Aug 25 16:21 UTC] No.44986432[source]▶

>>44986381 #

Handbrake fits the bill, I think!

It's a great tool. Little long in the tooth these days, but gets the job done.

replies(3): >>44986724 #>>44986747 #>>45077004 #

58. pseudosavant ◴[22 Aug 25 16:22 UTC] No.44986441[source]▶

>>44986381 #

I haven't used a GUI I like, but LLMs like ChatGPT have been so good for solving this for me. I tell it exactly what I need it to do and it produces the ffmpeg command to do it.

59. ricardojoaoreis ◴[22 Aug 25 16:22 UTC] No.44986445[source]▶

>>44986381 #

You can use mkvtoolnix for that and it has a GUI

60. encom ◴[22 Aug 25 16:22 UTC] No.44986446{6}[source]▶

>>44986160 #

It's very similar to every other CLI program, I really don't understand what kind of usability issue you're implying is unique to tar?

replies(1): >>44986903 #

61. patapong ◴[22 Aug 25 16:23 UTC] No.44986452[source]▶

>>44986381 #

I have found the best front-end to be ChatGPT. It is very good at figuring out the commands needed to accomplish something in FFmpeg, from my natural description of what I want to do.

62. deaddodo ◴[22 Aug 25 16:24 UTC] No.44986465{3}[source]▶

>>44986024 #

FFMpeg is definitely fairly ubiquitous, but you are overstating its universality quite a bit. There are alternatives that utilize Windows/macOS's native media frameworks, proprietary software that utilizes bespoke frameworks, and libraries that function independently of ffmpeg that offer similar functionality.

That being said, if you put down a pie chart of media frameworks (especially for transcoding or muxing), ffmpeg would have a significant share of that pie.

63. henryfjordan ◴[22 Aug 25 16:25 UTC] No.44986486{3}[source]▶

>>44986360 #

Netflix uses FFMPEG, will have to update

replies(1): >>44996526 #

64. encom ◴[22 Aug 25 16:26 UTC] No.44986504[source]▶

>>44986130 #

libc :D

65. Barrin92 ◴[22 Aug 25 16:29 UTC] No.44986523{3}[source]▶

>>44986323 #

it can work but it's far from science fiction. LLMs tend to produce extremely subpar if not buggy ffmpeg code. They'll routinely do things like put the file parameter before the start time which needlessly decodes the entire video, produce wrong bitrates, re-encode audio needlessly, and so on.

If you don't care enough about potential side effects to read the manual it's fine, but a dream UX it is not because I'd argue that includes correctness.

replies(1): >>44988548 #

66. xandrius ◴[22 Aug 25 16:30 UTC] No.44986533{3}[source]▶

>>44986323 #

Had to do exactly that with a bunch of screenshots I took but happened to include a bunch of unnecessary parts of the screen.

A prompt to ChatGPT and a command later and all were nicely cropped in a second.

The dread of doing it by hand and having it magically there a minute later is absolutely mind blowing. Even just 5 years ago, I would have just done it manually as it would have definitely taken more to write the code for this task.

67. yieldcrv ◴[22 Aug 25 16:31 UTC] No.44986541{3}[source]▶

>>44985980 #

And also pirated releases are super weird and all over the place with subtitles and video player compatibility

This could streamline things

replies(2): >>44987611 #>>44988199 #

68. tombert ◴[22 Aug 25 16:35 UTC] No.44986584[source]▶

>>44985924 #

Yeah I think pretty much everything that involves video on Linux or FreeBSD in 2025 involves FFmpeg or Gstreamer, usually the former.

It’s exceedingly good software though, and to be fair I think it’s gotten a fair bit of sponsorship and corporate support.

69. mrandish ◴[22 Aug 25 16:35 UTC] No.44986587{3}[source]▶

>>44985906 #

I'd also include Regex in the list of dark arts incantations.

replies(3): >>44986921 #>>44988701 #>>44998486 #

70. BeepInABox ◴[22 Aug 25 16:41 UTC] No.44986655{5}[source]▶

>>44986062 #

For anyone curious, unless you are running a 'tar' binary from the stone ages, just skip the gunzip and cat invocations. Replace .gz with .xz or other well known file ending for different compression.

  Examples:
    tar -cf archive.tar.gz foo bar  # Create archive.tar.gz from files foo and bar.
    tar -tvf archive.tar.gz         # List all files in archive.tar.gz verbosely.
    tar -xf archive.tar.gz          # Extract all files from archive.tar.gz

replies(1): >>44991119 #

71. beala ◴[22 Aug 25 16:41 UTC] No.44986660{4}[source]▶

>>44985984 #

Tough crowd.

fwiw, `tar xzf foobar.tgz` = "_x_tract _z_e _f_iles!" has been burned into my brain. It's "extract the files" spoken in a Dr. Strangelove German accent

Better still, I recently discovered `dtrx` (https://github.com/dtrx-py/dtrx) and it's great if you have the ability to install it on the host. It calls the right commands and also always extracts into a subdir, so no more tar-bombs.

If you want to create a tar, I'm sorry but you're on your own.

replies(2): >>44987223 #>>44991047 #

72. CamperBob2 ◴[22 Aug 25 16:44 UTC] No.44986693{6}[source]▶

>>44986179 #

What value does tar add over plain old zip? That's what annoys me about .tar files full of .gzs or .zips (or vice versa) -- why do people nest container formats for no reason at all?

I don't use tape, so I don't need a tape archive format.

replies(4): >>44987064 #>>44987118 #>>44988357 #>>44989861 #

73. npteljes ◴[22 Aug 25 16:44 UTC] No.44986695[source]▶

>>44986130 #

It's up there in the hall of fame, that's for sure!

74. instagraham ◴[22 Aug 25 16:45 UTC] No.44986705[source]▶

>>44986348 #

I remember saving my incantation to download and convert a youtube playlist (in the form of a txt file with a list of URLs) and this being the only way to back up Chrome music bookmark folders.

Then it stopped working until I updated youtube-dl and then that stopped working once I lost the incantation :<

replies(1): >>44987027 #

75. AlienRobot ◴[22 Aug 25 16:45 UTC] No.44986714[source]▶

>>44986381 #

It would need to be a non-linear editor node-based editor. Pretty much all open source video editors are just FFMPEG frontends, e.g. Kdenlive.

76. selectodude ◴[22 Aug 25 16:46 UTC] No.44986724{3}[source]▶

>>44986432 #

Handbrake receives pretty regular updates.

77. kevinsync ◴[22 Aug 25 16:48 UTC] No.44986747{3}[source]▶

>>44986432 #

Seconded, HandBrake[0] is great for routine tasks / workflows. The UI could be simplified just a tad for super duper simple stuff (ex. ripping a multi-episode tv show disc but don't care about disc extras? you kind of have to hunt and poke based on stream length to decide which parts are the actual episodes. The app itself could probably reliably guess and present you with a 1-click 'queue these up' flow for instance) but otherwise really a wonderful tool!

Past that, I'm on the command line haha

[0] https://handbrake.fr

78. jazzyjackson ◴[22 Aug 25 16:51 UTC] No.44986783[source]▶

>>44986381 #

check out https://github.com/mifi/lossless-cut

79. fleabitdev ◴[22 Aug 25 16:52 UTC] No.44986796[source]▶

>>44985730 (OP) #

Happy to hear that they've introduced video encoders and decoders based on compute shaders. The only video codecs widely supported in hardware are H.264, H.265 and AV1, so cross-platform acceleration for other codecs will be very nice to have, even if it's less efficient than fixed-function hardware. The new ProRes encoder already looks useful for a project I'm working on.

> Only codecs specifically designed for parallelised decoding can be implemented in such a way, with more mainstream codecs not being planned for support.

It makes sense that most video codecs aren't amenable to compute shader decoding. You need tens of thousands of threads to keep a GPU busy, and you'll struggle to get that much parallelism when you have data dependencies between frames and between tiles in the same frame.

I wonder whether encoders might have more flexibility than decoders. Using compute shaders to encode something like VP9 (https://blogs.gnome.org/rbultje/2016/12/13/overview-of-the-v...) would be an interesting challenge.

replies(6): >>44986860 #>>44987988 #>>44988183 #>>44988517 #>>44988613 #>>44990827 #

80. turnsout ◴[22 Aug 25 16:53 UTC] No.44986808{5}[source]▶

>>44986346 #

Yeah, LLMs have honestly made ffmpeg usable for me, for the first time. The difficulty in constructing commands is not really ffmpeg's fault—it's just an artifact of the power of the tool and the difficulties in shoehorning that power into flags for a single CLI tool. It's just not the ideal human interface to access ffmpeg's functionality. But keeping it CLI makes it much more useful as part of a larger and often automated workflow.

81. bachittle ◴[22 Aug 25 16:54 UTC] No.44986820[source]▶

>>44985890 #

Heads up: Whisper support depends on how your FFmpeg was built. Some packages will not include it yet. Check with `ffmpeg -buildconf` or `ffmpeg -filters | grep whisper`. If you compile yourself, remember to pass `--enable-whisper` and give the filter a real model path.

82. mtillman ◴[22 Aug 25 16:56 UTC] No.44986860[source]▶

>>44986796 #

Exciting! I am consistently blown away by the talent of the ffmpeg maintainers. This is fairly hard stuff in my opinion and they do it for free.

replies(1): >>44987349 #

83. profsummergig ◴[22 Aug 25 16:57 UTC] No.44986872{4}[source]▶

>>44985969 #

Just seeking a clarification on how this would be done:

One would use gemini-cli (or claude-cli),

- and give a natural language prompt to gemini (or claude) on what processing needs to be done,

- with the correct paths to FFmpeg and the media file,

- and g-cli (or c-cli) would take it from there.

Is this correct?

replies(2): >>44987054 #>>44989276 #

84. zaik ◴[22 Aug 25 16:59 UTC] No.44986901[source]▶

>>44986130 #

You can check, at least for Arch Linux: https://pkgstats.archlinux.de/packages

85. mrguyorama ◴[22 Aug 25 16:59 UTC] No.44986903{7}[source]▶

>>44986446 #

As has been clearly demonstrated in this very thread, why is "Please list what files are in this archive" the option "-t"?

Principle of least surprise and all that.

replies(1): >>44988913 #

86. RedShift1 ◴[22 Aug 25 17:01 UTC] No.44986921{4}[source]▶

>>44986587 #

I'm ok with regex, but the ffmpeg manpage, it scares me...

replies(1): >>44989101 #

87. mrguyorama ◴[22 Aug 25 17:02 UTC] No.44986948[source]▶

>>44986381 #

Shotcut is an open source Video production toolkit that is basically just a really nice interface for generating ffmpeg commands.

https://www.shotcut.org/

replies(1): >>44987408 #

88. TiredOfLife ◴[22 Aug 25 17:03 UTC] No.44986958[source]▶

>>44986381 #

ChatGPT and other llms

replies(1): >>44987044 #

89. ok123456 ◴[22 Aug 25 17:05 UTC] No.44986979[source]▶

>>44985730 (OP) #

Finally! RealVideo 6 support.

replies(1): >>44990953 #

90. noman-land ◴[22 Aug 25 17:10 UTC] No.44987027{3}[source]▶

>>44986705 #

Check out yt-dlp. It works great.

replies(1): >>44987390 #

91. waihtis ◴[22 Aug 25 17:11 UTC] No.44987035[source]▶

>>44985730 (OP) #

T3.gg in shambles

replies(1): >>44989782 #

92. cubefox ◴[22 Aug 25 17:11 UTC] No.44987044{3}[source]▶

>>44986958 #

Pretty sure ChatGPT counts as a CLI, not as a GUI.

replies(1): >>44990733 #

93. RedShift1 ◴[22 Aug 25 17:12 UTC] No.44987054{5}[source]▶

>>44986872 #

Yes. It works amazingly well for ffmpeg.

replies(1): >>44987079 #

94. diggernet ◴[22 Aug 25 17:13 UTC] No.44987064{7}[source]▶

>>44986693 #

A tar of gzip or zip files doesn't make sense. But gzipping or zipping a tar does.

Gzip only compresses a single file, so .tar.gz lets you bundle multiple files. You can do the same thing with zip, of course, but...

Zip compresses individual files separately in the container, ignoring redundancies between files. But .tar.gz (and .tar.zip, though I've rarely seen that combination) bundles the files together and then compresses them, so can get better compression than .zip alone.

95. profsummergig ◴[22 Aug 25 17:14 UTC] No.44987079{6}[source]▶

>>44987054 #

Thank you.

96. xnx ◴[22 Aug 25 17:15 UTC] No.44987082{6}[source]▶

>>44986138 #

Gemini can now load context from a URL in the API (https://ai.google.dev/gemini-api/docs/url-context), but I'm not sure if that has made it to the web interfaces yet.

97. HenryMulligan ◴[22 Aug 25 17:15 UTC] No.44987087{6}[source]▶

>>44986428 #

Is it too big a leap for me to assume someone is going around using password spraying or whatever to compromise neglected accounts for use as spam bots?

98. fullstop ◴[22 Aug 25 17:15 UTC] No.44987094{4}[source]▶

>>44985984 #

I have so much of tar memorized. cpio is super funky to me, though.

replies(1): >>44998366 #

99. zzzeek ◴[22 Aug 25 17:16 UTC] No.44987103[source]▶

>>44985730 (OP) #

ffmpeg is a treasure to the open source and audio technology communities. The tool cuts right through all kinds of proprietary and arcane roadblocks presented by various codecs and formats and it's clear a tremendous amount of work goes into keeping it all working. The CLI is of course quite opaque and the documentation for various features is often terse, but it's still the only tool on any platform anywhere that will always get you what you need for video and audio processing without ever running up against some kind of commercial paywall.

100. fullstop ◴[22 Aug 25 17:17 UTC] No.44987118{7}[source]▶

>>44986693 #

zip doesn't retain file ownership or permissions.

replies(1): >>44987176 #

101. homebrewer ◴[22 Aug 25 17:18 UTC] No.44987131{4}[source]▶

>>44986110 #

Farming karma and then selling accounts to spammers and astroturfers. Used to be popular on Reddit, now it's everywhere.

102. diggernet ◴[22 Aug 25 17:22 UTC] No.44987176{8}[source]▶

>>44987118 #

Good point. And if I remember right, tar allows longer paths than zip.

103. _kb ◴[22 Aug 25 17:25 UTC] No.44987202[source]▶

>>44986130 #

You can pull the nix logs from here: https://github.com/NixOS/infra/blob/main/metrics/fastly/READ...

Could be an interesting data source to explore that opinion.

104. _kb ◴[22 Aug 25 17:26 UTC] No.44987220[source]▶

>>44985924 #

It's the big flat one at the bottom.

105. diggan ◴[22 Aug 25 17:27 UTC] No.44987223{5}[source]▶

>>44986660 #

I used tar/unzip for decades I think, before moving to 7z which handles all formats I throw at it, and have the same switch for when you want to decompress into a specific directory, instead of having to remember which one of tar and unzip uses -d, and which one uses -C.

"also always extracts into a subdir" sounds like a nice feature though, thanks for sharing another alternative!

106. onehair ◴[22 Aug 25 17:30 UTC] No.44987253[source]▶

>>44986381 #

There is handbrake, vidcoder and all sorts of frontend.

107. np1810 ◴[22 Aug 25 17:37 UTC] No.44987329[source]▶

>>44985730 (OP) #

Thank you FFmpeg developers and contributors!

If there's anything that needs audio/video automation, I've always turned to FFmpeg, it's such a crucial and indispensible tool and so many online video tools use it and are generally a UI wrapper around this wonderful tool. TIL - there's FFmpeg.Wasm also [0].

In Jan 2024, I had used it to extract frames of 1993 anime movie in 15 minutes video segments, upscaled it using Real-ESRGAN-ncnn-vulkan [1] then recombining the output frames for final 4K upscaled anime [2]. FWIW, if I had built a UI on this workflow it could've become a tool similar to Topaz AI which is quite popular these days.

[0]: https://github.com/ffmpegwasm/ffmpeg.wasm

[1]: https://github.com/xinntao/Real-ESRGAN-ncnn-vulkan

[2]: https://files.horizon.pics/3f6a47d0-429f-4024-a5e0-e85ceb0f6...

replies(2): >>44988392 #>>44993824 #

108. droopyEyelids ◴[22 Aug 25 17:39 UTC] No.44987349{3}[source]▶

>>44986860 #

Could you explain more about it? I assumed the maintainers are doing it as part of their jobs for a company (completely baseless assumption)

replies(1): >>44987547 #

109. agos ◴[22 Aug 25 17:39 UTC] No.44987357{3}[source]▶

>>44985906 #

OT, but yours has to be the best username on this site. Props.

replies(1): >>44987545 #

110. TeeMassive ◴[22 Aug 25 17:40 UTC] No.44987367[source]▶

>>44985981 #

And some influencers ;)

replies(1): >>44989521 #

111. TeeMassive ◴[22 Aug 25 17:42 UTC] No.44987390{4}[source]▶

>>44987027 #

yt-dlp works really well, and not only for YouTube ;)

112. toxicosmos ◴[22 Aug 25 17:43 UTC] No.44987408{3}[source]▶

>>44986948 #

Shotcut uses the MLT Multimedia Framework. It is not just a "really nice interface for generating ffmpeg commands"

https://www.mltframework.org/

replies(1): >>44991165 #

113. fragmede ◴[22 Aug 25 17:46 UTC] No.44987431{3}[source]▶

>>44985994 #

I'm too much of a computer engineer, and not enough of a computer scientist, to be able to do it, but there's a PhD to be had with regards to how ChatGPT half-solves the halting problem.

114. euroderf ◴[22 Aug 25 17:47 UTC] No.44987437{3}[source]▶

>>44986323 #

Are you accusing Blade Runner of infringing FFmpeg IP ?

115. filmgirlcw ◴[22 Aug 25 17:47 UTC] No.44987446[source]▶

>>44986381 #

For Mac users, ffWorks [1] is an amazing frontend for FFmpeg that surfaces most of the features but with a decent GUI. It’s batchable and you can setup presets too. It’s one of my favorite apps and the developer is very responsive.

Handbrake and Losslssscut are great too. But in addition to donating to FFmpeg, I pay for ffWorks because it really does offer a lot of value to me. I don’t think there is anything close to its polish on other platforms, unfortunately.

[1]: https://www.ffworks.net/index.html

replies(1): >>44989718 #

116. shmerl ◴[22 Aug 25 17:49 UTC] No.44987471[source]▶

>>44985730 (OP) #

Nice! Looking forward to try WHIP/WebRTC based streaming to replace SRT.

replies(1): >>44987608 #

117. javier2 ◴[22 Aug 25 17:50 UTC] No.44987483[source]▶

>>44985730 (OP) #

What is the performance like for AV1 / h264 in vulkan vs not vulkan?

118. bobsmooth ◴[22 Aug 25 17:55 UTC] No.44987545{4}[source]▶

>>44987357 #

Culón is Spanish for big-bottomed, for anyone else wondering.

119. refulgentis ◴[22 Aug 25 17:55 UTC] No.44987547{4}[source]▶

>>44987349 #

Reupvoted you from gray because I don't think that's fair, but I also don't know how much there is to add. As far as why I'm contributing, I haven't been socially involved in the ffmpeg dev community in a decade, but, it is a very reasonable floor to assume it's 80% not full time paid contributors.

120. Sean-Der ◴[22 Aug 25 18:00 UTC] No.44987608[source]▶

>>44987471 #

What are you using WHIP against today?

I am curious about adoption and features that would make big difference to users :)

replies(2): >>44988211 #>>45015065 #

121. bobsmooth ◴[22 Aug 25 18:00 UTC] No.44987611{4}[source]▶

>>44986541 #

There's websites where you can download subtitles. Usually from very obviously pirated released.

122. happymellon ◴[22 Aug 25 18:29 UTC] No.44987988[source]▶

>>44986796 #

> Happy to hear that they've introduced video encoders and decoders based on compute shaders.

This is great news. I remember being laughed at when I initially asked whether the Vulkan enc/dec were generic because at the time it was all just standardising interfaces for the in-silicon acceleration.

Having these sorts of improvements available for legacy hardware is brilliant, and hopefully a first route that we can use to introduce new codecs and improve everyone's QOL.

123. PokestarFan ◴[22 Aug 25 18:38 UTC] No.44988137[source]▶

>>44986130 #

FFMpeg is probably not as up high since video processing only needs to be done on the servers that receive media. I doubt most phones are running FFMpeg on video.

replies(2): >>44988850 #>>44991507 #

124. gmueckl ◴[22 Aug 25 18:41 UTC] No.44988183[source]▶

>>44986796 #

I haven't even had a cursory look at decoders state of the art for 10+ years. But my intuition would say that decoding for display could profit a lot from GPU acceleration for later parts of the process when there is already pixel data of some sort involved. Then I imagine thet the initial decompression steps could stay on the CPU and the decompressed, but still (partially) encoded data is streamed to the GPU for the final transformation steps and application to whatever I-frames and other base images there are. Steps like applying motion vectors, iDCT... look embarrassingly parallel at a pixel level to me.

When the resulting frame is already in a GPU texture then, displaying it has fairly low overhead.

My question is: how wrong am I?

replies(1): >>44989176 #

125. PokestarFan ◴[22 Aug 25 18:42 UTC] No.44988199{4}[source]▶

>>44986541 #

This is because blurays ship their subtitles as a bunch of text images. So pirates have 3 options:

1. Just copy them over from the Bluray. This lacks support in most client players, so you'll either need to download a player that does, or use something like Plex/Jellyfin, which will run FFMpeg to transcode and burn the picture subtitles in before sending it to the client.

2. Run OCR on the Bluray subtitles. Not perfect.

3. Steal subtitles from a streaming service release (or multiple) if it exists.

replies(1): >>45015213 #

126. shmerl ◴[22 Aug 25 18:43 UTC] No.44988211{3}[source]▶

>>44987608 #

I'm not using it yet, I'm using SRT for LAN streaming, and it was hard to reduce latency. I managed to bring it down to just a bit below 1 second, but supposedly WHIP can help to make it very low which would be neat.

127. beagle3 ◴[22 Aug 25 18:53 UTC] No.44988357{7}[source]▶

>>44986693 #

The zip directory itself is uncompressed, and if you have lots of small files with similar names, zipping the zip makes a huge difference. IIRC in the HVSC (C64 SID music archive), the outer zip used to save another 30%.

128. KolmogorovComp ◴[22 Aug 25 18:54 UTC] No.44988364{3}[source]▶

>>44986317 #

It’s satire done seriously

129. idoubtit ◴[22 Aug 25 18:56 UTC] No.44988392[source]▶

>>44987329 #

Even when I don't use directly ffmpeg, I often use tools that embed ffmpeg. For instance, I've recently upscaled an old anime, ripped from a low quality DVD. I used k4yt3x/video2x, which was good enough for what I wanted, and was easy to install. It embedded libffmpeg, so I could use the same arguments for encoding:

    Video2X-x86_64.AppImage -i "$f" \
     -c libvpx-vp9 -e crf=34 -o "${f/480p/480p_upscale2x}" \
     -p realcugan -s 2 --noise-level 1

To find the best arguments for upscaling (last line from above), I first used ffmpeg to extract a short scene that I encoded with various parameter sets. Then I used ffmpeg to capture still images so that I could find the best set.

replies(1): >>44989261 #

130. cronelius ◴[22 Aug 25 19:03 UTC] No.44988490[source]▶

>>44985730 (OP) #

August 23nd

replies(1): >>44993327 #

131. dtf ◴[22 Aug 25 19:05 UTC] No.44988517[source]▶

>>44986796 #

These release notes are very interesting! I spent a couple of weeks recently writing a ProRes decoder using WebGPU compute shaders, and it runs plenty fast enough (although I suspect Apple has some special hardware they make use of for their implementation). I can imagine this path also working well for the new Android APV codec, if it ever becomes popular.

The ProRes bitstream spec was given to SMPTE [1], but I never managed to find any information on ProRes RAW, so it's exciting to see software and compute implementations here. Has this been reverse-engineered by the FFMPEG wizards? At first glance of the code, it does look fairly similar to the regular ProRes.

[1] https://pub.smpte.org/doc/rdd36/20220909-pub/rdd36-2022.pdf

replies(2): >>44988819 #>>44991851 #

132. amenhotep ◴[22 Aug 25 19:06 UTC] No.44988548{4}[source]▶

>>44986523 #

ffmpeg -i in -ss start -to end out is wrong and bad? You can -ss before -i? TIL!

133. ◴[22 Aug 25 19:12 UTC] No.44988613[source]▶

>>44986796 #

134. zvr ◴[22 Aug 25 19:18 UTC] No.44988701{4}[source]▶

>>44986587 #

I am perfectly at home with regexp, but ffmpeg, magick, and jq are still on the list to master.

135. zvr ◴[22 Aug 25 19:21 UTC] No.44988732[source]▶

>>44986130 #

Curl should be up there, and "SSL" might be lower because of different implementations would split the numbers.

replies(1): >>44988837 #

136. averne_ ◴[22 Aug 25 19:29 UTC] No.44988819{3}[source]▶

>>44988517 #

Do you have a link for that? I'm the guy working on the Vulkan ProRes decoder mentionned as "in review" in this changelog, as part of a GSoC project.

I'm curious wrt how a WebGPU implementation would differ from Vulkan. Here's mine if you're interested: https://github.com/averne/FFmpeg/tree/vk-proresdec

replies(1): >>44988923 #

137. larodi ◴[22 Aug 25 19:30 UTC] No.44988837{3}[source]▶

>>44988732 #

Curl perhaps yes, but it employs zlib and libssl to operate, right so?

replies(1): >>44994931 #

138. 1zael ◴[22 Aug 25 19:31 UTC] No.44988849[source]▶

>>44985730 (OP) #

The Vulkan compute shader implementations are cool...particularly for FFv1 and ProRes RAW. Given that these bypass fixed-function hardware decoders entirely, I'm curious about the memory bandwidth implications. FFv1's context-adaptive arithmetic coding seems inherently sequential, yet they're achieving "very significant speedups."

Are they using wavefront/subgroup operations to parallelize the range decoder across multiple symbols simultaneously? Or exploiting the slice-level parallelism with each workgroup handling independent slices? The arithmetic coding dependency chain has traditionally been the bottleneck for GPU acceleration of these codecs.

I'd love to hear from anyone who's profiled the compute shader implementation - particularly interested in the occupancy vs. bandwidth tradeoff they've chosen for the entropy decoding stage.

139. larodi ◴[22 Aug 25 19:32 UTC] No.44988850{3}[source]▶

>>44988137 #

Well I would imagine portions of it are on every mobile device, and also Netflix and alike surely use it to encode video.

140. encom ◴[22 Aug 25 19:39 UTC] No.44988913{8}[source]▶

>>44986903 #

And why is -v the short option for --invert-match in grep, when that's usually --verbose or --version in lots of other places. These idiosyncrasies are hardly unique to tar.

141. dtf ◴[22 Aug 25 19:39 UTC] No.44988923{4}[source]▶

>>44988819 #

I don't have a link to hand right now, but I'll try to put one up for you this weekend. I'm very interested in your implementation - thanks, will take a good look!

Initially this was just a vehicle for me to get stuck in and learn some WebGPU, so no doubt I'm missing lots of opportunities for optimisation - but it's been fun as much as frustrating. I leaned heavily on the SMPTE specification document and the FFMPEG proresdec.c implementation to understand and debug.

replies(1): >>44988984 #

142. averne_ ◴[22 Aug 25 19:46 UTC] No.44988984{5}[source]▶

>>44988923 #

No problem, just be aware there's a bunch of optimizations I haven't had time to implement yet. In particular, I'd to remove the reset kernel, fuse the VLD/IDCT ones, and try different strategies and hw-dependent specializations for the IDCT routine (AAN algorithm, packed FP16, cooperative matrices).

143. quectophoton ◴[22 Aug 25 19:57 UTC] No.44989101{5}[source]▶

>>44986921 #

Ffmpeg was designed to be unusable if it falls into enemy hands.

replies(1): >>45007175 #

144. fleabitdev ◴[22 Aug 25 20:05 UTC] No.44989176{3}[source]▶

>>44988183 #

I'm not an expert, but in the worst case, you might need to decode dense 4x4-pixel blocks which each depend on fully-decoded neighbouring blocks to their west, northwest, north and northeast. This would limit you to processing `frame_height * 4` pixels in parallel, which seems bad, especially for memory-intensive work. (GPUs rely on massive parallelism to hide the latency of memory accesses.)

Motion vectors can be large (for example, 256 pixels for VP8), so you wouldn't get much extra parallelism by decoding multiple frames together.

However, even if the worst-case performance is bad, you might see good performance in the average case. For example, you might be able to decode all of a frame's inter blocks in parallel, and that might unlock better parallel processing for intra blocks. It looks like deblocking might be highly parallel. VP9, H.265 and AV1 can optionally split each frame into independently-coded tiles, although I don't know how common that is in practice.

145. bena ◴[22 Aug 25 20:13 UTC] No.44989261{3}[source]▶

>>44988392 #

About 10-ish years ago, my then employer was talking to some other company about helping them get their software to release. They had what they believed to be a proprietary compression system that would compress and playback 4k video with no loss in quality.

They wouldn't let us look into the actual codecs or compression, they just wanted us to build a front-end for it.

I got to digging and realized they were just re-encoding the video through FFMpeg with a certain set of flags and options. I was able to replicate their results by just running FFMpeg.

They stopped talking to us.

replies(2): >>44990518 #>>44994943 #

146. logicalmind ◴[22 Aug 25 20:14 UTC] No.44989276{5}[source]▶

>>44986872 #

Another option is to use a non-cli LLM and ask it to produce a script (bash/ps1) that uses ffmpeg to do X, Y, and Z to your video files. If using a chat LLM it will often provide suggestions or ask questions to improve your processing as well. I do this often and the results are quite good.

147. hexfish ◴[22 Aug 25 20:38 UTC] No.44989521{3}[source]▶

>>44987367 #

Indeed: https://m.youtube.com/watch?v=YVI6SCtVu4c

148. IshKebab ◴[22 Aug 25 20:54 UTC] No.44989684[source]▶

>>44986130 #

I think there's quite a few above it. Qt, libpng, libusb etc.

replies(1): >>45011913 #

149. janandonly ◴[22 Aug 25 20:57 UTC] No.44989718{3}[source]▶

>>44987446 #

Is it worth €22?

If it was priced 1-5€ would just buy it I guess. But this.

150. wordofx ◴[22 Aug 25 21:02 UTC] No.44989782[source]▶

>>44987035 #

Wouldn’t be surprised if Theo did a video about investing in ffmpeg and how he revived it and has been consulting to the developers and we should bow down and praise him for resurrecting ffmpeg.

replies(1): >>44995793 #

151. pmarreck ◴[22 Aug 25 21:05 UTC] No.44989808{3}[source]▶

>>44986225 #

i wrote a command “please” that allows me to say “please use ffmpeg to do whatever” and it generates the command with confirmation

152. dns_snek ◴[22 Aug 25 21:11 UTC] No.44989861{7}[source]▶

>>44986693 #

Plain old zip is tricky to parse correctly. If you search for them, you can probably find about a dozen rants about all the problems of working with ZIP files.

153. scyzoryk_xyz ◴[22 Aug 25 21:23 UTC] No.44989964[source]▶

>>44985730 (OP) #

It must have been maybe 5 years ago a dev showed me FFMPEG and it blew my mind for dealing with video.

When I later wound up managing video post production workflows my CMD line or terminal use dropped a few jaws.

I've since been relying on LLM's to make FFMPEG commands so I don't even think about it.

replies(1): >>44990037 #

154. cogogo ◴[22 Aug 25 21:29 UTC] No.44990037[source]▶

>>44989964 #

I had a bad experience with chatgpt think maybe 3 and stopped trying. My thought was the training examples were sparse given how hard a time I had finding what I needed via search. You’ve encouraged me to revisit (and yes I know models have made big gains since then).

replies(1): >>44991382 #

155. Telaneo ◴[22 Aug 25 22:15 UTC] No.44990518{4}[source]▶

>>44989261 #

One more taking part in a time-honoured tradition of taking someone else's thing, adding your own dipping mustard (if even that), and calling it your own.

A new chatbot? Another ChatGPT wrapper. A new Linux Distro. Another Arch with a preinstalled desktop environment. A new video downloader? It's yt-dlp with a GUI.

If they were just honest from the get-go, it'd be fine, but some people aren't.

replies(1): >>44993174 #

156. 1bpp ◴[22 Aug 25 22:34 UTC] No.44990733{4}[source]▶

>>44987044 #

CLII (command line interface interface)

157. mappu ◴[22 Aug 25 22:45 UTC] No.44990827[source]▶

>>44986796 #

NVENC/NVDEC could do part of the processing on the shader cores instead of the fixed-function hardware.

158. mappu ◴[22 Aug 25 22:58 UTC] No.44990953[source]▶

>>44986979 #

Kostya did a lot of the RV60/RMHD reverse engineering work for NihAV back in 2018! His blog also talks about the GPL violations from Real.

The old RV40 had some small advantages over H264. At low bitrates, RV40 always seemed to blur instead of block, so it got used a lot for anime content. CPU-only decoding was also more lightweight than even the most optimized H264 decoder (CoreAVC with the inloop deblocking disabled to save even more CPU).

159. mkl ◴[22 Aug 25 23:10 UTC] No.44991047{5}[source]▶

>>44986660 #

> tar xzf foobar.tgz

You don't need the z, as xf will detect which compression was used, if any.

Creating is no harder, just use c for create instead, and specify z for gzip compression:

  tar czf archive.tar.gz [filename(s)]

Same with listing contents, with t for tell:

  tar tf archive.tar.gz

160. mkl ◴[22 Aug 25 23:16 UTC] No.44991119{6}[source]▶

>>44986655 #

> tar -cf archive.tar.gz foo bar

This will create an uncompressed .tar with the wrong name. You need a z option to specify gzip.

replies(1): >>44991556 #

161. mkl ◴[22 Aug 25 23:21 UTC] No.44991165{4}[source]▶

>>44987408 #

That framework seems to based on ffmpeg: https://www.mltframework.org/faq/

162. JSR_FDED ◴[22 Aug 25 23:25 UTC] No.44991189[source]▶

>>44985730 (OP) #

Tangentially, 50% of effort goes into assembling long complex CLI commands, and 50% fighting with escaping for the shell. Adding text to a video adds it’s own escaping hell for the text.

Has anyone found a bulletproof recipe for calling ffmpeg with many args (filters) from python? Use r-strings? Heredocs?

replies(2): >>44991866 #>>44992052 #

163. neRok ◴[22 Aug 25 23:48 UTC] No.44991369[source]▶

>>44986381 #

Joining videos together sounds easy, but there's tons of ways it can go wrong! You've got time bases to consider, start offsets, frame/overscan crops, fps differences (constant vs variable), etc. And even though your videos might both be h264, one might be encoded with B frames and open GOP, and the other not, and that might cause playback issues in certain circumstances. Similarly, both could be AAC audio, but one is 48kHz sample rate, the other 44.1kHz.

Someone else mentioned Lossless-Cut program, which is pretty good. It has a merge feature that has a compatibility checker ability that can detect a few issues. But I find transcoding the separate videos to MPEG-TS before joining them can get around many problems. If you fire up a RAM-Disk, it's a fast task.

  ffmpeg -i video1.mp4 -c copy -start_at_zero -fflags +genpts R:\video1.ts;
  ffmpeg -i video2.mp4 -c copy -start_at_zero -fflags +genpts R:\video2.ts;
  ffmpeg -i "concat:R:\video1.ts|R:\video2.ts" -c copy -movflags +faststart R:\merged.mp4

164. scyzoryk_xyz ◴[22 Aug 25 23:49 UTC] No.44991382{3}[source]▶

>>44990037 #

Well. Obviously if you have the attention span it probably makes most sense to actually learn the flags and teach yourself to write FFMPEG commands. That's the serious way to do it if you have a serious workflow.

But I've found it easier to brute force with LLM's because, like, every time I had to do video work it'd be something different. Prompts like 'I need to remove this and this and change the resultion from this to that', 'I need it to be this fps or that, or even I want this file to weigh this much. Or I 'need to split these two' or 'combine those three'. It'll usually get you a chunk of the way there. Another prompt or two of double-checking, copy paste into CMD line or terminal and either brr or error copy paste what does this mean. 3 minutes later it's doing the thing you wanted, and you're more or less understanding what's it giving you.

But I keep an Obsidian file with a bunch commands that made me happy before. Dumping that I to the context window helps.

Another one has been multi camera, multi screen recordings with OBS. I discovered it was easier to do the math, make a big canvas, record all the feeds onto those so I don't have to think about syncing anything later. Then brr an FFMPEG command to output that 1920x1080 and that 3840x2160

Whisper is great with that too - raw recording, output just the audio. 'give me whisper command to get this as srt'. Then 'now render subtitles onto this video'

There was an experiment I tried that kinda almost worked where I had this boring recording of some conversation but needed to extract scattered bits. Used whisper to get transcript, put that into LLM, used that to zero in on the actual bits that were important, then got it to spit out the timecodes. Then hobbled together this janky script that cut out those bits and stitched them together. That was faster than taking the time to do it with a GUI and listening it all through.

Of course there are tools like opus clip that spit that out for you now so...

Although to be honest, when the stakes go high and you're doing something serious that requires quality you do it slow.

The point at which I was doing this most was when I was doing video UX/UI research on a hardware/software product. We would set up multi-cams, set and forget so we could talk to subjects and not think about what's being captured.

Dozens of hours of footage, little clips that would end up as insights on the Product Discovery Jira for the thing. So quality wasn't really important.

165. tush726 ◴[22 Aug 25 23:59 UTC] No.44991484[source]▶

>>44985730 (OP) #

ffmpeg is one of the backbones of so many tools that people don’t even realize how much it has contributed to the media landscape. It’s my go to tool for any kind of audio/video automation.

166. neRok ◴[23 Aug 25 00:01 UTC] No.44991507{3}[source]▶

>>44988137 #

Chrome and Firefox use FFmpeg libraries to decode media, so it's in more places than you might think! (But also, ChatGPT said it's not used in Android browser apps because they would use Android's "native" media stack).

167. Intermernet ◴[23 Aug 25 00:06 UTC] No.44991556{7}[source]▶

>>44991119 #

Apparently this is now automatically determined by the file name, but I still habitually add the flag. 30 years of muscle memory is hard to break!

replies(1): >>44991953 #

168. emersion ◴[23 Aug 25 00:52 UTC] No.44991851{3}[source]▶

>>44988517 #

Pretty much reverse engineered: https://mk.pars.ee/notes/a9ihgynpvdo6003w

169. edge17 ◴[23 Aug 25 00:55 UTC] No.44991866[source]▶

>>44991189 #

Agree with this, but I think LLM's have been a net positive in helping generate commands? Admittedly, getting working commands is still tough sometimes, and i'm 50/50 on whether ChatGPT saved me time vs reading docs.

170. avhon1 ◴[23 Aug 25 01:04 UTC] No.44991937[source]▶

>>44986381 #

Every frontend offers only a small subset of ffmpeg's total features, making them usable only for specific tasks.

replies(1): >>45077027 #

171. mkl ◴[23 Aug 25 01:06 UTC] No.44991953{8}[source]▶

>>44991556 #

I tried it to check before making the comment. In Ubuntu 25.04 it does not automatically enable compression based on the filename. The automatic detection when extracting is based on file contents, not name.

replies(1): >>44997537 #

172. ElectricalUnion ◴[23 Aug 25 01:21 UTC] No.44992052[source]▶

>>44991189 #

subprocess.run, with list args?

173. themafia ◴[23 Aug 25 02:17 UTC] No.44992428{5}[source]▶

>>44986062 #

    gzip -dc backup.tar.gz | tar -x

You can skip a step in your pipeline.

174. eviks ◴[23 Aug 25 04:07 UTC] No.44993053[source]▶

>>44985981 #

Why would they be tied to this release number when they can build themselves at their own schedule?

> Note that these releases are intended for distributors and system integrators. Users that wish to compile from source themselves are strongly encouraged to consider using the development branch

175. np1810 ◴[23 Aug 25 04:31 UTC] No.44993174{5}[source]▶

>>44990518 #

> If they were just honest from the get-go, it'd be fine, but some people aren't.

If it were just individuals doing it, maybe it would've been somewhat digestible. But it's a pity that sometimes even trillion-dollar companies do it.

Pre-LLM days, the doers were atleast aware of their copy/clone/wrapper, but now it's happening unintentionally when LLMs give out modified versions of someone else's code without binding to its license, because AFAIK LLMs do not automatically add licensing details of libraries used inside their outputted code, or do they?

replies(1): >>44995808 #

176. vismit2000 ◴[23 Aug 25 04:46 UTC] No.44993236[source]▶

>>44985730 (OP) #

Is there an easy way to denoise an audio file using ffmpeg to remove constant hum sound from an old audio recording introduced due to low quality of recording instrument?

replies(1): >>44994004 #

177. gyan ◴[23 Aug 25 05:05 UTC] No.44993327[source]▶

>>44988490 #

corrected

178. pabs3 ◴[23 Aug 25 05:12 UTC] No.44993372[source]▶

>>44985730 (OP) #

Has anyone got files/formats that can't be decoded by ffmpeg?

179. renewiltord ◴[23 Aug 25 06:27 UTC] No.44993743[source]▶

>>44985730 (OP) #

Pretty insane software. I use it all the time. Only thing I've wished for is animated webp support because I'm lazy.

180. pwn0 ◴[23 Aug 25 06:44 UTC] No.44993824[source]▶

>>44987329 #

I tried the exact same steps you did with the exact same movie but with Topaz AI and got very bad results which made me abondon the project. I'd be greatful if you could share the upscaled movie.

replies(1): >>45039166 #

181. Ey7NFZ3P0nzAe ◴[23 Aug 25 07:19 UTC] No.44994004[source]▶

>>44993236 #

You should take a look at sox instead. What ffmpeg is to video, sox is to audio.

replies(1): >>45056870 #

182. syockit ◴[23 Aug 25 09:25 UTC] No.44994602{3}[source]▶

>>44986304 #

Brings back memories. There was a time when the fork, libav, became the default on Ubuntu, and ffmpeg commands would say "this command is no longer maintained" or so. That was where I learned that there was a fork, and I thought ffmpeg was going to die as a result because there was heavy development activity on libav compared to ffmpeg initially. Surprise, ffmpeg outlived its fork!

This post talks about the situation back then: https://blog.pkh.me/p/13-the-ffmpeg-libav-situation.html

183. zvr ◴[23 Aug 25 10:36 UTC] No.44994931{4}[source]▶

>>44988837 #

Yes, it uses zlib and some implementation of SSL.

My earlier comment about "SSL" is that the actual library might be OpenSSL, BoringSSL, WolfSSL, GnuTLS, or any one of a number of others. So the number of uses of each one is smaller than the total number of "SSL" uses.

184. ChrisMarshallNY ◴[23 Aug 25 10:40 UTC] No.44994943{4}[source]▶

>>44989261 #

There’s folks that make entire careers, from tuning ffmpeg.

I’d suspect that this is exactly the type of thing that could be achieved with AI tools, though, so that might be a nervous bunch of people.

185. patchtopic ◴[23 Aug 25 12:10 UTC] No.44995397[source]▶

>>44986302 #

you know I cut a whole documentary in ffmpeg?

186. waihtis ◴[23 Aug 25 13:21 UTC] No.44995793{3}[source]▶

>>44989782 #

Hahah

I rarely take enjoyment from online battles but that one was a very pleasing putdown

187. brookst ◴[23 Aug 25 13:23 UTC] No.44995808{6}[source]▶

>>44993174 #

Trillion dollar companies are made up of individuals. People don’t start being honest just because they sign on with a Fortune 500.

188. Am4TIfIsER0ppos ◴[23 Aug 25 15:07 UTC] No.44996526{4}[source]▶

>>44986486 #

Have to? They don't have a kill switch in there, probably.

189. GZGavinZhao ◴[23 Aug 25 15:24 UTC] No.44996646[source]▶

>>44986130 #

*sad curl noises

190. BenjiWiebe ◴[23 Aug 25 17:27 UTC] No.44997537{9}[source]▶

>>44991953 #

If you add a for auto, it will choose the right compression based on the file name.

tar -caf foo.tar.xz foo

Will be an xz compressed tarball.

191. fuzztester ◴[23 Aug 25 19:14 UTC] No.44998366{5}[source]▶

>>44987094 #

cpio is not that hard.

A common use case is:

  $ cpio -pdumv args

See:

  $ man cpio

and here is an example from its Wikipedia page, under the "Operation and archive format" section, under the Copy subsection:

Copy

Cpio supports a third type of operation which copies files. It is initiated with the pass-through option flag (p). This mode combines the copy-out and copy-in steps without actually creating any file archive. In this mode, cpio reads path names on standard input like the copy-out operation, but instead of creating an archive, it recreates the directories and files at a different location in the file system, as specified by the path given as a command line argument.

This example copies the directory tree starting at the current directory to another path new-path in the file system, preserving files modification times (flag m), creating directories as needed (d), replacing any existing files unconditionally (u), while producing a progress listing on standard output (v):

$ find . -depth -print | cpio -p -dumv new-path

replies(1): >>45000344 #

192. lukeschlather ◴[23 Aug 25 19:31 UTC] No.44998486{4}[source]▶

>>44986587 #

Regex is only difficult because it's complicated, the primitives are all sensibly arranged and predictable. FFmpeg is layers of dark magic where the primitives are often inscrutable before you compose them.

193. lukeschlather ◴[23 Aug 25 19:34 UTC] No.44998506{5}[source]▶

>>44986346 #

It's funny because GPU stuff like what this article is about is where the LLMs totally fall apart. I can make any LLM produce volumes hallucinations at the drop of a hat by asking it how to construct ffmpeg commands that use hardware acceleration.

194. fullstop ◴[24 Aug 25 00:59 UTC] No.45000344{6}[source]▶

>>44998366 #

I think that it's the fact that it requires a pipe to work and that you add files by feeding stdin that throw me for a loop.

I also use it very infrequently compared to tar -- mostly in conjunction with swupdate. I've also run into file size limits, but that's not really a function of the command line interface to the tool.

195. falloon ◴[24 Aug 25 19:53 UTC] No.45007175{6}[source]▶

>>44989101 #

I defer understanding FFMPEG arguments to the LLMs.

196. account42 ◴[25 Aug 25 09:24 UTC] No.45011913{3}[source]▶

>>44989684 #

libpng and libjpeg I can see.

But Qt and libusb above ffmpeg? No way.

197. JimmaDaRustla ◴[25 Aug 25 15:40 UTC] No.45015065{3}[source]▶

>>44987608 #

broadcast box

198. balder1991 ◴[27 Aug 25 13:08 UTC] No.45039166{3}[source]▶

>>44993824 #