917 points cryptophreak | 8 comments
andreldm ◴[] No.45761642[source]
If handbrake scares them, don’t you dare demonstrate how to use ffmpeg. I remember when I used handbrake for the first time and thought “wow, it’s much more convenient than struggling with ffmpeg”.
replies(7): >>45761715 #>>45761913 #>>45761996 #>>45762179 #>>45762626 #>>45762929 #>>45764672 #
1. MarkusWandel ◴[] No.45761996[source]
At least with ffmpeg, for 99% of use cases you can just google "how do I do X with ffmpeg" and get a copypasta command line.

Whereas with complicated GUI tools, you have to watch a video to learn how to do it.

replies(4): >>45762099 #>>45762129 #>>45762226 #>>45762333 #
2. nicce ◴[] No.45762099[source]
This is one of the things LLMs shine at. To double-check the command explanations, I ask for commands that grep the relevant sections from the manual instead of relying on LLM output blindly.
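
A rough sketch of that double-checking step, in Python: pull just the entry for one flag out of a man page's text so it can be compared with what the LLM claimed. The function name `option_section` and the sample text are made up for illustration, and the indentation heuristic is an assumption about typical man page layout, not something every page follows.

```python
import re

def option_section(man_text: str, flag: str) -> str:
    """Return the block of lines documenting `flag`, or "" if not found.

    Heuristic (an assumption about layout): an entry starts on a line whose
    leading option tokens include the flag, and continues while the following
    lines are blank or indented deeper than the entry's first line.
    """
    lines = man_text.splitlines()
    # Match the flag as a whole token at the start of an entry, e.g. "-o, --output FILE".
    start = re.compile(r"^\s*(?:-\S+,?\s+)*" + re.escape(flag) + r"(?=[\s,=\[]|$)")
    for i, line in enumerate(lines):
        if not start.match(line):
            continue
        indent = len(line) - len(line.lstrip())
        block = [line]
        for nxt in lines[i + 1:]:
            if nxt.strip() and len(nxt) - len(nxt.lstrip()) <= indent:
                break  # reached the next entry or a section heading
            block.append(nxt)
        return "\n".join(block).rstrip()
    return ""

# Hypothetical man page excerpt, not taken from any real tool.
SAMPLE = """\
OPTIONS
       -a, --all
              Show everything.

       -o, --output FILE
              Write the result to FILE instead of standard output.

       -v, --verbose
              Be chatty.
"""

print(option_section(SAMPLE, "-o"))
```

On a real system the text could come from something like `subprocess.run(["man", "ffmpeg"], capture_output=True, text=True).stdout`, though how that output is formatted depends on the local `man` setup.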
3. xnx ◴[] No.45762129[source]
Excellent point. Soon computer use AI agents will bridge this gap.
4. left-struck ◴[] No.45762226[source]
I think GUI tools lend themselves more to being able to discover functionality intuitively without needing to look anything up or read a manual, and especially so if you’re coming back to a task you haven’t done in a while. With CLI I constantly have to google or ask an LLM about commands I’ve done many times, whereas with a GUI, if I do it once, I can more easily find my way the next time. Anyway, both have their place.
replies(1): >>45762475 #
5. xnorswap ◴[] No.45762333[source]
We're going full-circle, because LLMs are amazing for producing just the right incantation of arcane command-line tools. I was struggling to decrypt a file the other day and it whipped me up exactly the right openssl command to get it done.

From there I was able to ask, "Can I have the equivalent source code?" and it did that too, which let me spot the mistake in my original attempt. (The KDF was using MD5, not SHA.)
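
For anyone wondering why the digest matters here, a minimal Python sketch of the legacy key derivation that `openssl enc` falls back to when `-pbkdf2` is not given (OpenSSL's `EVP_BytesToKey` with a single iteration). To the best of my knowledge the default digest for `enc` changed from MD5 to SHA-256 in OpenSSL 1.1.0, which is one common way to end up with this kind of mismatch. The password and salt are made-up values, and this is only an illustration, not the code from the session described above.

```python
import hashlib

def evp_bytes_to_key(password: bytes, salt: bytes, key_len: int, iv_len: int,
                     digest: str = "sha256") -> tuple[bytes, bytes]:
    """Derive (key, iv) the way EVP_BytesToKey does with an iteration count of 1.

    D_1 = H(password || salt), D_i = H(D_{i-1} || password || salt);
    blocks are concatenated until key_len + iv_len bytes are available.
    """
    derived = b""
    block = b""
    while len(derived) < key_len + iv_len:
        block = hashlib.new(digest, block + password + salt).digest()
        derived += block
    return derived[:key_len], derived[key_len:key_len + iv_len]

password = b"hunter2"                      # hypothetical passphrase
salt = bytes.fromhex("0102030405060708")   # hypothetical 8-byte salt

# Same passphrase and salt, AES-256-CBC sizes, two different digests.
key_md5, iv_md5 = evp_bytes_to_key(password, salt, 32, 16, "md5")
key_sha, iv_sha = evp_bytes_to_key(password, salt, 32, 16, "sha256")

print("md5    key:", key_md5.hex())
print("sha256 key:", key_sha.hex())
```

The two keys share nothing, so decrypting with the wrong digest yields garbage or a padding error rather than a helpful message. With the real tool, the digest can be pinned explicitly with `-md md5` or `-md sha256`.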

I'm willing to bet that LLMs are also just as good at coming up with the right ffmpeg or imagemagick commands with just a vague notion of what is wanted.

Like, can we vignette the video and then add a green alien to the top corner? Sure we can (NB: I've not actually verified the result here) : https://claude.ai/share/5a63c01d-1ba9-458d-bb9d-b722367aea13

replies(1): >>45763167 #
6. thaumasiotes ◴[] No.45762475[source]
> I think GUI tools lend themselves more to being able to discover functionality intuitively without needing to look anything up or read a manual

Well, there are different issues.

Reading a manual is the best you can do, theoretically. But Linux CLI tools have terrible manuals.

I read over the ssh man page multiple times looking for functionality that was available. But the man page failed to make that clear. I had to learn about it from random tutorials instead.

I've been reading lvm documentation recently and it shows some bizarre patterns. Stuff like "for more on this see [related man page]", where [related man page] doesn't have any "more on this". Or, here's what happens if you try to get CLI help:

1. You say `pvs --help`, and get a summary of what flags you can provide to the tool. The big one is -o, documented as `[ -o|--options String ]`. The String defines the information you want. All you have to do is provide the right "options" and you're good. What are they? Well, the --help output ends with this: "Use --longhelp to show all options and advanced commands."

2. Invoke --longhelp and you get nothing about options or advanced commands, although you do get some documentation about the syntax of referring to volumes.

3. Check the man page, and the options aren't there either. Buried inside the documentation for -o is the following sentence: "Use -o help to view the list of all available fields."

4. Back to the command line. `pvs -o help` actually will provide the relevant documentation.

Reading a manual would be fine... if it actually contained the information it was supposed to, arranged in some kind of logically-organized structure. Instead, information on any given topic is spread out across several different types of documentation, with broken cross-references and suggestions that you should try doing the wrong thing.

I'm picking on man pages here, but actually Microsoft's official documentation for their various .NET stuff has the same problem at least as badly.

replies(1): >>45767950 #
7. 8note ◴[] No.45763167[source]
> I'm willing to bet that LLMs are also just as good at coming up with the right ffmpeg or imagemagick commands with just a vague notion of what is wanted.

They are. I've only used ffmpeg via LLM, and it's easy to get the LLM to make the right incantation as part of a multi-step workflow.

My own lack of understanding of video formats is still a problem, but getting ffmpeg to do the right thing only takes a vague notion.

8. Ezhik ◴[] No.45767950{3}[source]
It's so frustrating that most man pages explicitly go out of their way to avoid having examples or answering "how do I X" questions.