
lsr: ls with io_uring

(rockorager.dev)
335 points by mpweiher | 8 comments
1. maplant No.44605037
This seems more interesting as a demonstration of the amortized performance increase you'd expect from using io_uring, or as a tutorial for using it. I don't understand why I'd switch from using something like eza. If I'm listing 10,000 files, the difference is between 40ms and 20ms. I absolutely would not notice that for a single invocation of the command.
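
To make the amortization concrete, here is a minimal liburing sketch (my own illustration, not lsr's actual code) where a single submit covers a whole batch of statx calls instead of paying one stat(2) syscall per file. The file names are placeholders; build with `cc batch_stat.c -luring`.

    #define _GNU_SOURCE
    #include <fcntl.h>
    #include <liburing.h>
    #include <stdio.h>
    #include <sys/stat.h>

    #define NFILES 3

    int main(void)
    {
        /* Placeholder names; a real ls would get these from the
           directory listing first. */
        const char *names[NFILES] = { "a.txt", "b.txt", "c.txt" };
        struct statx st[NFILES];
        struct io_uring ring;

        if (io_uring_queue_init(NFILES, &ring, 0) < 0)
            return 1;

        /* Queue one statx request per file... */
        for (int i = 0; i < NFILES; i++) {
            struct io_uring_sqe *sqe = io_uring_get_sqe(&ring);
            io_uring_prep_statx(sqe, AT_FDCWD, names[i], 0,
                                STATX_BASIC_STATS, &st[i]);
            io_uring_sqe_set_data(sqe, (void *)(long)i);
        }

        /* ...then issue them all with a single submit. */
        io_uring_submit(&ring);

        for (int i = 0; i < NFILES; i++) {
            struct io_uring_cqe *cqe;
            if (io_uring_wait_cqe(&ring, &cqe) < 0)
                break;
            int idx = (int)(long)io_uring_cqe_get_data(cqe);
            if (cqe->res == 0)
                printf("%s: %llu bytes\n", names[idx],
                       (unsigned long long)st[idx].stx_size);
            io_uring_cqe_seen(&ring, cqe);
        }

        io_uring_queue_exit(&ring);
        return 0;
    }

A real implementation would presumably batch the directory reads and opens the same way, but the single-submit trick above is the core of the amortization.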
replies(2): >>44605508 >>44606229
2. 0x000xca0xfe No.44605508
Well, I have a directory with a couple million JSON files, and ls/du take minutes.

Most of the coreutils are not fast enough to actually utilize modern SSDs.
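
For scale, a raw getdents64(2) loop with a 1 MiB buffer pulls thousands of entries back per syscall; readdir(3) batches internally too, but with a smaller buffer and per-entry call overhead. A rough sketch of my own (not from any coreutil) that just counts entries:

    #define _GNU_SOURCE
    #include <fcntl.h>
    #include <stdio.h>
    #include <sys/syscall.h>
    #include <unistd.h>

    /* The kernel's directory entry layout for getdents64(2). */
    struct linux_dirent64 {
        unsigned long long d_ino;
        long long          d_off;
        unsigned short     d_reclen;
        unsigned char      d_type;
        char               d_name[];
    };

    int main(int argc, char **argv)
    {
        int fd = open(argc > 1 ? argv[1] : ".", O_RDONLY | O_DIRECTORY);
        if (fd < 0) { perror("open"); return 1; }

        static char buf[1 << 20];  /* 1 MiB: many entries per syscall */
        long total = 0;

        for (;;) {
            long n = syscall(SYS_getdents64, fd, buf, sizeof(buf));
            if (n < 0) { perror("getdents64"); return 1; }
            if (n == 0) break;     /* end of directory */
            for (long off = 0; off < n; ) {
                struct linux_dirent64 *d = (void *)(buf + off);
                total++;           /* a real ls would use d->d_name here */
                off += d->d_reclen;
            }
        }
        printf("%ld entries\n", total);
        close(fd);
        return 0;
    }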

replies(1): >>44605839
3. otterley No.44605839
What’s the filesystem type? Ext4 suffers terrible lookup performance with large directories, while XFS absolutely flies.
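
If you want to check programmatically (df -T works too), here's a small statfs(2) sketch of my own; the magic numbers come from <linux/magic.h>, and note that ext2/3/4 all share the same magic:

    #include <linux/magic.h>
    #include <stdio.h>
    #include <sys/vfs.h>

    int main(int argc, char **argv)
    {
        struct statfs sb;
        if (statfs(argc > 1 ? argv[1] : ".", &sb) != 0) {
            perror("statfs");
            return 1;
        }
        switch (sb.f_type) {
        case EXT4_SUPER_MAGIC:  /* 0xEF53, shared by ext2/3/4 */
            puts("ext2/3/4");
            break;
        case XFS_SUPER_MAGIC:
            puts("xfs");
            break;
        default:
            printf("f_type = 0x%lx\n", (long)sb.f_type);
            break;
        }
        return 0;
    }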
replies(1): >>44606046
4. 0x000xca0xfe No.44606046
Yup, default ext4 and most files are <4KB, so it's extra bad.

Thanks for the comment, didn't know that!

5. rockorager No.44606229
Yeah, I wrote this as a fun little experiment to learn more about io_uring. The practical savings of using this are tiny, maybe 5 seconds over your entire life. That wasn't the point haha
replies(2): >>44606524 >>44606697
6. JuettnerDistrib No.44606524
I'd be curious to know if this helps on supercomputers, which are notorious for frequently hanging for a few seconds on an `ls -l`.
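
(A rough way to see where the time goes: the -l is the expensive part, since it forces one stat per entry, and each of those can be a metadata round trip. A little sketch of my own that times enumeration plus a per-entry fstatat:)

    #define _GNU_SOURCE
    #include <dirent.h>
    #include <fcntl.h>
    #include <stdio.h>
    #include <sys/stat.h>
    #include <time.h>

    static double now(void)
    {
        struct timespec ts;
        clock_gettime(CLOCK_MONOTONIC, &ts);
        return ts.tv_sec + ts.tv_nsec / 1e9;
    }

    int main(int argc, char **argv)
    {
        DIR *d = opendir(argc > 1 ? argv[1] : ".");
        if (!d) { perror("opendir"); return 1; }

        double t0 = now();
        long entries = 0;
        struct dirent *de;
        while ((de = readdir(d)) != NULL) {
            entries++;
            /* The "-l" part: one metadata lookup per name. On a
               networked filesystem this is where the hang lives. */
            struct stat st;
            fstatat(dirfd(d), de->d_name, &st, AT_SYMLINK_NOFOLLOW);
        }
        printf("%ld entries stat'ed in %.3fs\n", entries, now() - t0);
        closedir(d);
        return 0;
    }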
replies(1): >>44608279
7. maplant No.44606697
It's a very cool experiment. I just wanted to steer the conversation towards those things rather than towards whether or not this was a good ls replacement, because, like you say, that feels like missing the point.
8. mrlongroots No.44608279
It could, but it's important to keep in mind that the filesystem architecture there is also very different: a parallel filesystem with disaggregated data and metadata.

When you run `ls -l` you could potentially be enumerating a directory with one file per rank, or worse, one file per particle or something. You could try making the read fast, but I also think it makes no sense to have that many files in the first place: you can do things to reduce the number of files on disk. Also, many are trying to push for distributed object stores instead of parallel filesystems... fun space.