Hyperfine: A command-line benchmarking tool

(github.com)

241 points hundredwatt | 1 comments | 18 Nov 24 21:47 UTC | HN request time: 0.21s | source

Show context

forrestthewoods ◴[19 Nov 24 06:37 UTC] No.42180588[source]▶

Hyperfine is hyper frustrating because it only works with really really fine microsecond level benchmarks. Once you get into the millisecond range it’s worthless.

replies(2): >>42180660 #>>42182084 #

anotherhue ◴[19 Nov 24 06:50 UTC] No.42180660[source]▶

>>42180588 #

It spawns a new process each time right? I would think that would but a cap on how accurate it can get.

For my purposes I use it all the time though, quick and easy sanity-check.

replies(2): >>42180722 #>>42180749 #

forrestthewoods ◴[19 Nov 24 07:03 UTC] No.42180722[source]▶

>>42180660 #

The issue is it runs a kajillion tests to try and be “statistical”. But there’s no good way to say “just run it for 5 seconds and give me the best answer you can”. It’s very much designed for nanosecond to low microsecond benchmarks. Trying to fight this is trying to smash a square peg through a round hole.

replies(3): >>42180876 #>>42180891 #>>42182129 #

1. sharkdp ◴[19 Nov 24 11:00 UTC] No.42182129[source]▶

>>42180722 #

> The issue is it runs a kajillion tests to try and be “statistical”.

If you see any reason for putting “statistical” in quotes, please let us know. hyperfine does not run a lot of tests, but it does try to find outliers in your measurements. This is really valuable in some cases. For example: we can detect when the first run of your program takes much longer than the rest of the runs. We can then show you a warning to let you know that you probably want to either use some warmup runs, or a "--prepare" command to clean (OS) caches if you want a cold-cache benchmark.

> But there’s no good way to say “just run it for 5 seconds and give me the best answer you can”.

What is the "best answer you can"?

> It’s very much designed for nanosecond to low microsecond benchmarks.

Absolutely not. With hyperfine, you can not measure execution times in the "low microsecond" range, let alone nanosecond range. See also my other comment.

↑