
Consider adding high-precision microbenchmarks for small sequences #148

Closed
ldionne opened this issue Aug 24, 2016 · 7 comments

@ldionne (Owner) commented Aug 24, 2016

Forking #124 to discuss high-precision microbenchmarks for small inputs in more detail. The idea is that we'd like a precise view of the behavior of algorithms on small sequences, because this is how they are mostly (but not only) used in the wild. I see two different approaches:

  1. For each sequence length N, generate K different sequences of length N and call the algorithm once on each sequence (see the sketch below). With a sufficiently large K, the total compilation time increases and the relative measurement error shrinks.
  2. Same as (1), but then divide the result by K to get an approximation of the absolute time taken by a single call of the algorithm. I have reservations about this approach, because compilation time is not necessarily linear in K (the number of small sequences), so dividing by K need not reflect the true per-call cost.
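
To make (1) concrete, here is a minimal sketch of what a single data point could look like, with a toy `length` metafunction standing in for the algorithm under measurement. None of the names here (`seq`, `x`, `make_seq`, `run_all`) come from this repository; the point is only that each of the K sequences uses distinct element types, so the compiler cannot memoize instantiations across repetitions:

```cpp
#include <cstddef>
#include <initializer_list>
#include <utility>

// Stand-in for the algorithm under measurement; a real benchmark would
// instantiate a metaprogramming library's algorithm here instead.
template <typename ...T> struct seq {};
template <typename Seq> struct length;
template <typename ...T> struct length<seq<T...>> {
    static constexpr std::size_t value = sizeof...(T);
};

// A distinct element type per (k, i) pair, so none of the K sequences
// share template instantiations the compiler could reuse.
template <std::size_t K, std::size_t I> struct x {};

// Builds the k-th sequence of length N: seq<x<k, 0>, ..., x<k, N-1>>.
template <std::size_t K, typename Is> struct make_seq;
template <std::size_t K, std::size_t ...Is>
struct make_seq<K, std::index_sequence<Is...>> {
    using type = seq<x<K, Is>...>;
};

constexpr std::size_t sum(std::initializer_list<std::size_t> xs) {
    std::size_t total = 0;
    for (auto v : xs) total += v; // C++14 relaxed constexpr
    return total;
}

// Instantiates the algorithm once on each of the K sequences of length N.
template <std::size_t N, typename Ks> struct run_all;
template <std::size_t N, std::size_t ...Ks>
struct run_all<N, std::index_sequence<Ks...>> {
    static constexpr std::size_t total = sum({
        length<typename make_seq<Ks, std::make_index_sequence<N>>::type>::value...
    });
};

// Compiling this translation unit exercises the algorithm K = 50 times on
// sequences of length N = 10; the benchmark records the compiler's wall time.
static_assert(run_all<10, std::make_index_sequence<50>>::total == 500, "");
int main() {}
```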
@brunocodutra (Collaborator) commented Aug 24, 2016

Would K be constant for all N for a given algorithm? If so, I'd be OK with the first option. If not, I think we'd have to stick with the second one, because otherwise the charts would be unintuitive to the viewer.

Edit: Just to clarify: since the relative overhead already decreases for larger N, a smaller K suffices there, so having K decrease as N increases would keep build times reasonably short. This is quite apparent in benchmarks that take over a second for N > 100, where the noise basically disappears even on Travis.

@ldionne (Owner, Author) commented Aug 24, 2016

Would K be constant for all N for a given algorithm?

We can choose to go either way. I agree that decreasing K as N increases would help keep the build times short, and that having a large K for large Ns would not be useful. However, I'm unclear on how we'd specify which K to associate with which N.

@brunocodutra (Collaborator) commented

I'm unclear on how we would specify which K to associate to which N

I believe choosing K so that K*N = <some constant> should be good enough.
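
For illustration, a tiny sketch of that rule, assuming a made-up instantiation budget of 1000 (the `k_for` helper and the budget value are hypothetical, not part of the benchmark suite): N = 10 gives K = 100, N = 100 gives K = 10, clamped so K is at least 1 for very large N.

```cpp
#include <cstddef>

// Hypothetical helper: pick K so that K * N stays near a fixed budget.
// Requires n >= 1; the budget of 1000 is made up for illustration.
constexpr std::size_t k_for(std::size_t n, std::size_t budget = 1000) {
    return budget / n < 1 ? 1 : budget / n; // clamp so K >= 1 for large N
}

static_assert(k_for(10)   == 100, ""); // small N: many repetitions
static_assert(k_for(100)  == 10,  ""); // larger N: fewer repetitions
static_assert(k_for(5000) == 1,   ""); // very large N: just one call
int main() {}
```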

We can choose to go either way.

So let's take one step at a time. Since the first option is basically a special case of the second one, why don't we implement (1) and then, if necessary, refine it into (2)?

@ldionne (Owner, Author) commented Aug 25, 2016

So let's take one step at a time. Since the first option is basically a special case of the second one, why don't we implement (1) and then, if necessary, refine it into (2)?

Sounds good. That raises (at least) one question: do we want to tweak the existing benchmarks so that they have higher precision for smaller sequences, or do we want to create a new set of benchmarks for "small sequences"? The first option would obviously be better if it can be done, but it might end up taking too much Travis time, depending on how smart we can be.

@brunocodutra (Collaborator) commented Aug 25, 2016

do we want to tweak the existing benchmarks so that they have higher precision for smaller sequences, or do we want to create a new set of benchmarks for "small sequences"?

I'd say a bit of both: reduce the maximum size of sequences, so that we don't abuse Travis too much, and make sure all measurements are precise enough that results don't vary much across runs. As for a number, @ericniebler suggested we focus on < 100 elements. I have no strong opinion on this, but I think it's a reasonable suggestion.

I'm just not very comfortable with the idea of heuristically fine-tuning K for each algorithm until the results "look good", but I'm afraid we have no better way?

@ldionne (Owner, Author) commented Aug 26, 2016

I don't see a better way either.

@ldionne (Owner, Author) commented Oct 23, 2016

This has been resolved by #150.

@ldionne closed this as completed Oct 23, 2016