Reproducible benchmark code? #3

craigbarnes · 2019-07-16T20:39:05Z

Is the code used to produce the numbers in the readme available somewhere? Based on the methodology described, it seems likely that triehash is benefiting significantly from (unrealistically favourable) branch prediction:

"... each hash function was run 1,000,000 times for each word"

julian-klode · 2019-07-16T20:44:35Z

See https://github.com/julian-klode/hashbench

julian-klode · 2019-07-16T20:46:58Z

Running the entire thing needs access to the Debian porter boxes, and probably some updates for changes there, but it can also be run locally (make run).

julian-klode · 2019-07-16T20:49:58Z

Not sure how you'd test that differently, though. You do need to run the function a lot to get usable results.

craigbarnes · 2019-07-16T21:07:59Z

Thanks for the pointers. I just wanted to see how unpredictable inputs would change the results.

To make it fair and avoid adding too much overhead to the loop, I was thinking the best way might be to pre-compute an array of randomly selected (known) words and use words[iteration % ARRAY_SIZE(words)] as the input. Otherwise the branch predictor is getting perfect conditions for most of the 1,000,000 iterations, when in practice most of the branches are probably highly unpredictable.
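A minimal sketch of that idea, not taken from hashbench: the word list, `N_INPUTS`, `fill_inputs()`, `bench()` and the FNV-1a stand-in `hash_under_test()` are all hypothetical names chosen for illustration. It pre-computes the randomly selected inputs (and their lengths) outside the timed loop, so the loop itself only adds an index and a modulo.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <stdlib.h>
#include <string.h>
#include <time.h>

#define ARRAY_SIZE(a) (sizeof(a) / sizeof((a)[0]))
#define N_INPUTS 1024 /* assumed size; small enough to stay cache-resident */

/* Hypothetical stand-in for the function being measured (FNV-1a). */
static uint32_t hash_under_test(const char *s, size_t len)
{
    uint32_t h = 2166136261u;
    for (size_t i = 0; i < len; i++) {
        h ^= (unsigned char)s[i];
        h *= 16777619u;
    }
    return h;
}

/* Illustrative set of known words; a real run would use the benchmark's own. */
static const char *const known_words[] = {
    "Package", "Version", "Depends", "Description", "Maintainer", "Architecture",
};

struct input {
    const char *str;
    size_t len;
};

static struct input inputs[N_INPUTS];

/* Pick the words at random once, before timing starts. */
static void fill_inputs(unsigned seed)
{
    srand(seed);
    for (size_t i = 0; i < ARRAY_SIZE(inputs); i++) {
        const char *w = known_words[(size_t)rand() % ARRAY_SIZE(known_words)];
        inputs[i].str = w;
        inputs[i].len = strlen(w);
    }
}

/* Time `iterations` calls, cycling through the pre-computed inputs.
 * The accumulated hash is written to *sink so the calls are not optimized away. */
static double bench(size_t iterations, uint32_t *sink)
{
    assert(sink != NULL);
    uint32_t acc = 0;
    clock_t start = clock();
    for (size_t i = 0; i < iterations; i++) {
        const struct input *in = &inputs[i % ARRAY_SIZE(inputs)];
        acc += hash_under_test(in->str, in->len);
    }
    clock_t end = clock();
    *sink = acc;
    return (double)(end - start) / CLOCKS_PER_SEC;
}
```

Calling `fill_inputs(seed)` once and then `bench(1000000, &sink)` from `main` would give a number to compare against the same-word loop. The array size is a trade-off: a very short array may let the branch predictor learn the repeating sequence, while a very large one adds data-cache misses to the measurement.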

julian-klode · 2019-07-16T21:12:49Z

Won't that be affected too much by data caches?

craigbarnes · 2019-07-16T21:16:18Z

It'll increase cache pressure I guess, but not by much for a small array. It'd certainly be much closer to a real world scenario than repeating the same input a million times.
