Currently, the contents of benchmarks/ are outdated. Both the directory and the accompanying workflow file (.github/workflows/MicroBenchmarks.yml) need updating.
A set of lightweight benchmarks is needed to measure performance (speed) of standard inference algorithms and models between (1) consecutive releases and (2) a PR and the latest release.
These benchmarks should run automatically (via GitHub Actions) whenever a PR is made and when a new release is created.
Benchmarks (timings) for each release should be stored (e.g. as a release asset) for regression testing.
Ideally, a warning would be raised if regressions are detected. Differences in timings between releases should also be stored/recorded.
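For concreteness, below is a minimal sketch (not the final suite) of how BenchmarkTools.jl could drive this: build a suite, run it, save the timings to a JSON file (e.g. to upload as a release asset), and later `judge` a fresh run against the stored baseline to flag regressions. The file name, the placeholder benchmark, and the use of `median` estimates are assumptions, not decisions.

```julia
using BenchmarkTools

# Build a suite; real entries would wrap calls to Turing's `sample`.
suite = BenchmarkGroup()
suite["dummy_sum"] = @benchmarkable sum(rand(1000))  # placeholder benchmark

tune!(suite)
results = run(suite; verbose = true)

# Persist the timings, e.g. to upload as a release asset.
BenchmarkTools.save("benchmark_results.json", results)

# On a PR (or the next release), load the stored baseline and compare.
baseline = BenchmarkTools.load("benchmark_results.json")[1]
comparison = judge(median(results), median(baseline))
for (name, j) in leaves(comparison)
    # `isregression` is true when the time ratio exceeds the judgement tolerance.
    isregression(j) && @warn "Possible performance regression" benchmark = name judgement = j
end
```

In CI the baseline would be downloaded from the latest release rather than read from the local disk.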
Things to consider:
Results from the benchmarks could be stored as GitHub release assets. Open to suggestions for other locations.
Visualizing the benchmarks
Some users are interested in how performance scales for a particular model, for example with data size or number of features. Useful visuals will help in digesting the benchmarks.
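As one possible example (a hedged sketch, not an agreed-upon plot), the snippet below times a toy coin-flip model at a few data sizes with a fixed-length MH run and plots the timings on log-log axes with Plots.jl. The model, sampler settings, and data sizes are all placeholders.

```julia
using Turing, BenchmarkTools, Plots, Random

# Toy model: a single success probability with Bernoulli observations.
@model function coinflip(y)
    p ~ Beta(1, 1)
    for i in eachindex(y)
        y[i] ~ Bernoulli(p)
    end
end

Random.seed!(0)
ns = [100, 1_000, 10_000]
times = map(ns) do n
    data = rand(Bool, n)
    model = coinflip(data)
    # Minimum time (in seconds) of a short, fixed-length MH run.
    @belapsed sample($model, MH(), 500; progress = false)
end

plot(ns, times;
     xscale = :log10, yscale = :log10,
     xlabel = "number of observations",
     ylabel = "time (s)",
     marker = :circle, legend = false)
```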
Models to benchmark
A wide variety of models, potentially at different data sizes, should be considered. However, the whole suite should run quickly (well under an hour); inference algorithms won't be run to convergence, just long enough to get decent timings.
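As illustrative candidates only (the actual model list is open), here are two models at opposite ends of the scale: the small two-parameter `gdemo` model from the Turing docs, and a linear regression whose cost grows with the number of observations and features.

```julia
using Turing, LinearAlgebra

# Small model with two parameters (the classic Turing demo model).
@model function gdemo(x, y)
    s² ~ InverseGamma(2, 3)
    m ~ Normal(0, sqrt(s²))
    x ~ Normal(m, sqrt(s²))
    y ~ Normal(m, sqrt(s²))
end

# Linear regression whose cost scales with the number of observations and
# features, so timings can be reported against data size.
@model function linreg(X, y)
    D = size(X, 2)
    σ ~ truncated(Normal(0, 1), 0, Inf)
    β ~ MvNormal(zeros(D), I)
    y ~ MvNormal(X * β, σ^2 * I)
end
```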
Inference algorithms to benchmark
We want to avoid algorithms whose adaptation influences timings. For example, NUTS adapts the number of leapfrog steps, which would result in unpredictable timings. HMC, ADVI, GibbsConditional, MH, and PG, for example, are fair game.
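A rough sketch of what fixed-configuration runs could look like, reusing the `gdemo` model from the sketch above; the step size, leapfrog count, particle count, and iteration counts are placeholders chosen only to keep runtimes short.

```julia
using Turing

model = gdemo(1.5, 2.0)   # `gdemo` as defined in the sketch above
n_samples = 1_000

chains = Dict(
    "HMC" => sample(model, HMC(0.05, 10), n_samples; progress = false),  # fixed step size and leapfrog count
    "MH"  => sample(model, MH(),          n_samples; progress = false),
    "PG"  => sample(model, PG(20),        n_samples; progress = false),  # fixed number of particles
)

# ADVI uses a fixed number of optimization steps rather than MCMC samples.
q = vi(model, ADVI(10, 1_000))
```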