
Compatibility with new DPPL version #1900

Merged (37 commits) · Nov 12, 2022

Conversation

@torfjelde (Member) commented Nov 7, 2022

The goal of this PR is just to make Turing work with the new DynamicPPL version, not to obtain full feature parity between SimpleVarInfo and VarInfo (though it is a significant step towards exactly that).

Main changes are (a hedged sketch of the new calls follows the list):

  1. unflatten is now used to convert a vector into an AbstractVarInfo.
  2. Use (inv)link!! instead of the deprecated (inv)link!.
  3. Use setindex!! instead of setindex!.
  4. Use evaluate and capture the resulting AbstractVarInfo, so that immutable implementations of AbstractVarInfo are also supported.
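
A minimal sketch of what this migration looks like in downstream code. The function names come from DynamicPPL, but the exact signatures (e.g. whether a sampler argument is required, or how the evaluation context is passed) differ between DynamicPPL versions, so treat this as illustrative rather than the PR's literal diff:

```julia
using DynamicPPL, Distributions

@model function demo(x)
    m ~ Normal()
    x ~ Normal(m, 1)
end

model = demo(1.5)
vi = VarInfo(model)

# (1) Vector -> AbstractVarInfo now goes through unflatten rather than
# mutating the existing object in place:
vi = DynamicPPL.unflatten(vi, vi[:])

# (2) The !!-suffixed variants return the (possibly new) object, which is
# what makes immutable AbstractVarInfo implementations workable:
vi = DynamicPPL.link!!(vi, model)
vi = DynamicPPL.invlink!!(vi, model)
# (3) setindex!! (BangBang-style) similarly replaces setindex!.

# (4) Evaluate the model and capture the returned AbstractVarInfo instead
# of relying on mutation:
_, vi = DynamicPPL.evaluate!!(model, vi, DefaultContext())
```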

Closes #1899 and #1898

[Resolved review thread on src/inference/mh.jl (outdated)]
@torfjelde (Member, Author)

So it seems like on macOS the estimates are a bit noisier; e.g., some tests are failing because the error is now slightly above the atol=0.2 used in https://github.com/TuringLang/Turing.jl/blob/master/test/inference/Inference.jl#L76-L80.

And there's a bug with MH (it's producing completely incorrect results). Trying to figure that out now.

@torfjelde (Member, Author)

> So it seems like on macOS the estimates are a bit noisier; e.g., some tests are failing because the error is now slightly above the atol=0.2 used in https://github.com/TuringLang/Turing.jl/blob/master/test/inference/Inference.jl#L76-L80.

@yebai @devmotion Regarding this, should I just increase the threshold? The samplers definitely seem to be doing the right thing (and locally on Linux, the tests are passing).
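
For context, this is the shape of such a tolerance check, with made-up samples and a made-up target value (the real model and values live in the linked test file):

```julia
using Test, Statistics

# Hypothetical stand-in for the failing check: the actual model, variables,
# and target values are in test/inference/Inference.jl#L76-L80.
samples = randn(1_000) .* 0.1 .+ 1.0   # pretend posterior samples of m
@test mean(samples) ≈ 1.0 atol = 0.2   # fails whenever the MC error exceeds atol
```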

@yebai (Member) commented Nov 10, 2022

Sure

@torfjelde (Member, Author)

Hmm, one thing to note here: the tests seem to take ~2x as long to run as before (I just looked at some previously closed PRs) 😕 I'm uncertain whether this is compile time or run time, but it's something to be aware of.

@devmotion (Member)

Hmm, that's very unfortunate. It would be good to know where the regression comes from: whether it is run time or compilation time, whether it is AD-backend specific, etc. Maybe comparing the timings and allocations in the logs (e.g., https://github.com/TuringLang/Turing.jl/actions/runs/3441679090/jobs/5741483837#step:6:825) could give some hints?
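
One low-tech way to separate compilation time from run time locally (a sketch; the model and the Emcee constructor arguments are assumptions, and on Julia >= 1.8 @time also reports the fraction of time spent compiling):

```julia
using Turing

@model function demo(x)
    m ~ Normal()
    x ~ Normal(m, 1)
end
model = demo(1.5)

# First call includes compilation; @time prints the percentage
# of time spent compiling.
@time sample(model, Emcee(10), 100)
# Second call is (mostly) pure run time.
@time sample(model, Emcee(10), 100)
```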

@yebai (Member) commented Nov 11, 2022

@torfjelde Re the performance regression, can you check

Also, if this PR hasn't broken any tests, I am happy to merge it as is and fix the regression in a follow-up PR. That is fine as long as we don't make any new releases until the performance regression is fixed.

@torfjelde (Member, Author)

Good news! It looks like the Emcee sampler is the cause.

  1. Current
  2. Old

I need to have a look at what's causing this though. It's also "interesting" that the effect differs significantly between architectures.

@torfjelde (Member, Author)

> whether we accidentally evaluate the model twice somewhere in the flatten/unflatten pipeline?

And just for the record, we don't execute the model anywhere in this pipeline. In particular, for VarInfo, nothing has changed in the flatten/unflatten functionality, i.e. unflatten(vi, spl, x) = VarInfo(vi, spl, x).

@torfjelde (Member, Author)

Okay, so this is all very weird.

If I run the tests locally, then the same slowdown occurs when I hit include("inference/emcee.jl").

But if I copy-paste the code from runtests.jl up to and including the include("inference/emcee.jl") line and execute it in the REPL (using the env created by TestEnv.activate), then it runs in ~30s, as before.

So I'm very confused.

Have you seen anything like this before @devmotion?

@devmotion (Member)

Maybe, I'm not sure. But I've definitely seen cases where Pkg.test behaves differently from running the same code in the REPL.

The most obvious difference is that Pkg.test starts a Julia process with some specific command-line options (by default), leading to differences such as those described in https://discourse.julialang.org/t/erratic-test-failure-difference-between-test-and-include-test-runtests-jl/72342 and similar Discourse posts. So the first thing to check might be to start the Julia REPL with exactly the same command-line options as the ones used by Pkg.test (you could, e.g., inspect Base.julia_cmd()).

I've also seen differences due to the use of Revise, so it might be good to also check the behaviour if Revise is not loaded (e.g., by starting a clean Julia process with --startup-file=no).

I guess you already checked if the same package versions are used?

And then, of course, scoping is different in the REPL: https://docs.julialang.org/en/v1/manual/variables-and-scoping/
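
To make that concrete, here is a sketch of reproducing the Pkg.test environment by hand; the flag set below is illustrative, not necessarily the exact set your Julia version uses (compare against what Base.julia_cmd() prints; --check-bounds=yes is the classic Pkg.test difference):

```julia
# Step 1: in a regular REPL, inspect the base command Julia would use for
# subprocesses (Pkg.test builds on this plus its own flags):
Base.julia_cmd()

# Step 2: start a clean process with matching flags and re-run the tests
# from there (shell/REPL steps shown as comments):
#   $ julia --startup-file=no --check-bounds=yes --depwarn=yes
#   julia> using TestEnv; TestEnv.activate("Turing")
#   julia> include("test/runtests.jl")
```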

@torfjelde (Member, Author)

After reading your reply @devmotion, I was reminded of how depwarn=yes has often been the cause of performance regressions like this, aaaand lo and behold, it indeed was the case. Once TuringLang/DynamicPPL.jl#433 is through, we should be good here.
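
For the record, a toy illustration (not Turing code) of why depwarn=yes can be so costly: Base.depwarn captures a backtrace on every call when deprecation warnings are enabled, so a deprecated method on a hot path gets dramatically slower, even though the message itself only prints a limited number of times.

```julia
# Define a "new" method and deprecate an old name in its favour:
new_sum(xs) = sum(xs)
@deprecate old_sum(xs) new_sum(xs)

xs = randn(10_000)
@time for _ in 1:1_000; new_sum(xs); end  # fast
@time for _ in 1:1_000; old_sum(xs); end  # slow under --depwarn=yes
```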

bors bot pushed a commit to TuringLang/DynamicPPL.jl that referenced this pull request Nov 11, 2022
This PR fixes the performance regressions seen for `Emcee` in TuringLang/Turing.jl#1900.

@yebai @devmotion This should be an easy merge.
[Resolved review thread on test/runtests.jl (outdated)]
@torfjelde (Member, Author)

Oh, you reverted some of my changes @yebai

Increasing the number of samples helped but didn't solve it. I guess I'll just increase it further.

[Resolved review thread on test/inference/mh.jl (outdated)]
@codecov (bot) commented Nov 12, 2022

Codecov Report

Base: 81.49% // Head: 81.24% // Decreases project coverage by 0.25% ⚠️

Coverage data is based on head (ab69a21) compared to base (2d41f09).
Patch coverage: 87.35% of modified lines in pull request are covered.

Additional details and impacted files
@@            Coverage Diff             @@
##           master    #1900      +/-   ##
==========================================
- Coverage   81.49%   81.24%   -0.26%     
==========================================
  Files          21       21              
  Lines        1421     1418       -3     
==========================================
- Hits         1158     1152       -6     
- Misses        263      266       +3     
Impacted Files                        Coverage Δ
src/stdlib/distributions.jl           57.95% <0.00%> (ø)
src/inference/hmc.jl                  77.41% <44.44%> (-0.65%) ⬇️
src/modes/ModeEstimation.jl           81.96% <85.71%> (-0.82%) ⬇️
src/Turing.jl                         82.35% <100.00%> (+1.10%) ⬆️
src/contrib/inference/dynamichmc.jl   100.00% <100.00%> (ø)
src/contrib/inference/sghmc.jl        98.50% <100.00%> (ø)
src/inference/emcee.jl                94.11% <100.00%> (ø)
src/inference/gibbs.jl                97.33% <100.00%> (ø)
src/inference/mh.jl                   84.37% <100.00%> (-0.70%) ⬇️
src/modes/OptimInterface.jl           89.18% <100.00%> (+0.14%) ⬆️
... and 3 more

