
Fix + test for compiled ReverseDiff without linking #2097

Merged 1 commit on Oct 5, 2023

Conversation

torfjelde
Member

Currently, we have this (from https://turinglang.org/TuringBenchmarking.jl/dev/):

julia> using TuringBenchmarking, Turing


julia> @model function demo(x)
           s ~ InverseGamma(2, 3)
           m ~ Normal(0, sqrt(s))
           for i in 1:length(x)
               x[i] ~ Normal(m, sqrt(s))
           end
       end
demo (generic function with 2 methods)


julia> model = demo([1.5, 2.0]);


julia> benchmark_model(
           model;
           # Check correctness of computations
           check=true,
           # Automatic differentiation backends to check and benchmark
           adbackends=[:forwarddiff, :reversediff, :reversediff_compiled, :zygote]
       )
┌ Warning: There is disagreement in the log-density values!
└ @ TuringBenchmarking ~/work/TuringBenchmarking.jl/TuringBenchmarking.jl/src/TuringBenchmarking.jl:248
┌──────────────────────────────────────┬─────────────┐
│                             Standard │ Log-density │
│                              backend │    distance │
├──────────────────────────────────────┼─────────────┤
│                ForwardDiff vs Zygote │        0.00 │
│ ForwardDiff vs ReverseDiff[compiled] │        0.59 │
│           ForwardDiff vs ReverseDiff │        0.00 │
│      Zygote vs ReverseDiff[compiled] │        0.59 │
│                Zygote vs ReverseDiff │        0.00 │
│ ReverseDiff[compiled] vs ReverseDiff │        0.59 │
└──────────────────────────────────────┴─────────────┘
┌ Warning: There is disagreement in the gradients!
└ @ TuringBenchmarking ~/work/TuringBenchmarking.jl/TuringBenchmarking.jl/src/TuringBenchmarking.jl:255
┌──────────────────────────────────────┬──────────┐
│                             Standard │ Gradient │
│                              backend │ distance │
├──────────────────────────────────────┼──────────┤
│                ForwardDiff vs Zygote │     0.00 │
│ ForwardDiff vs ReverseDiff[compiled] │     1.20 │
│           ForwardDiff vs ReverseDiff │     0.00 │
│      Zygote vs ReverseDiff[compiled] │     1.20 │
│                Zygote vs ReverseDiff │     0.00 │
│ ReverseDiff[compiled] vs ReverseDiff │     1.20 │
└──────────────────────────────────────┴──────────┘
2-element BenchmarkTools.BenchmarkGroup:
  tags: []
  "evaluation" => 2-element BenchmarkTools.BenchmarkGroup:
	  tags: []
	  "linked" => Trial(500.000 ns)
	  "standard" => Trial(400.000 ns)
  "gradient" => 4-element BenchmarkTools.BenchmarkGroup:
	  tags: []
	  "Turing.Essential.ReverseDiffAD{false}()" => 2-element BenchmarkTools.BenchmarkGroup:
		  tags: ["ReverseDiff"]
		  "linked" => Trial(11.800 μs)
		  "standard" => Trial(11.200 μs)
	  "Turing.Essential.ReverseDiffAD{true}()" => 2-element BenchmarkTools.BenchmarkGroup:
		  tags: ["ReverseDiff[compiled]"]
		  "linked" => Trial(1.900 μs)
		  "standard" => Trial(1.900 μs)
	  "Turing.Essential.ForwardDiffAD{0, true}()" => 2-element BenchmarkTools.BenchmarkGroup:
		  tags: ["ForwardDiff"]
		  "linked" => Trial(800.000 ns)
		  "standard" => Trial(700.000 ns)
	  "Turing.Essential.ZygoteAD()" => 2-element BenchmarkTools.BenchmarkGroup:
		  tags: ["Zygote"]
		  "linked" => Trial(778.310 μs)
		  "standard" => Trial(768.010 μs)

That is, compiled ReverseDiff is incorrect when not linking! Super-strange, right?

Weeeell, not so much; LogDensityProblemsAD.jl uses zeros as the default input for compiling the tape, which, in the case where we have not performed any linking, causes issues with models involving, say, positively constrained distributions a la InverseGamma: https://github.com/tpapp/LogDensityProblemsAD.jl/blob/e13061ff72ddedb1fccf4deeb69f713972300239/ext/LogDensityProblemsADReverseDiffExt.jl#L54-L58

Note that this is not LogDensityProblemsAD.jl's fault, as it assumes we're working in unconstrained space.
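For intuition (this sketch is not part of the PR), here is a minimal illustration of why compiling a tape at zeros can go wrong: ReverseDiff bakes the control-flow path taken at the compile-time input into the tape, so a branch such as an `insupport` check gets frozen. The function `f` below is a made-up stand-in for a constrained log-density:

```julia
using ReverseDiff

# Value-dependent control flow, similar to an `insupport` check in a logpdf.
f(x) = x[1] > 0 ? log(x[1]) : zero(x[1])

# Compile the tape at `zeros`, mimicking LogDensityProblemsAD's default.
tape_at_zero = ReverseDiff.compile(ReverseDiff.GradientTape(f, zeros(1)))

# The compiled tape replays the branch taken at compile time (`x[1] <= 0`),
# so at a positive input it silently returns the wrong gradient:
ReverseDiff.gradient!(zeros(1), tape_at_zero, [2.0])  # [0.0], not the true 1/2

# Compiling at a point that takes the "right" branch gives the correct result:
tape_ok = ReverseDiff.compile(ReverseDiff.GradientTape(f, [1.0]))
ReverseDiff.gradient!(zeros(1), tape_ok, [2.0])       # [0.5]
```

This is exactly the pitfall ReverseDiff's own docs warn about for compiled tapes and value-dependent control flow; unlinked constrained parameters just make it easy to hit, because `zeros` lies on the boundary of (or outside) the support.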

This PR addresses this issue. It's not a very common use-case, but it's useful for identifying performance issues with transformations + it's also relevant if we want to work with Float32 instead of Float64, as the current implementation would then compile the tape with Float64 every time.
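The underlying idea is to compile the tape at the model's actual parameter values rather than at zeros. As a sketch of the mechanism (not necessarily the code in this PR; `demo_pos` is a hypothetical model, and the `x` keyword of `ADgradient` is how recent `LogDensityProblemsAD` versions let callers supply the compile-time input):

```julia
using Turing, DynamicPPL, LogDensityProblems, LogDensityProblemsAD, ReverseDiff

@model function demo_pos()
    s ~ InverseGamma(2, 3)   # positively constrained
end

model = demo_pos()
vi = DynamicPPL.VarInfo(model)            # unlinked: `s` stays in (0, ∞)
ℓ = DynamicPPL.LogDensityFunction(model, vi)

x = vi[:]                                 # valid constrained values, so s > 0
∇ℓ = LogDensityProblemsAD.ADgradient(
    Val(:ReverseDiff), ℓ;
    compile = Val(true),
    x = x,                                # compile the tape here, not at zeros
)
LogDensityProblems.logdensity_and_gradient(∇ℓ, x)
```

Compiling at a point drawn from the model itself guarantees the tape records the in-support branches, and it also picks up the element type of `x`, which is what makes the Float32 use-case above work.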

@yebai yebai requested a review from sunxd3 October 4, 2023 16:13
@codecov

codecov bot commented Oct 4, 2023

Codecov Report

Attention: 1 line in your changes is missing coverage. Please review.

Comparison is base (ed410b1) 0.00% compared to head (31e8f70) 0.00%.

Additional details and impacted files
@@          Coverage Diff           @@
##           master   #2097   +/-   ##
======================================
  Coverage    0.00%   0.00%           
======================================
  Files          21      21           
  Lines        1451    1451           
======================================
  Misses       1451    1451           
Files Coverage Δ
src/essential/ad.jl 0.00% <0.00%> (ø)


@sunxd3
Member

sunxd3 commented Oct 5, 2023

Interesting. Assuming people use autodiff for gradient-based sampling algorithms, are there gradient-based algorithms that do not require unconstrained space?

@torfjelde
Member Author

then are there gradient algorithms that do not require unconstrained space?

Yup, e.g. reflective HMC, though we don't currently have it implemented (there is interest, though: TuringLang/AdvancedHMC.jl#310).

@sunxd3
Member

sunxd3 commented Oct 5, 2023

@torfjelde actually, why does compiling with zero inputs cause wrong results? Is it because ReverseDiff uses the inputs for specialization?

@yebai yebai merged commit b5a07b7 into master Oct 5, 2023
13 checks passed
@yebai yebai deleted the torfjelde/fix-for-reversediff-without-linking branch October 5, 2023 16:23
torfjelde added a commit that referenced this pull request Oct 6, 2023
yebai pushed a commit to TuringLang/JuliaBUGS.jl that referenced this pull request Oct 12, 2023
In light of TuringLang/Turing.jl#2097, we know that computations with compiled `ReverseDiff` can sometimes be wrong because `LogDensityProblemsAD` uses a zeros array as the input for the tape-compilation process.

This PR adds a function `getparams` similar to
[`DynamicPPL.jl`'s](https://github.com/TuringLang/DynamicPPL.jl/blob/d204fcb658a889421525365808b9830be37d3fdb/src/logdensityfunction.jl#L89).

It also updates the function `get_params_varinfo` so that we can
return a DynamicPPL-compatible `SimpleVarInfo` with values in unconstrained
space.