
Remove usage of DynamicPPL.tonamedtuple #2071

Merged
torfjelde merged 23 commits into master from torfjelde/improved-chains-creation on Sep 4, 2023

Conversation

torfjelde
Member

@torfjelde torfjelde commented Sep 1, 2023

Now that we have DynamicPPL.varname_and_value_leaves we no longer need to make use of tonamedtuple.
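The rough idea, as a sketch (assumes `DynamicPPL` is loaded; the exact return form of `varname_and_value_leaves` may differ between versions): it splits a variable into scalar `(VarName, value)` leaves, which is exactly the per-column shape a chain needs.

```julia
using DynamicPPL

x = [1.0 0.5; 0.5 1.0]
# Split the pair (@varname(x), x) into scalar (VarName, value) leaves;
# each leaf can then become its own column in the resulting chain,
# e.g. x[1,1], x[2,1], x[1,2], x[2,2].
for (vn, val) in DynamicPPL.varname_and_value_leaves(@varname(x), x)
    println(vn, " => ", val)
end
```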

EDIT: I should also point out that with this PR we can also properly put non-array types, e.g. Cholesky, into chains:

julia> using Turing

julia> @model demo_lkj() = x ~ LKJCholesky(2, 1.0)
demo_lkj (generic function with 2 methods)

julia> model = demo_lkj();

julia> chain = sample(model, Prior(), 10)
Sampling 100%|█████████████████████████████████████████████████████████████████████████████████████| Time: 0:00:00
Chains MCMC chain (10×4×1 Array{Float64, 3}):

Iterations        = 1:1:10
Number of chains  = 1
Samples per chain = 10
Wall duration     = 0.02 seconds
Compute duration  = 0.02 seconds
parameters        = x[1,1], x[2,1], x[2,2]
internals         = lp

Summary Statistics
  parameters      mean       std      mcse   ess_bulk   ess_tail      rhat   ess_per_sec 
      Symbol   Float64   Float64   Float64    Float64    Float64   Float64       Float64 

      x[1,1]    1.0000    0.0000       NaN        NaN        NaN       NaN           NaN
      x[2,1]   -0.1403    0.6866    0.2171    10.0000    10.0000    0.9145      416.6667
      x[2,2]    0.7043    0.2582    0.0817    10.0000    10.0000    1.7894      416.6667

Quantiles
  parameters      2.5%     25.0%     50.0%     75.0%     97.5% 
      Symbol   Float64   Float64   Float64   Float64   Float64 

      x[1,1]    1.0000    1.0000    1.0000    1.0000    1.0000
      x[2,1]   -0.9725   -0.6705   -0.3638    0.5753    0.7537
      x[2,2]    0.2169    0.6330    0.7611    0.9191    0.9345


julia> chain = sample(model, NUTS(), 10)
┌ Info: Found initial step size
└   ϵ = 1.8
Sampling 100%|█████████████████████████████████████████████████████████████████████████████████████| Time: 0:00:00
Chains MCMC chain (10×15×1 Array{Float64, 3}):

Iterations        = 6:1:15
Number of chains  = 1
Samples per chain = 10
Wall duration     = 0.98 seconds
Compute duration  = 0.98 seconds
parameters        = x[1,1], x[2,1], x[2,2]
internals         = lp, n_steps, is_accept, acceptance_rate, log_density, hamiltonian_energy, hamiltonian_energy_error, max_hamiltonian_energy_error, tree_depth, numerical_error, step_size, nom_step_size

Summary Statistics
  parameters      mean       std      mcse   ess_bulk   ess_tail      rhat   ess_per_sec 
      Symbol   Float64   Float64   Float64    Float64    Float64   Float64       Float64 

      x[1,1]    1.0000    0.0000       NaN        NaN        NaN       NaN           NaN
      x[2,1]    0.0275    0.5981    0.1912    10.0000     5.1546    1.4926       10.1523
      x[2,2]    0.8068    0.1712    0.0762     6.5463     5.1546    1.5393        6.6460

Quantiles
  parameters      2.5%     25.0%     50.0%     75.0%     97.5% 
      Symbol   Float64   Float64   Float64   Float64   Float64 

      x[1,1]    1.0000    1.0000    1.0000    1.0000    1.0000
      x[2,1]   -0.8471   -0.5180    0.3527    0.3527    0.6650
      x[2,2]    0.5314    0.7380    0.8597    0.9357    0.9834

and it's easy to add support for further types by overloading DynamicPPL.varname_and_value_leaves_inner 👍
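For reference, extending this to a custom type might look roughly like the following. Note that `varname_and_value_leaves_inner` is internal to DynamicPPL, so both the method signature and the `MyWrapper` type here are assumptions for illustration; check `DynamicPPL/src/utils.jl` for the current definition.

```julia
using DynamicPPL

# Hypothetical container type whose underlying matrix we want to appear in chains.
struct MyWrapper
    value::Matrix{Float64}
end

# Assumed overload: recurse into the wrapped value so that each scalar entry
# ends up as its own (VarName, value) leaf, and hence its own chain column.
function DynamicPPL.varname_and_value_leaves_inner(vn::DynamicPPL.VarName, x::MyWrapper)
    return DynamicPPL.varname_and_value_leaves_inner(vn, x.value)
end
```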

introduced immutable invlink, and make use of getparams in transition constructions
@torfjelde
Member Author

Btw, @sethaxen do you have any thoughts about what we should return for the Cholesky factor? Currently we return C.UL, but a potentially more natural alternative might be PDMat?

See https://github.com/TuringLang/DynamicPPL.jl/blob/ba16e3bc91e293c58b03ad287637472d6e11f52f/src/utils.jl#L1020-L1043 for where this is decided.

@sethaxen
Member

sethaxen commented Sep 1, 2023

Btw, @sethaxen do you have any thoughts about what we should return for the Cholesky factor? Currently we return C.UL

How does the user encounter this return value? If it's just for storage in a Chains, then I tend to think that either one of the strict triangles of the correlation matrix or UL should be returned. I lean towards a strict triangle of the correlation matrix, since a Factorization in general behaves like the matrix it factorizes, not like its factors.

But if currently what is being returned is .UL, then I think it is misleading to label those entries as x[i, j], etc. I would definitely interpret those as entries of the correlation matrix, not of the factor. If you go that route, I recommend replacing x with x.U or x.L, whichever applies.

but an potentially more natural alternative is maybe PDMat?

Here's where my assumptions about flattening don't seem to follow. If the goal is flattening, then an iterator is returned, and I'm not sure how it would be relevant if the Cholesky was first wrapped as a PDMat before iterating.

@torfjelde
Member Author

torfjelde commented Sep 2, 2023

How does the user encounter this return value?

Just in how it will be represented in the chain. Currently we just flatten x.UL, which means that in the case where x.UL is x.U you end up with the entries x[1,1], x[1,2], and x[2,2] and their values, etc.

Here's where my assumptions about flattening don't seem to follow. If the goal is flattening, then an iterator is returned, and I'm not sure how it would be relevant if the Cholesky was first wrapped as a PDMat before iterating.

In the sense that we return the flattened PDMat instead of x.UL, i.e. we flatten the correlation matrix represented by the Cholesky decomposition rather than the Cholesky factor itself.
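To make the distinction concrete, here is a plain-Julia sketch (stdlib only, no Turing-specific code) of the two candidate flattenings:

```julia
using LinearAlgebra

A = [1.0 0.5; 0.5 1.0]  # toy correlation matrix
C = cholesky(A)

# Option 1: flatten the Cholesky factor itself (what we currently do via C.UL).
factor_entries = vec(Matrix(C.UL))

# Option 2: flatten the matrix the factorization represents
# (what wrapping in a PDMat would effectively give).
matrix_entries = vec(C.L * C.L')

@assert matrix_entries ≈ vec(A)             # option 2 recovers A itself
@assert !(factor_entries ≈ matrix_entries)  # the two flattenings genuinely differ
```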

EDIT:

I lean towards a strict triangle of the correlation matrix, since a Factorization in general behaves like the matrix it factorizes, not like its factors.

Btw, what do you mean when you say "strict triangle" here? You mean excluding the diagonal?

@torfjelde
Member Author

torfjelde commented Sep 2, 2023

If you go that route, I recommend replacing x with x.U or x.L, whichever applies.

I like this suggestion, if we indeed go down that route :)

@sethaxen
Member

sethaxen commented Sep 2, 2023

In the sense of, we return the flattened PDMat instead of x.UL i.e. the correlation matrix represented by this cholesky decomp flattened rather than the cholesky flattened.

Ah okay, then yes, this is also my preference. The only thing I don't like about it is that getindex isn't actually defined on the Cholesky object, so there's still room for confusion. The x.U or x.L approach doesn't have this problem. I leave the final decision up to you!

Btw, what do you mean when you say "strict triangle" here? You mean excluding the diagonal?

Yes, because the diagonal entries are constrained to be 1s.
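As a tiny illustration (plain Julia, nothing Turing-specific): for a correlation matrix the diagonal carries no information, so only the strict triangle would need to be stored.

```julia
using LinearAlgebra

A = [1.0 0.3; 0.3 1.0]        # a correlation matrix: unit diagonal by construction
@assert all(diag(A) .== 1.0)  # the diagonal entries are always 1

# The "strict" lower triangle excludes that redundant diagonal:
strict_lower = [A[i, j] for j in 1:size(A, 2) for i in (j + 1):size(A, 1)]
@assert strict_lower == [0.3]
```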

@torfjelde
Member Author

Yeah, unfortunately it's a bit "annoying" to show the strict triangle with the current implementation, since that requires knowledge of the distribution (the current flattening approach only takes into account the value itself, not the distribution), which is why I didn't do that.

I think for now we do x.UL or PDMat(x); I'm leaning towards x.UL so as to be completely transparent to the user.

@codecov

codecov bot commented Sep 4, 2023

Codecov Report

Patch and project coverage have no change.

Comparison is base (9423562) 0.00% compared to head (8dbd613) 0.00%.

❗ Current head 8dbd613 differs from pull request most recent head 4aeea42. Consider uploading reports for the commit 4aeea42 to get more accurate results.

Additional details and impacted files
@@          Coverage Diff           @@
##           master   #2071   +/-   ##
======================================
  Coverage    0.00%   0.00%           
======================================
  Files          22      21    -1     
  Lines        1492    1447   -45     
======================================
+ Misses       1492    1447   -45     
Files Changed Coverage Δ
ext/TuringDynamicHMCExt.jl 0.00% <0.00%> (ø)
ext/TuringOptimExt.jl 0.00% <0.00%> (ø)
src/Turing.jl 0.00% <ø> (ø)
src/contrib/inference/abstractmcmc.jl 0.00% <0.00%> (ø)
src/contrib/inference/sghmc.jl 0.00% <0.00%> (ø)
src/essential/ad.jl 0.00% <ø> (ø)
src/inference/AdvancedSMC.jl 0.00% <0.00%> (ø)
src/inference/Inference.jl 0.00% <0.00%> (ø)
src/inference/emcee.jl 0.00% <0.00%> (ø)
src/inference/ess.jl 0.00% <0.00%> (ø)
... and 6 more


@torfjelde
Member Author

This finally looks ready to be reviewed @yebai! This is technically a non-breaking release given that we don't export this functionality, but I'm in favour of making it breaking, as there are bound to be some downstream issues with this.

Member

@yebai yebai left a comment


Thanks @torfjelde , it looks good to me!

* Update Utilities.jl

* remove obsolete functionality.

* remove obsolete utility module

* removes more references to utilities.
@yebai yebai requested a review from sunxd3 September 4, 2023 15:42
Co-authored-by: Hong Ge <[email protected]>
* Update Turing.jl

* Update Project.toml

* Update Turing.jl

* Update Essential.jl
@yebai yebai merged commit e22b77c into master Sep 4, 2023
@yebai yebai deleted the torfjelde/improved-chains-creation branch September 4, 2023 19:40
@torfjelde torfjelde mentioned this pull request Oct 1, 2023