
Added check_model and sub-module DebugUtils #540

Merged · 48 commits · Apr 19, 2024

Conversation

@torfjelde (Member)

As we're getting more users coming from other languages, who are not necessarily so familiar with Julia, it's becoming more and more important that we avoid end-users hitting potentially confusing snags in Turing.jl / DynamicPPL.jl.

Examples are:

  • Usage of missing and differences between the different ways of conditioning variables.
  • Repeated variables will silently lead to incorrect behavior.
  • Etc.

Because of this, I was wondering if it might be a good idea to add some utilities for doing exactly this.

This PR adds a method check_model (which is also exported) which executes the model using a special context (DebugUtils.DebugContext) and performs some checks before, throughout, and after evaluation of the model. See the added tests for some examples.

Moreover, once we improve the documentation (I'm going to add a section on "Syntax"), we can link to the relevant sections from the warnings/errors to point the user to where they can figure things out.

Here's an example:

julia> using DynamicPPL, Distributions

julia> model = DynamicPPL.TestUtils.DEMO_MODELS[1];

julia> issuccess, (trace, _) = check_model(model);

julia> issuccess
true

julia> trace
3-element Vector{DynamicPPL.DebugUtils.Stmt}:
  assume: VarName[s[1], s[2]] .~ InverseGamma{Float64}(invd=Gamma{Float64}(α=2.0, θ=0.333333), θ=3.0) ⟼ [5.58585, 0.73498] (logprob = -4.46134)
  assume: VarName[m[1], m[2]] .~ Normal{Float64}[Normal{Float64}(μ=0.0, σ=2.36344), Normal{Float64}(μ=0.0, σ=0.85731)] ⟼ [0.264206, 0.781137] (logprob = -2.96538)
 observe: [1.5, 2.0] ~ DiagNormal(μ=[0.264206, 0.781137], Σ=[5.58585 0.0; 0.0 0.73498]) (logprob = -3.6914)

julia> trace[1].varname
2-element Vector{VarName{:s, Setfield.IndexLens{Tuple{Int64}}}}:
 s[1]
 s[2]
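
To make the failure case concrete, here is a sketch of running the same check against a model that trips the "repeated variables" snag from the list above (the buggy_demo model is made up for illustration, and the expectation that issuccess comes back false, or that a warning is printed, is an assumption rather than verified output):

using DynamicPPL, Distributions

# Hypothetical model that assigns to `x` twice, i.e. the "repeated variables"
# case mentioned above. The model name is illustrative only.
@model function buggy_demo()
    x ~ Normal()
    x ~ Normal()
    y ~ Normal(x, 1.0)
    return nothing
end

# Same calling convention as in the example above; the assumption is that
# `issuccess` comes back `false` (and/or a warning is emitted) here.
issuccess, _ = check_model(buggy_demo())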

Thoughts?

As an additional thing, though I'm uncertain whether we want this or not, I've added the possibility of saving the tilde-statements as we're going through the model, and optionally saving the varinfo after every statement. This can be quite useful for debugging "remotely" (i.e. just ask the user to run the model with this and send us the output), and it also makes it easy to perform visual or programmatic checks of the "trace" of a model, which I think will be quite handy.
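
As a small illustration of such a programmatic check, relying only on the API shown in the example above (where the first two statements of the demo trace are the assume-statements and expose a varname field):

using DynamicPPL, Distributions

model = DynamicPPL.TestUtils.DEMO_MODELS[1]
issuccess, (trace, _) = check_model(model)

# Collect the variable names recorded by the two assume-statements shown in
# the printed trace above; for this demo model that yields s[1], s[2], m[1], m[2].
assumed_varnames = reduce(vcat, stmt.varname for stmt in trace[1:2])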

Btw, throughout this I've discovered a few bugs, which have subsequently been fixed :) If we decide against something like this, I'll make separate PRs for those fixes.

Commits pushed along the way:

  • single record_tilde! + support for dot tilde + return issuccess and additional info in check_model
  • convenient show methods to make displaying the trace nicer
  • de-conditioning is restricted to univariate distributions
  • SamplingContext by default since we're using an empty VarInfo by default
@github-actions github-actions bot left a comment
Remaining comments which cannot be posted as a review comment to avoid GitHub Rate Limit

JuliaFormatter

test/debug_utils.jl|59|
test/debug_utils.jl|61|
test/debug_utils.jl|73|
test/debug_utils.jl|81|
test/debug_utils.jl|85|
test/debug_utils.jl|90|

github-actions bot commented Sep 23, 2023

Pull Request Test Coverage Report for Build 8750716198


  • 109 of 220 (49.55%) changed or added relevant lines in 5 files are covered.
  • 17 unchanged lines in 2 files lost coverage.
  • Overall coverage decreased (-2.9%) to 78.921%

Changes Missing Coverage (Covered Lines | Changed/Added Lines | %):

  • src/test_utils.jl | 0 | 5 | 0.0%
  • src/utils.jl | 1 | 7 | 14.29%
  • src/debug_utils.jl | 102 | 202 | 50.5%

Files with Coverage Reduction (New Missed Lines | %):

  • src/model.jl | 1 | 89.22%
  • src/threadsafe.jl | 16 | 48.25%

Totals (Coverage Status):

  • Change from base Build 8745687397: -2.9%
  • Covered Lines: 2707
  • Relevant Lines: 3430

💛 - Coveralls

codecov bot commented Sep 23, 2023

Codecov Report

Attention: Patch coverage is 49.08257%, with 111 lines in your changes missing coverage. Please review.

Project coverage is 79.16%. Comparing base (816e962) to head (8c47fad).
Report is 3 commits behind head on master.

❗ Current head 8c47fad differs from pull request most recent head c1aa5ea. Consider uploading reports for the commit c1aa5ea to get more accurate results

Files (Patch % | Lines):

  • src/debug_utils.jl | 50.00% | 100 Missing ⚠️
  • src/utils.jl | 14.28% | 6 Missing ⚠️
  • src/test_utils.jl | 0.00% | 5 Missing ⚠️
Additional details and impacted files
@@            Coverage Diff             @@
##           master     #540      +/-   ##
==========================================
- Coverage   83.65%   79.16%   -4.50%     
==========================================
  Files          28       26       -2     
  Lines        3219     3197      -22     
==========================================
- Hits         2693     2531     -162     
- Misses        526      666     +140     


@torfjelde (Member Author)

I went the check_model_and_trace route for now.

@torfjelde (Member Author)

Any thoughts on the implementation, @devmotion? I realize I haven't overloaded show much before, so I'm uncertain whether what I've done is a good idea or not.

@devmotion (Member)

I'd say the main rules are to avoid type piracy, to absolutely never define show methods for Types, and otherwise to follow the conventions in the docs (https://docs.julialang.org/en/v1/manual/types/#man-custom-pretty-printing; main points: the two-argument show method is a compact one-line representation, whereas show(io::IO, ::MIME"text/plain", x) is used for longer descriptions and will also be used by display(x) in the REPL).
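
As a minimal sketch of that convention (the TraceStmt type and its fields below are made up for illustration, not the types in this PR):

struct TraceStmt
    varname::Symbol
    logp::Float64
end

# Two-argument `show`: compact, single-line representation.
Base.show(io::IO, stmt::TraceStmt) = print(io, "TraceStmt(", stmt.varname, ")")

# Three-argument `show` with MIME"text/plain": the longer description used by
# `display(x)`, and hence by the REPL.
function Base.show(io::IO, ::MIME"text/plain", stmt::TraceStmt)
    println(io, "TraceStmt:")
    println(io, "  varname: ", stmt.varname)
    print(io, "  logprob: ", stmt.logp)
end

Since both methods are defined for a type we own, this also steers clear of the type-piracy concern.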

@torfjelde (Member Author)

But then it seems like my current approach is good?


torfjelde commented Apr 19, 2024

Btw @yebai you have any thoughts on this?

Another aspect I think would be useful with this is to extract the trace from the model a few times to determine if the model is "static" (in some sense) or not. We could then warn the user if, say, they're running HMC on a model which has a non-static number of parameters.

EDIT: Added a has_static_constraints method. This is also related to TuringLang/Turing.jl#2195.
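
A hedged usage sketch of that check (the exact qualification and signature of has_static_constraints are assumptions based on this comment):

using DynamicPPL, Distributions

model = DynamicPPL.TestUtils.DEMO_MODELS[1]

# Assumption: `has_static_constraints(model)` evaluates the model a few times
# and reports whether the constraints it encounters stay the same across
# realizations.
if !DynamicPPL.has_static_constraints(model)
    @warn "Constraints appear to change between realizations; samplers such as HMC assume a static parameter space."
end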

EDIT 2: Note that after this PR we can immediately improve the user experience by adding a check_model=true keyword to Turing.sample and then running DynamicPPL.check_model(model) before we hit inference.
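
A rough sketch of how that could be wired up (the wrapper below is hypothetical; a real implementation would add the keyword to Turing.sample itself):

using Turing, DynamicPPL

# Hypothetical wrapper illustrating the `check_model=true` idea; the keyword
# handling and the use of `check_model`'s return value follow the example
# earlier in this thread.
function sample_with_check(model, sampler, N; check_model=true, kwargs...)
    if check_model
        issuccess, _ = DynamicPPL.check_model(model)
        issuccess || @warn "DynamicPPL.check_model reported potential issues with the model"
    end
    return sample(model, sampler, N; kwargs...)
end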

Commit pushed along the way:

  • `has_static_constraints` method to empirically check whether the model has static constraints or whether they change depending on realizations
@torfjelde (Member Author)

This is pretty much ready to go :)

@yebai yebai (Member) left a comment
Good to have some debugging tools!

@yebai yebai enabled auto-merge April 19, 2024 18:48
@yebai yebai added this pull request to the merge queue Apr 19, 2024
Merged via the queue into master with commit 824f712 Apr 19, 2024
11 of 12 checks passed
@yebai yebai deleted the torfjelde/model-check branch April 19, 2024 19:21

yebai commented Jun 12, 2024

We can likely introduce some checks performed after each MCMC step in the future, which can help catch additional issues like changing model dimensionality.
