
Using JET.jl to determine if typed varinfo is okay #728

Open: torfjelde wants to merge 45 commits into base: master
Conversation

torfjelde (Member)

After a quick experiment with JET.jl I found some bugs in DynamicPPL.jl (#726), but also realized that we can use JET.jl to properly check whether a given model supports the use of TypedVarInfo rather than requiring UntypedVarInfo.

This has been a looooooong-standing issue, and this approach seems to work really, really well.

The problem

In Turing.jl, we use TypedVarInfo almost everywhere due to the performance characteristics that come with it. The problem is that we do so by simply evaluating the given model once and then using the resulting (hopefully, concretely typed) varinfo for all subsequent computations. This works nicely for most typical models, but fails horribly (and uninformatively) for a good chunk of models, such as

@model function demo1()
    x ~ Bernoulli()
    if x
        y ~ Normal()
    else
        z ~ Normal()
    end
end

Here we will execute the model once and get, say, a TypedVarInfo containing the variables x and y (because x happened to result in a true sample). If we then re-use this varinfo for sampling, we will of course run into issues since z is nowhere to be seen.

Technically we can handle this by just widening the container a bit, but if we do that, we need to capture the new varinfo, which isn't always possible, e.g. when using the LogDensityFunction in a sampler.

As a result, we have a lot of code that just makes the assumption "surely this model is 'static' in what variables and types it contains", which can sometimes be false.
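The core of the issue can be illustrated with plain Julia, independent of DynamicPPL (a minimal sketch; the `vi` NamedTuple below is just a stand-in for the typed varinfo's storage): a NamedTuple-backed container is fixed to the keys it was created with, so recording a variable that was absent from the first evaluation forces a container of a *new* type.

```julia
# First run happened to take the `if` branch, so only :x and :y were recorded.
vi = (x = true, y = 0.42)

haskey(vi, :z)    # false: :z is not part of the container's type

# "Widening" means building a *new* NamedTuple with a different type;
# any downstream code still holding the old `vi` never sees :z.
vi_widened = merge(vi, (z = 0.0,))
typeof(vi) == typeof(vi_widened)    # false: the container type changed
```

This is why capturing the widened varinfo matters: the update is not in place, so code that cannot re-capture the result keeps operating on the old, incomplete container.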

The solution

This PR introduces a determine_varinfo method which, using the abstract interpretation offered by JET.jl, can automagically figure out, fully statically, whether we can use the type-stable varinfo safely (i.e. without having to always capture the resulting varinfo, etc.) or whether we need to fall back to the untyped varinfo.

Effectively what determine_varinfo does is:

  1. Execute the model once to get the typed varinfo.
  2. Using JET.jl, statically check if we can run into type issues, e.g. a container of NamedTuple{(:x, :y)} cannot handle the value for z being updated (because the entry does not exist).
  3. If we do run into errors, we return an untyped varinfo. If we don't, we return a typed one.

Note that this method doesn't say anything about whether there might be type instabilities; this only checks if we would encounter errors. We can also use JET to check type instabilities, etc., but I think that's a separate functionality and thus a separate PR.
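The three steps above can be sketched roughly as follows. This is a hedged illustration only: the actual determine_varinfo in this PR may have a different signature and internals, and `typed_varinfo_for`, `untyped_varinfo_for`, and the exact JET call shape are assumptions, not the real DynamicPPL API.

```julia
using DynamicPPL, JET

# Sketch of the determine_varinfo logic; helper names are hypothetical.
function determine_varinfo_sketch(model)
    # 1. Evaluate the model once to obtain a typed (NamedTuple-backed) varinfo.
    typed_vi = typed_varinfo_for(model)  # hypothetical helper

    # 2. Statically check the model evaluation via JET.jl's abstract
    #    interpretation; a reported error corresponds to e.g. updating a
    #    variable whose entry does not exist in the typed container.
    result = JET.report_call(m -> evaluate_with(m, typed_vi), (typeof(model),))  # hypothetical call shape

    # 3. Fall back to the untyped varinfo if JET reports any problems.
    if isempty(JET.get_reports(result))
        return typed_vi
    else
        return untyped_varinfo_for(model)  # hypothetical helper
    end
end
```

The key point is that step 2 happens without running the model again: JET reasons about the inferred types, so branches not taken in the single concrete evaluation (like the `z ~ Normal()` branch in demo1) are still checked.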

@torfjelde (Member Author)

See the tests for what we can properly check here. It honestly seems really good for our purposes 👀

torfjelde and others added 3 commits November 28, 2024 15:44
Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
@yebai (Member) commented Nov 28, 2024

That seems like an elegant trick!


codecov bot commented Nov 28, 2024

Codecov Report

Attention: Patch coverage is 91.89189% with 3 lines in your changes missing coverage. Please review.

Project coverage is 84.88%. Comparing base (0a39979) to head (b6b4bff).

Files with missing lines   Patch %   Lines
src/model_utils.jl         60.00%    2 Missing ⚠️
src/DynamicPPL.jl          80.00%    1 Missing ⚠️
Additional details and impacted files
@@            Coverage Diff             @@
##           master     #728      +/-   ##
==========================================
+ Coverage   84.66%   84.88%   +0.21%     
==========================================
  Files          35       36       +1     
  Lines        4180     4207      +27     
==========================================
+ Hits         3539     3571      +32     
+ Misses        641      636       -5     


@coveralls commented Nov 28, 2024

Pull Request Test Coverage Report for Build 12097699282

Details

  • 34 of 37 (91.89%) changed or added relevant lines in 6 files are covered.
  • No unchanged relevant lines lost coverage.
  • Overall coverage increased (+0.2%) to 84.882%

Changes missing coverage   Covered Lines   Changed/Added Lines   %
src/DynamicPPL.jl          4               5                     80.0%
src/model_utils.jl         3               5                     60.0%

Totals:
  Change from base Build 12088194096: 0.2%
  Covered Lines: 3571
  Relevant Lines: 4207

💛 - Coveralls

torfjelde and others added 11 commits November 29, 2024 09:25
fallback to current behavior + `supports_varinfo` to `is_suitable_varinfo`
longer needed on Julia 1.10 and onwards + added error hint for when
JET.jl has not been loaded
provided context, but uses `SamplingContext` by default (as this
should be a stricter check than just evaluation)
in sampling context now so no need to handle this explicitly elsewhere
Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
@torfjelde (Member Author)

This honestly seems to work really well. I've now made it so that default_sampler and LogDensityFunction also make use of this. The question is just how well it works with Turing.jl (will try this now).

(Resolved review comment on src/model_utils.jl, now outdated)
@torfjelde (Member Author)

Uhmm this is weird. Looking at the CI logs, e.g. https://github.com/TuringLang/DynamicPPL.jl/actions/runs/12094832976/job/33727325064?pr=728#step:6:59, this PR is running with [email protected]. However, looking at the Project.toml, this is not allowed in the current commit o.O

As in, it seems as if the CI is running with the Project.toml of master for some reason? Is this intentional?

@torfjelde (Member Author)

Hmm, this is interesting. One of the demo models fails the JET.jl check specifically on Ubuntu with Julia 1.10; however, I cannot reproduce this error locally on either of my devices (one macOS, one Linux).

I'm guessing the source of the issue is this line

s .~ product_distribution([InverseGamma(2, 3) for _ in 1:d])

in combination with the following impl in Bijectors.jl

https://github.com/TuringLang/Bijectors.jl/blob/d342371da2ab090b5b519265c1951c9322a39879/src/Bijectors.jl#L198-L200

Somehow, this results in a Vector{Any} being broadcasted (see https://github.com/TuringLang/DynamicPPL.jl/actions/runs/12095001178/job/33727679874?pr=728#step:6:314).

I'm uncertain if this is the line

product_distribution([InverseGamma(2, 3) for _ in 1:d])

being inferred to a product of Vector{Any} or if it's because of the line in Bijectors.jl which is

    return logabsdetjac.((bijector(d),), eachcol(x))

and type inference just fails to infer this statement properly.

I'll try to replace the broadcast with a map, i.e.

function _logabsdetjac_dist(d::MultivariateDistribution, x::AbstractMatrix)
    return map(Base.Fix1(logabsdetjac, bijector(d)), eachcol(x))
end

which should be much better type-inference-wise.
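The `map` + `Base.Fix1` pattern can be exercised with plain Julia (a stand-in sketch: `f` below is just a placeholder two-argument function, not the real `logabsdetjac`):

```julia
# Stand-in for logabsdetjac(bijector, column): any two-argument function works.
f(a, x) = a * sum(x)

X = rand(3, 4)

# The broadcast form, wrapping the fixed first argument in a one-element
# tuple so broadcasting treats it as a scalar (as in the Bijectors.jl line):
ys_broadcast = f.((2.0,), eachcol(X))

# The map form with Base.Fix1, which partially applies the first argument.
# A single concrete callable over a uniform collection tends to be easier
# for type inference than broadcasting with a tuple-wrapped argument.
ys_map = map(Base.Fix1(f, 2.0), eachcol(X))

ys_broadcast == ys_map    # both forms compute the same values
```

Whether the map form actually fixes the Vector{Any} seen in CI depends on where inference loses the element type, but it removes one inference-hostile construct from the chain.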

torfjelde and others added 2 commits November 30, 2024 10:15
Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
@torfjelde (Member Author) commented Nov 30, 2024

Aye, it indeed seems like this requires a fix to Bijectors.jl.

Buuuut if we're hitting this with JET.jl, then we're likely also hitting this with some other things sometimes, e.g. I'd assume that Mooncake.jl would also hit type instabilities here? @willtebbutt

EDIT: TuringLang/Bijectors.jl#354
