Derived quantities by danielturek · Pull Request #1547 · nimble-dev/nimble

danielturek · 2025-05-13T13:14:39Z

Adds derived quantities to the MCMC.

Let's see how many issues the testing turns up.

* Tests for mean/variance derived quantities
* Tests for logProb derived quantity
* Tests for predictive derived quantity
* Tests for custom derived quantity
* Update derived quantity tests after fixes
* Remove unneeded TODO comment

---------

Co-authored-by: Daniel Turek <danielturek@gmail.com>
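
For readers unfamiliar with the idea, a mean/variance derived quantity can be pictured as a summary that is updated as the chain runs, rather than computed from stored samples afterwards. The base-R sketch below uses Welford's running update to illustrate that idea only. It is not nimble's implementation, and `makeRunningMoments` is a hypothetical helper name.

```r
# Hypothetical helper illustrating a running mean/variance; not part of nimble.
makeRunningMoments <- function() {
  n <- 0   # number of draws seen so far
  m <- 0   # running mean
  s <- 0   # running sum of squared deviations from the mean
  update <- function(x) {
    n <<- n + 1
    delta <- x - m
    m <<- m + delta / n
    s <<- s + delta * (x - m)   # uses the already-updated mean (Welford)
    invisible(NULL)
  }
  list(
    update = update,
    mean = function() m,
    var = function() if (n > 1) s / (n - 1) else NA_real_
  )
}

set.seed(1)
draws <- rnorm(1000, mean = 2, sd = 3)   # stand-in for MCMC samples
acc <- makeRunningMoments()
for (x in draws) acc$update(x)

# The running summaries agree with the batch computations.
all.equal(acc$mean(), mean(draws))
all.equal(acc$var(), var(draws))
```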

paciorek · 2025-06-20T15:39:55Z

I merged devel into derived_quantities to address a few testing issues that I had fixed in recent weeks. That should fix all but the mcmc and filtering gold file issues.

Let's see what this run of CI shows in terms of the gold file issues. Hopefully those are heisen-failures.

paciorek · 2025-06-20T17:11:42Z

There's a failure in test-bnp that I can replicate running manually on my machine. It occurs in the MCMC compilation in this particular chunk of code:

  code=nimbleCode(
    {
      xi[1:10] ~ dCRP(1 , size=10)
      thetatilde[1] ~ dnorm(0, 1)
      thetatilde[2] ~ dt(0, 1, 1)
      thetatilde[3] ~ dt(0, 1, 1)
      s2tilde[1] ~ dinvgamma(2, 1)
      s2tilde[2] ~ dgamma(1, 1)
      s2tilde[3] ~ dgamma(1, 1)
      for(i in 1:10){
        y[i] ~ dnorm(thetatilde[xi[i]], var=s2tilde[xi[i]])
      }
    }
  )
  Inits=list(xi=rep(1, 10), thetatilde=rep(0,3), s2tilde=rep(1,3))
  Data=list(y=rnorm(10, 0,1))
  m <- nimbleModel(code, data=Data, inits=Inits)
  cm <- compileNimble(m)
  mConf <- configureMCMC(m, monitors =  c('thetatilde', 's2tilde', 'xi'))
  expect_message(mMCMC <- buildMCMC(mConf), "The number of clusters")
  cMCMC <- compileNimble(mMCMC, project = m) 
  cMCMC$run(1, reset=FALSE)

[...snip...]
Compiling
  [Note] This may take a minute.
  [Note] Use 'showCompilerOutput = TRUE' to see C++ compilation details.
terminate called after throwing an instance of 'std::length_error'
  what():  vector::_M_default_append
Aborted

danielturek · 2025-06-21T02:06:04Z

@paciorek What is the intention of including reset = FALSE in the call

cMCMC$run(1, reset=FALSE)

?

I believe that's what's causing the problem here, and the MCMC system is not designed to run with reset = FALSE, unless a prior run of the MCMC has already taken place.

paciorek · 2025-06-22T17:54:59Z

Hmm.

I don't know why I put reset=FALSE in that test.
That said, this test has been fine for years. And to confirm, I just ran it on current nimble and it does run without error.
I would hope that even if we don't intend users to use reset=FALSE on first run that it would not cause an error and certainly not a crash. That said, I see that we tell users not to use FALSE on first run in roxygen.

I'm ok with taking out reset=FALSE but before doing so, I think it would be good if we understand why it causes problems now but not before.

danielturek · 2025-06-22T19:41:19Z

@paciorek I'm away from a computer this week and writing from my phone, so forgive the brevity. This is caused by some reorganization of the initialization code that I did in the setup of the MCMC, with respect to mvSamples and related objects. The changes (alongside new initialization code for the derived quantities) are more logical and correct for how the MCMC is designed to operate. I wouldn't mind also adding an error trap for this case (first run of the MCMC using reset = FALSE), but I can't do that now. What do you think is best?

paciorek · 2025-06-22T19:58:55Z

To me the most natural thing is that if one does reset=FALSE on first run of the MCMC it has no effect. I.e., if one does not reset MCMC sampling quantities before even running the MCMC, then the MCMC just runs as it otherwise would with whatever the initial quantities are set to.

But you have probably thought this through/understand this more than I do, so if you prefer to error trap this case, it is ok with me.

Perhaps when you work on this you can just remove the reset=FALSE from that test in test-bnp.R.

No immediate hurry here, but I'll note to myself that we are planning on making this change as part of 1.4.0 release.
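
The "no effect on first run" behavior described above can be sketched in plain R. This is a toy illustration under stated assumptions, not nimble's MCMC code: `makeToyRunner`, `hasRun`, and `samples` are hypothetical names, and `rnorm()` stands in for actual sampling.

```r
# Hypothetical toy runner; it only mimics the reset semantics under discussion.
makeToyRunner <- function() {
  hasRun <- FALSE          # becomes TRUE after the first call to run()
  samples <- numeric(0)    # stored draws (a stand-in for a samples object)
  run <- function(niter, reset = TRUE) {
    if (!hasRun && !reset) {
      # First run: there is nothing to continue from, so treat it as a reset.
      reset <- TRUE        # local assignment to the argument
    }
    if (reset) samples <<- numeric(0)
    samples <<- c(samples, rnorm(niter))
    hasRun <<- TRUE
    invisible(length(samples))
  }
  list(run = run, nSamples = function() length(samples))
}

toy <- makeToyRunner()
toy$run(1, reset = FALSE)   # first run: silently behaves like reset = TRUE
toy$run(2, reset = FALSE)   # later run: appends to the existing draws
toy$nSamples()
```

The alternative raised in the thread, an error trap, would replace the `reset <- TRUE` line with a `stop()` call; which of the two is preferable is the design question being discussed here.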

danielturek · 2025-06-25T01:56:54Z

If I'm seeing this correctly, it looks like the testing failures are related to the MCMC gold file. Chris, are you able to look more closely at those, at any point?


paciorek · 2025-06-25T14:24:40Z

Sure -- I need to look at the MCMC gold file for another PR, so it may be related.

danielturek · 2025-06-25T17:17:29Z

Thanks. I made a minor fix for the MCMC reset argument.


paciorek · 2025-06-25T23:50:58Z

@danielturek there's a problem with your fix:

** byte-compile and prepare package for lazy loading
Warning in .checkFieldsInMethod(def, fieldNames, allMethods) :
  non-local assignment to non-field names (possibly misspelled?)
    reset <<- TRUE
( in method "run" for class "MCMC")
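
For context, the warning above is the reference-class check objecting to `<<-` applied to a name that is not a field. In a reference-class method, `<<-` is meant for fields, while a method argument such as `reset` is an ordinary local variable and is changed with `<-`. A minimal base-R sketch of that pattern follows; the `Counter` class and its `total` field are hypothetical and unrelated to nimble's `MCMC` class.

```r
library(methods)

# Hypothetical reference class used only to show local vs. field assignment.
Counter <- setRefClass("Counter",
  fields = list(total = "numeric"),
  methods = list(
    run = function(n, reset = TRUE) {
      # `reset` is an argument, so it is modified with local assignment.
      if (length(total) == 0) reset <- TRUE
      # `total` is a field, so it is modified with `<<-`.
      if (reset) total <<- 0
      total <<- total + n
      invisible(total)
    }
  )
)

cnt <- Counter$new()
cnt$run(2, reset = FALSE)   # field is uninitialized, so a reset is forced
cnt$run(3, reset = FALSE)   # accumulates onto the previous total
cnt$total
```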

danielturek · 2025-06-26T01:03:19Z

@paciorek I just made a small update.

paciorek · 2025-06-26T15:33:36Z

So some (not sure if all) of the gold file problems seem to be caused by a new space character inserted in the reporting of sampler assignment.

E.g., in line 2709 of mcmcTestLog_Correct.Rout, we have:

RW sampler: a[2]

The saved gold file has no space after "[2]", but running tests now produces a space after "a[2]".

On first glance, it looks like this is coming from the change to cat() in the show method for samplerConf.
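
One way a trailing space like this can arise in base R is that `cat()` puts its default separator (`" "`) between every pair of arguments, including before a final `"\n"`. The sketch below shows that mechanism only; the actual `cat()` call in the samplerConf show method is not quoted in this thread, so the strings here are illustrative.

```r
# Default sep = " ": a space is written between "a[2]" and the newline.
withSpace <- capture.output(cat("RW sampler:", "a[2]", "\n"))

# sep = "": the caller controls every space, so nothing precedes the newline.
noSpace <- capture.output(cat("RW sampler: ", "a[2]", "\n", sep = ""))

# The two lines differ only by the trailing space.
nchar(withSpace) - nchar(noSpace)
identical(trimws(withSpace, which = "right"), noSpace)
```

A gold-file comparison that matches output character for character would flag the first form against a file saved from the second.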

paciorek · 2025-06-26T15:59:18Z

I think all of the mcmc gold file issues are the extra space. Hopefully the filtering gold issues are too.

danielturek · 2025-06-26T16:01:06Z

That sounds about right, and yes, I would prefer the space not be there. Is there any chance of re-writing the gold file? I suspect you'll say the correct path is to reintroduce the space, then (theoretically) pass testing, then re-remove the space.

paciorek · 2025-06-27T17:22:57Z

@danielturek

Ok to merge this in? Do you want to do it?
I see various roxygen entries, so I am guessing roxygen is more-or-less done, but wanted to check.
Were you planning to draft a section for the manual or would you like me to do that?

danielturek · 2025-06-28T10:42:01Z

@paciorek

Yes, I consider this ready to merge. I'll take care of it.
Yes, the roxygen is accurate.
There is currently no manual entry. If you're willing to draft that I'd appreciate it. It might be reasonable to (in a large part) just reference the vignette; or if not, you could repurpose a lot of information from there.

danielturek · 2025-06-28T15:03:28Z

@paciorek Looks like I didn't update the NEWS file, however.

paciorek · 2025-06-28T16:35:53Z

That's fine -- as of late I've just been doing NEWS all in one go right before release by looking through all the commits/closed PRs/closed issues.

I'm not sure how to feel about vignette vs. manual. Modularity can be nice, and I know other packages structure docs around vignettes and that the manual can be overwhelming, but I also like having all the content in one place, particularly since we're thinking of this as a core part of the MCMC system.

@perrydv @kenkellner do you have any inclinations in terms of how much to say about derived quantities in the manual vs. the material already in the vignette?

kenkellner · 2025-06-29T12:59:20Z

The existing vignette covers everything and is pretty concise, so maybe it just all goes in the manual?

If most of it stays in the vignette, is it OK that the vignette is not actually included with the package but instead is on Daniel's site? Not sure if there are other situations like that in the manual.

paciorek · 2025-07-03T17:03:17Z

I'm working on manual content for derived quantities.

@danielturek (and no need to respond until you have time) in your vignette, you distinguish between (1) "predictive nodes" and (2) "posterior derived quantities". I think by the latter you mean deterministic predictive "quantities", but elsewhere in our docs we refer to both determ and stoch predictive cases as "predictive nodes". So I am going to modify your language and not distinguish between (1) and (2). Please let me know if I'm misunderstanding or you otherwise are opposed to me modifying your language.

danielturek · 2025-07-07T18:05:13Z

@paciorek Yes, in the text I generally said "predictive nodes" for PP stochastic nodes, and "posterior derived quantities" for PP deterministic nodes. And yes, as you said both of these are just different cases of "predictive nodes". It sounds like we're on the same page, and if you've already modified text according to how you think this should be worded, that's fine with me.

danielturek added 30 commits April 8, 2025 16:19

moved some utility functions to MCMC_util.R

dc6f37f

draft of derived_logProb

ba23006

working on derived quantities

f1f23bf

update to printing

88f017d

working on logProb function

a71de52

working on derived functions

a053fc5

updated notes

8323764

update to derived function

8bae542

working on names

fc4961d

working on derived quantities

0ab40d1

minor update to build

f9b509e

updates to derived functions

ff39203

high level function calls

4ae12eb

running name change, and dealing with length 1 vectors

803fa08

fixed length = 1 vectors

efa4fc0

changed mean and variance to use mvSamples

550cdb6

updates

53e09b0

name change to printing

0bd0e22

added frequencyRecord and saved old mvSamples implementations

e38cd2e

working on argument names

368b694

comments on questions

5b3dc2c

updated comments

bf1bf97

updated notes based on discussion

706b14a

removed old versions of mean and variance

2db7f87

removed storage arrays in mean and variance

4828ca7

changelog

2b22c47

new nimble option MCMCreturnDerivedQuantities to control derived return

8a7710c

derived has final position in runMCMC output list

4c7ad1b

updated comments

695e45f

list names for logProb derived quantity

2d24b99

danielturek and others added 4 commits May 27, 2025 13:45

reverted handling of .all for logProb

5584530

revert logProb = TRUE behavior from configureMCMC

aa974bd

Fix minor roxygen conflict in MCMC_build.R.

422fac8

danielturek and others added 2 commits June 24, 2025 19:00

compulsory reset on first run of MCMC

37864b3

Merge branch 'devel' into derived_quantities

b449cdc

Fix local assignment for reset argument

7870b8c

removed extra space at end of samplerConf$show method

f5daeae

danielturek merged commit e0f18a8 into devel Jun 28, 2025
8 checks passed

3 participants