Determining whether a functional test has failed (changepoint) #780

MichaelClerx · 2019-04-15T12:47:25Z

Chatting this morning we thought it might be good for functional tests to fail like this:

Gather the last N samples*
Run M tests
See if the M results are likely from the distribution approximated by N

There's some extra complexity in step 1, namely (A) ignoring samples that previously failed the test, (B) if the distribution changes sharply at some point, e.g. after a bug is fixed, we should manually mark all samples before that switch as points to be ignored

This still requires a bit of manual work, but at least we could get rid of some arbitrary thresholds?

Thoughts @ben18785 @martinjrobins ?

chonlei · 2019-04-16T16:29:19Z

First, have a look at the distribution #783

MichaelClerx · 2019-05-16T14:09:36Z

https://github.com/ben18785/pints-changepoints

MichaelClerx · 2019-05-16T14:32:50Z

@fcooper8472 what's our thinking on running this?

E.g. will ./funk report call this in a subprocess and wait for it to finish before analysing the output?

MichaelClerx · 2019-08-02T13:35:39Z

@abhidg https://dev.azure.com/OxfordRSE/pints-functional-testing/_build/results?buildId=439

Seems to be falling over with:

ERROR:pfunk._test:Exception in plot: mcmc_banana_EmceeHammerMCMC_3
Creating plot for mcmc_banana_EmceeHammerMCMC_3
Traceback (most recent call last):
  File "./funk", line 14, in <module>
    main()
  File "/home/pints/functional-testing/pfunk/__main__.py", line 494, in main
    args.func(args)
  File "/home/pints/functional-testing/pfunk/__main__.py", line 133, in run
    pfunk.tests.plot(name, args.database, args.show)
  File "/home/pints/functional-testing/pfunk/tests/_tests.py", line 34, in plot
    _tests[name].plot(database, show)
  File "/home/pints/functional-testing/pfunk/_test.py", line 115, in plot
    figs = self._plot(results)
  File "/home/pints/functional-testing/pfunk/tests/mcmc_banana.py", line 157, in _plot
    figs.append(pfunk.ChangePints().data(results['kld']).figure())
  File "/home/pints/functional-testing/pfunk/changepints.py", line 76, in figure
    fig, ax = rpt.display(self.signal, self.breakpoints())
AttributeError: 'ChangePints' object has no attribute 'signal'

abhidg · 2019-08-02T15:49:28Z

Fixed in pints-team/change-point-testing@a5f1b5e

MichaelClerx · 2019-08-20T11:41:49Z

Next up: Figure out test/pass when to email etc.

MichaelClerx · 2019-09-28T09:20:25Z

We should propbably prioritise this issue, and resolve it in the next few weeks :-)

Have added some new methods, and re-added a few tests we removed earlier because they seemed to hard.

It's interesting that the changepoint code so far hasn't complained about any of the methods. That's

Good! Because it appears more robust than our threshold-based testing
Not ideal, because consistently bad isn't maybe what we're after :D

So I'm guessing the final criterion would be a combination of what we currently have and the changepoint code?

MichaelClerx · 2019-09-28T09:37:02Z

For single chain MCMC methods, we could also consider adding a test that runs mutliple chains and tests whether they've converged?

MichaelClerx · 2019-09-28T09:37:15Z

For optimisers, see #906

MichaelClerx · 2021-01-26T09:58:31Z

pints-team/change-point-testing#23

MichaelClerx added the functional-testing label Apr 15, 2019

MichaelClerx mentioned this issue Apr 25, 2019

Find out why adaptive covariance func test results have changed slightly #784

Closed

MichaelClerx assigned MichaelClerx and ben18785 May 16, 2019

MichaelClerx assigned fcooper8472 May 16, 2019

This was referenced May 17, 2019

assertion of "under threshold for last 3 commits" is a bit flaky #538

Closed

Add plot of distribution of previous results #783

Closed

MichaelClerx changed the title ~~Functional test failures when distribution changes?~~ Functional test failures when distribution changes? (changepoint) May 17, 2019

abhidg self-assigned this Jul 30, 2019

MichaelClerx mentioned this issue Aug 2, 2019

Have a look at ruptures package for changepoint detection #859

Closed

MichaelClerx changed the title ~~Functional test failures when distribution changes? (changepoint)~~ Determining whether a functional test has failed (changepoint) Sep 28, 2019

MichaelClerx added the priority label Sep 28, 2019

MichaelClerx mentioned this issue Sep 30, 2019

Tidy up / redesign functional testing framework #979

Closed

MichaelClerx unassigned abhidg, MichaelClerx, fcooper8472 and ben18785 Mar 31, 2020

MichaelClerx mentioned this issue Jan 26, 2021

Add basic changepoint detection pints-team/change-point-testing#23

Closed

MichaelClerx closed this as completed Jan 26, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Determining whether a functional test has failed (changepoint) #780

Determining whether a functional test has failed (changepoint) #780

MichaelClerx commented Apr 15, 2019

chonlei commented Apr 16, 2019 •

edited

Loading

MichaelClerx commented May 16, 2019

MichaelClerx commented May 16, 2019

MichaelClerx commented Aug 2, 2019

abhidg commented Aug 2, 2019

MichaelClerx commented Aug 20, 2019

MichaelClerx commented Sep 28, 2019

MichaelClerx commented Sep 28, 2019

MichaelClerx commented Sep 28, 2019

MichaelClerx commented Jan 26, 2021

Determining whether a functional test has failed (changepoint) #780

Determining whether a functional test has failed (changepoint) #780

Comments

MichaelClerx commented Apr 15, 2019

chonlei commented Apr 16, 2019 • edited Loading

MichaelClerx commented May 16, 2019

MichaelClerx commented May 16, 2019

MichaelClerx commented Aug 2, 2019

abhidg commented Aug 2, 2019

MichaelClerx commented Aug 20, 2019

MichaelClerx commented Sep 28, 2019

MichaelClerx commented Sep 28, 2019

MichaelClerx commented Sep 28, 2019

MichaelClerx commented Jan 26, 2021

chonlei commented Apr 16, 2019 •

edited

Loading