Node times remain biased after multiple EP rounds (and bias depends on number of nodes) #444

hyanwong · 2024-12-11T13:28:17Z

This is redating the Quebec genealogy, so there shouldn't be too many polytomies to mess things up. Here we are simply plotting the difference between true and (unconstrained) tsdate times for all nodes.

I was expecting these to asymptote to 0, at least on the linear scale (left), even if not on a log scale (right). It's weird to me that they don't

nspope · 2024-12-11T17:06:32Z

Hmm ... why would you expect them to not have error? They should converge to the (true) posterior mean with more EP iterations, which is not going to equal the true times. In fact, the posterior mean may be systematically larger than the true times, if the posterior is right-skewed. Estimates should converge to the true times as you add more mutations (that is, the posterior is consistent)

Rather than looking at posterior mean vs true time, the quantity to look at to assess calibration is expected vs observed coverage.

hyanwong · 2024-12-11T17:12:41Z

Hmm, I see. That sounds reasonable. I'll try a coverage plot too.

hyanwong · 2024-12-11T17:14:00Z

(I would expect error, but not such clear bias)

nspope · 2024-12-11T17:20:29Z

If the posterior is right-skewed, it's reasonable to see positive bias (and even without skew there's no guarantee that the posterior mean is an unbiased estimator-- rather the opposite, with priors one trades variance for bias, though we're not using a very strong prior here at all). It's probably worth plotting error vs age to see what nodes this is coming from?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Node times remain biased after multiple EP rounds (and bias depends on number of nodes) #444

Node times remain biased after multiple EP rounds (and bias depends on number of nodes) #444

hyanwong commented Dec 11, 2024

nspope commented Dec 11, 2024 •

edited

Loading

hyanwong commented Dec 11, 2024

hyanwong commented Dec 11, 2024

nspope commented Dec 11, 2024 •

edited

Loading

Node times remain biased after multiple EP rounds (and bias depends on number of nodes) #444

Node times remain biased after multiple EP rounds (and bias depends on number of nodes) #444

Comments

hyanwong commented Dec 11, 2024

nspope commented Dec 11, 2024 • edited Loading

hyanwong commented Dec 11, 2024

hyanwong commented Dec 11, 2024

nspope commented Dec 11, 2024 • edited Loading

nspope commented Dec 11, 2024 •

edited

Loading

nspope commented Dec 11, 2024 •

edited

Loading