A nice idea for accuracy of inference at different timepoints #2

hyanwong · 2019-10-06T14:44:03Z

Anders Eriksson suggested a nice way of testing whether our inference methods do well or poorly for different heights in the TS.

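We use the (infinite-sites) mutations to identify corresponding edges in the true and the inferred TS: the edge carrying a given mutation in one TS corresponds to the edge carrying the same mutation in the other. Then (since we are guaranteed that the tips under each are the same), we can calculate a topology difference between the subtrees rooted at the nodes below those two edges.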
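A minimal sketch of the edge-matching idea, for a single tree only and without tskit. Everything here is an assumption made for illustration: trees are plain `child -> parent` dicts, mutations are `site -> node below the mutated edge` dicts, the function names are hypothetical, and a Robinson-Foulds-style count of unshared clades stands in for whatever "topology difference" we end up using.

```python
from collections import defaultdict


def children_map(parent):
    """Invert a child -> parent dict into parent -> [children]."""
    kids = defaultdict(list)
    for child, par in parent.items():
        if par is not None:
            kids[par].append(child)
    return kids


def clades_below(parent, node):
    """Set of clades (frozensets of tip ids) in the subtree rooted at node."""
    kids = children_map(parent)
    clades = set()

    def visit(u):
        if not kids[u]:
            tips = frozenset([u])  # u has no children, so it is a tip
        else:
            tips = frozenset().union(*(visit(c) for c in kids[u]))
        clades.add(tips)
        return tips

    visit(node)
    return clades


def compare_by_mutation(true_parent, true_muts, inf_parent, inf_muts):
    """For each shared site: (tips under both edges agree?, unshared clade count)."""
    results = {}
    for site in sorted(true_muts.keys() & inf_muts.keys()):
        t = clades_below(true_parent, true_muts[site])
        i = clades_below(inf_parent, inf_muts[site])
        # The largest clade in each set is the full tip set of that subtree.
        same_tips = max(t, key=len) == max(i, key=len)
        results[site] = (same_tips, len(t ^ i))
    return results


# Toy example: tips 0-3 share ids across trees, internal node ids do not.
true_parent = {0: 4, 1: 4, 4: 5, 2: 5, 5: 6, 3: 6, 6: None}       # ((0,1),2),3
inf_parent = {1: 10, 2: 10, 10: 11, 0: 11, 11: 12, 3: 12, 12: None}  # ((1,2),0),3
true_muts = {100: 5, 200: 0}   # site -> node below the mutated edge
inf_muts = {100: 11, 200: 0}
results = compare_by_mutation(true_parent, true_muts, inf_parent, inf_muts)
```

In a real tree sequence the same comparison would presumably be repeated per tree along the genome, and plotting the difference against the true time of the matched node would give the accuracy-by-height picture.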

petrelharp · 2019-10-06T21:11:36Z

Nice. This gives us a way of identifying nodes too: nodes = ancestral haplotypes, so we can ask whether the mutations that originated on a given haplotype in one tree sequence originated on the same haplotype in the other.

hyanwong · 2020-01-29T12:03:18Z

Another possibility, as just discussed with Michelle Kendall, and particularly useful for tsinfer, where we have a known (simulated) TS with branch lengths and an inferred topology with arbitrary lengths: we take all nodes from the known topology whose times fall between certain timepoints, and select all the pairs of tips (with left-right coords if >1 tree) that split on one of these nodes, i.e. that have it as their most recent common ancestor. We then calculate a topology-only pairwise distance metric (e.g. KC) based on only those pairs over that portion of the genome.
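A rough sketch of the time-windowed version, again for a single tree with plain dicts rather than tskit. The assumptions: `windowed_kc` and its helpers are hypothetical names, trees are `child -> parent` dicts, "split on this node" is read as "has this node as MRCA", and the metric is only the pairwise part of KC with lambda = 0 (number of edges from the root to each pair's MRCA), restricted to the selected pairs.

```python
from itertools import combinations
from math import sqrt


def ancestors(parent, node):
    """Path from node up to the root, inclusive."""
    path = [node]
    while parent[path[-1]] is not None:
        path.append(parent[path[-1]])
    return path


def mrca_and_depth(parent, a, b):
    """Return (MRCA of a and b, number of edges from the root to that MRCA)."""
    path_a = ancestors(parent, a)
    on_b = set(ancestors(parent, b))
    for i, u in enumerate(path_a):
        if u in on_b:
            return u, len(path_a) - 1 - i
    raise ValueError("no common ancestor")


def windowed_kc(true_parent, true_time, inf_parent, samples, t_min, t_max):
    """Topology-only KC-style distance over pairs whose true MRCA time is in [t_min, t_max).

    Returns (distance, number of pairs used).
    """
    total = 0
    n_pairs = 0
    for a, b in combinations(sorted(samples), 2):
        node, d_true = mrca_and_depth(true_parent, a, b)
        if t_min <= true_time[node] < t_max:
            _, d_inf = mrca_and_depth(inf_parent, a, b)
            total += (d_true - d_inf) ** 2
            n_pairs += 1
    return sqrt(total), n_pairs


# Toy example: only the true tree needs node times.
true_parent = {0: 4, 1: 4, 4: 5, 2: 5, 5: 6, 3: 6, 6: None}       # ((0,1),2),3
true_time = {0: 0.0, 1: 0.0, 2: 0.0, 3: 0.0, 4: 1.0, 5: 2.0, 6: 3.0}
inf_parent = {1: 10, 2: 10, 10: 11, 0: 11, 11: 12, 3: 12, 12: None}  # ((1,2),0),3
recent = windowed_kc(true_parent, true_time, inf_parent, [0, 1, 2, 3], 0.5, 1.5)
old = windowed_kc(true_parent, true_time, inf_parent, [0, 1, 2, 3], 2.5, 3.5)
```

Returning the pair count alongside the distance matters because different time windows select different numbers of pairs, so the raw distances are not directly comparable without some normalisation. For >1 tree, each pair's contribution would presumably be weighted by the genomic span over which that MRCA holds.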
