Added methods to sort and squash segments in the IbdFinder output. #2460

gtsambos · 2022-08-09T06:13:31Z

Fixes #2459 -- see the issue for discussion about why this is needed. Draft only at the moment.

PR Checklist:

Tests that fully cover new/changed functionality.
Documentation including tutorial content if appropriate.
Changelogs, if there are API changes.

codecov · 2022-08-09T06:26:03Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 89.72%. Comparing base (fb87f87) to head (7fe0bb8).
Report is 6 commits behind head on main.

Additional details and impacted files

@@           Coverage Diff           @@
##             main    #2460   +/-   ##
=======================================
  Coverage   89.72%   89.72%           
=======================================
  Files          29       29           
  Lines       31578    31578           
  Branches     6117     6117           
=======================================
  Hits        28333    28333           
  Misses       1853     1853           
  Partials     1392     1392

Flag	Coverage Δ
c-tests	`86.55% <ø> (ø)`
lwt-tests	`80.78% <ø> (ø)`
python-c-tests	`88.98% <ø> (ø)`
python-tests	`99.01% <ø> (ø)`

Flags with carried forward coverage won't be shown. Click here to find out more.

gtsambos · 2022-08-31T03:08:56Z

Hey @hyanwong! Unfortunately, we both made some pretty involved changes to the test_highlevel.py file in the past few days. I think I've fixed up the merge conflict, but I can't figure out how to get my tests to run with the argument that I added into get_example_tree_sequences(). Currently I'm just skipping these tests because I don't fully understand the syntax of your additions -- do you know how to ensure these tests 'see' the argument I've added in?

hyanwong · 2022-09-01T09:28:00Z

It was @benjeffery who fixed up get_example_tree_sequences() to run in parallel, cached, so that parameterize worked properly. He might be able to suggest a fix?

benjeffery · 2022-09-02T09:40:06Z

I'll take a look now

benjeffery · 2022-09-02T10:22:47Z

I think the simplest thing here is to remove custom_max.
I've done this in a3aa3da
Currently, I've set the N numbers to those generated when custom_max was 15. Once all the IBD tests pass we should add in some even numbers again and skip them in the IBD tests? (I'm assuming that you need them to be odd, as all those generated by custom_max were?

gtsambos · 2022-09-07T03:15:43Z

Thanks @benjeffery and sorry for the late reply -- I had to take a few days off to deal with some urgent moving-to-the-US things.

Currently, I've set the N numbers to those generated when custom_max was 15. Once all the IBD tests pass we should add in some even numbers again and skip them in the IBD tests? (I'm assuming that you need them to be odd, as all those generated by custom_max were?

I added this argument because some of my changes in this PR make IbdFinder run much more slowly, to the point where they look like they're hanging. (This is likely a Python-specific problem -- as we discuss in #2459, the C code works differently because it uses an AVL tree to do the sorting). But I didn't want to omit the tests entirely, as they cover some useful edge cases, so I wanted to run them with smaller sample sizes than the ones hard-coded in.

benjeffery · 2022-09-07T21:59:53Z

I added this argument because some of my changes in this PR make IbdFinder run much more slowly, to the point where they look like they're hanging. (This is likely a Python-specific problem -- as we discuss in #2459, the C code works differently because it uses an AVL tree to do the sorting). But I didn't want to omit the tests entirely, as they cover some useful edge cases, so I wanted to run them with smaller sample sizes than the ones hard-coded in.

We can leave the slow examples in the examples - but skip them in these tests?

Made some small changes to ibd.py to ensure consistent behaviour in tests

benjeffery · 2024-09-20T14:13:28Z

I've rebased this to main - the tests here only add a few seconds on my machine, will see home CI behaves. @jeromekelleher I think this is pretty complete if you want to have a quick look to check?

…again

benjeffery · 2024-09-23T16:09:00Z

@Mergifyio update

mergify · 2024-09-23T16:09:05Z

update

✅ Branch has been successfully updated

gtsambos force-pushed the squash-ibd branch from c3df389 to 49ce776 Compare August 31, 2022 03:04

gtsambos mentioned this pull request Sep 7, 2022

Provide option to squash IBD output? #2459

Closed

gtsambos mentioned this pull request Sep 8, 2022

Fixed ibd.py CLI so that it works with the latest argument inputs #2497

Merged

gtsambos added 5 commits September 20, 2024 14:50

Added methods to sort and squash segments in the IbdFinder output.

220b729

Modified where the min_span argument is used.

146f418

Changed tests to use squashing behaviour.

3ad9d0c

Made some small changes to ibd.py to ensure consistent behaviour in tests

Updated definition of naive IBD.

98aec16

Rehashed squashing procedure in ibd.py for clarity

48d2b92

benjeffery force-pushed the squash-ibd branch from 49ce776 to ce924de Compare September 20, 2024 14:12

Various changes to make the merge conflict go away and the tests run …

62115b3

…again

benjeffery force-pushed the squash-ibd branch from ce924de to 62115b3 Compare September 20, 2024 14:15

benjeffery added this to the Python 0.5.9 milestone Sep 20, 2024

benjeffery added AUTOMERGE-REQUESTED Ask Mergify to merge this PR and removed AUTOMERGE-REQUESTED Ask Mergify to merge this PR labels Sep 23, 2024

benjeffery approved these changes Sep 23, 2024

View reviewed changes

Merge branch 'main' into squash-ibd

7fe0bb8

mergify bot merged commit d6b2303 into tskit-dev:main Sep 23, 2024
21 checks passed

mergify bot removed the AUTOMERGE-REQUESTED Ask Mergify to merge this PR label Sep 23, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Added methods to sort and squash segments in the IbdFinder output. #2460

Added methods to sort and squash segments in the IbdFinder output. #2460

gtsambos commented Aug 9, 2022

codecov bot commented Aug 9, 2022 •

edited

Loading

gtsambos commented Aug 31, 2022

hyanwong commented Sep 1, 2022

benjeffery commented Sep 2, 2022

benjeffery commented Sep 2, 2022

gtsambos commented Sep 7, 2022

benjeffery commented Sep 7, 2022

benjeffery commented Sep 20, 2024 •

edited

Loading

benjeffery commented Sep 23, 2024

mergify bot commented Sep 23, 2024

Added methods to sort and squash segments in the IbdFinder output. #2460

Added methods to sort and squash segments in the IbdFinder output. #2460

Conversation

gtsambos commented Aug 9, 2022

PR Checklist:

codecov bot commented Aug 9, 2022 • edited Loading

Codecov Report

gtsambos commented Aug 31, 2022

hyanwong commented Sep 1, 2022

benjeffery commented Sep 2, 2022

benjeffery commented Sep 2, 2022

gtsambos commented Sep 7, 2022

benjeffery commented Sep 7, 2022

benjeffery commented Sep 20, 2024 • edited Loading

benjeffery commented Sep 23, 2024

mergify bot commented Sep 23, 2024

✅ Branch has been successfully updated

codecov bot commented Aug 9, 2022 •

edited

Loading

benjeffery commented Sep 20, 2024 •

edited

Loading