Initial tree perf #5

jeromekelleher · 2021-07-26T09:31:46Z

Making a start on #2.

Puts in some infrastructure comparing the tskit version of Hartigan parsimony with the numba one. The idea would be to add (maybe) a naive C++ version using pointers, a more sophisticated C++ version, and a Pythran version. There's probably a better way to display the data, but here's the initial version of the plot anyway:

We could probably do a bit more work on the numba version, I haven't made any real effort to profile it.

@molpopgen, how does this look? I guess what we'd want for the C++ versions is an executable that takes a tree sequence file as input (and a max_sites), and prints to stdout the mean and variance of the time taken to run the parsimony algorithm for the first max_sites sites. We probably don't need to bother with the variance tbh, but I thought I'd stick it in in case we ever needed it. We can then wrap them nicely in this script.
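As a rough illustration of that contract (not code from this PR), the per-site timing could look something like the sketch below. The names time_per_site and format_report are hypothetical, and run_site stands in for whatever runs the parsimony algorithm on site j; the "mean variance" output format is also just an assumption.

```python
import statistics
import time


def time_per_site(run_site, num_sites, max_sites):
    """Time run_site(j) for the first max_sites sites; return (mean, variance)."""
    n = min(num_sites, max_sites)
    if n <= 0:
        raise ValueError("need at least one site to time")
    durations = []
    for j in range(n):
        before = time.perf_counter()
        run_site(j)  # hypothetical callback: run parsimony on site j
        durations.append(time.perf_counter() - before)
    mean = statistics.fmean(durations)
    # Population variance, so a single timed site still gives a value (0.0).
    variance = statistics.pvariance(durations)
    return mean, variance


def format_report(mean, variance):
    # Assumed output format: one line, "<mean> <variance>", in seconds.
    return f"{mean:.9g} {variance:.9g}"
```

A wrapper script would then only need to run each executable, read one line from stdout, and split it into two floats.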

molpopgen · 2021-07-26T13:14:23Z

I have a start over here. Minor hiccup is needing a non-recursive postorder method for node iteration. Once this matures a bit, I'll get it over here.
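For reference, one common way to get a postorder without recursion is to run a stack-based preorder and reverse it. The sketch below is in Python rather than C++ or Rust, and it assumes the tree is stored as tskit-style left_child / right_sib arrays with -1 meaning "no node"; the function name is hypothetical and this is not the code referred to above.

```python
def postorder_nodes(root, left_child, right_sib):
    """Return the nodes below root in postorder, using an explicit stack."""
    NULL = -1  # assumed sentinel for "no node"
    stack = [root]
    visited = []
    while stack:
        u = stack.pop()
        visited.append(u)
        # Push children left to right, so they are popped right to left.
        v = left_child[u]
        while v != NULL:
            stack.append(v)
            v = right_sib[v]
    # "Node, then children right to left", reversed, is
    # "children left to right, then node", i.e. postorder.
    visited.reverse()
    return visited
```

For a tree where node 0 has children 1 and 2, and node 1 has children 3 and 4, this visits both leaves under node 1 before node 1 itself, and the root last.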

jeromekelleher · 2021-07-26T14:03:19Z

It would be nice to get all the code in here if we could - I was imagining a simple Makefile with a few .cc files that we could run directly in here? Is it possible to do something similar with Rust?

benjeffery · 2021-07-26T15:05:10Z

Great stuff - I had a play around but couldn't get the numba code to go any faster, seems numba is already doing some of the tricks I tried and fastmath doesn't help here.

molpopgen · 2021-07-26T15:13:23Z

I think this is a good start. I think we may have a hard time matching this setup exactly in other languages, but we can see how it goes, I guess.

jeromekelleher · 2021-07-26T15:15:33Z

Sounds good @molpopgen - what I'll do is write a C version of what the other languages should do, which should provide a reasonable template, dealing with the annoying stuff like decoding the genotypes etc.

molpopgen · 2021-07-26T15:16:25Z

Sounds good. I'll try to concoct a C++ version of the Usher-like trees that isn't too hackish.

benjeffery · 2021-07-26T15:27:39Z

One thing to add is that I think numba would perform closer if it had a non-recursive implementation as in tskit.

jeromekelleher · 2021-07-26T15:57:33Z

> One thing to add is that I think numba would perform closer if it had a non-recursive implementation as in tskit.

Perhaps - we could have "numba-recursive" and "numba-loops"? Not too hard, if a bit fiddly to do. I'd be surprised if it makes much difference though, my earlier experience was that recursion was surprisingly good in numba. Go nuts if you want to try it out!
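To make the two variants concrete, here is a plain-Python sketch of a simplified Hartigan-style bottom-up pass written both ways. It is not the numba code in this PR and has no numba decorators; all function names are hypothetical. It assumes left_child / right_sib arrays with -1 for "no node", that every leaf has a known state in leaf_state, and that internal nodes are not samples.

```python
def children_of(u, left_child, right_sib):
    """Return the children of u, left to right (-1 marks 'no node')."""
    out = []
    v = left_child[u]
    while v != -1:
        out.append(v)
        v = right_sib[v]
    return out


def combine(child_sets, num_alleles):
    """Keep the alleles found in the most child sets; cost = children left out."""
    counts = [sum(1 for s in child_sets if a in s) for a in range(num_alleles)]
    best = max(counts)
    chosen = {a for a in range(num_alleles) if counts[a] == best}
    return chosen, len(child_sets) - best


def score_recursive(u, left_child, right_sib, leaf_state, num_alleles):
    """Recursive variant: returns (state set at u, cost in the subtree of u)."""
    kids = children_of(u, left_child, right_sib)
    if not kids:
        return {leaf_state[u]}, 0
    sets, total = [], 0
    for v in kids:
        s, c = score_recursive(v, left_child, right_sib, leaf_state, num_alleles)
        sets.append(s)
        total += c
    chosen, cost = combine(sets, num_alleles)
    return chosen, total + cost


def score_loops(root, left_child, right_sib, leaf_state, num_alleles):
    """Loop variant: same result, using an explicit stack instead of recursion."""
    sets = {}
    total = 0
    stack = [(root, False)]  # (node, have its children been pushed yet?)
    while stack:
        u, expanded = stack.pop()
        kids = children_of(u, left_child, right_sib)
        if not kids:
            sets[u] = {leaf_state[u]}
        elif not expanded:
            stack.append((u, True))  # come back once the children are done
            stack.extend((v, False) for v in kids)
        else:
            chosen, cost = combine([sets[v] for v in kids], num_alleles)
            sets[u] = chosen
            total += cost
    return sets[root], total
```

Both functions should agree on any input, so a benchmark of the two would be measuring only the traversal strategy.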

jeromekelleher added 5 commits July 26, 2021 09:08

Initial framework for performance comparison

c396564

Script to make data.

3541bc9

Rename

6d06d29

Infrastructure for comparison

866fe65

Numba version and benchmarks

9bf01c1

jeromekelleher merged commit 54fe887 into tskit-dev:main Jul 26, 2021

jeromekelleher deleted the initial-tree-perf branch July 26, 2021 15:14
