Integrate the multihit model into the DNSM framework #71

willdumm · 2024-10-23T15:45:58Z

This PR replaces #60, and is a companion to https://github.com/matsengrp/dnsm-experiments-1/pull/29

This PR adds the multihit model to the DNSM. We do so by storing the multihit model as an object on the DNSMDataset object and passing it into molevol.py.

This is a bit of a break from the previous work, which was able to stay in tensor land rather than object land: for example, molevol.py now works with objects from models.py.

Here is the rationale.

Previously, we could precompute the per-site rates, and all later calculations would be relative to those rates. To emphasize, the per-site rates don't depend on the branch lengths.
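As a minimal sketch of why precomputation was enough before, assume a Poisson-style model in which a site with rate `r` mutates along a branch of length `t` with probability `1 - exp(-r * t)`. The function name and the rate values below are made up for illustration and are not taken from netam.

```python
import math


def site_mutation_probs(rates, branch_length):
    """Per-site mutation probability under the assumed 1 - exp(-r * t) model."""
    return [1.0 - math.exp(-r * branch_length) for r in rates]


# Hypothetical per-site rates, computed once per parent sequence.
precomputed_rates = [0.5, 1.0, 2.0]

# The same cached rates serve every candidate branch length.
for t in (0.01, 0.05, 0.1):
    print(t, site_mutation_probs(precomputed_rates, t))
```

The branch length only enters at evaluation time, so the rates can be cached and reused across every step of the branch length search.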

Here, the multihit multipliers are applied after the codon-level calculations. These codon calculations depend on the branch lengths and rates in a nonlinear way, so we can't precompute rates and then apply them in a straightforward way.

In theory we could stay more in tensor land by precomputing a 64*64 offset per site per sequence (codon by codon, because the hit classes depend on the source and destination codons) and multiplying by it after the codon aggregation step. However, that seems wasteful of memory and overkill.
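The nonlinearity can be illustrated with a toy calculation. This sketch assumes the same `1 - exp(-r * t)` site model, groups codon outcomes by the number of mutated sites, scales classes 1 to 3 by hypothetical multipliers, and lets class 0 absorb the remainder. The renormalization rule and all numbers are assumptions for illustration, not the netam implementation.

```python
import math
from itertools import product


def hit_class_probs(rates, t):
    """P(exactly k of the 3 codon sites mutate), for k = 0..3."""
    p = [1.0 - math.exp(-r * t) for r in rates]
    out = [0.0] * 4
    for pattern in product((0, 1), repeat=3):
        prob = 1.0
        for hit, p_i in zip(pattern, p):
            prob *= p_i if hit else 1.0 - p_i
        out[sum(pattern)] += prob
    return out


def apply_multipliers(probs, multipliers):
    """Scale hit classes 1..3, then let class 0 absorb the remainder (assumed rule)."""
    scaled = [m * q for m, q in zip(multipliers, probs[1:])]
    return [1.0 - sum(scaled)] + scaled


rates = [0.5, 1.0, 2.0]        # hypothetical per-site rates for one codon
multipliers = [1.0, 1.5, 2.0]  # hypothetical factors for 1, 2, and 3 hits


def mutated_ratio(t):
    """Corrected over uncorrected probability that the codon changes at all."""
    raw = hit_class_probs(rates, t)
    fixed = apply_multipliers(raw, multipliers)
    return (1.0 - fixed[0]) / (1.0 - raw[0])


for t in (0.01, 0.05, 0.1):
    print(t, round(mutated_ratio(t), 4))
```

The ratio drifts with the branch length because longer branches put more mass on the two-hit and three-hit classes, so no single rescaling of the cached rates reproduces the correction for every branch length.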
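To put a rough number on the memory concern, here is a back-of-the-envelope sketch. The hit class is taken to be the number of nucleotide positions at which two codons differ; the dataset size, codon count, and 4-byte float width are hypothetical choices, not figures from this PR.

```python
from itertools import product

BASES = "ACGT"
CODONS = ["".join(c) for c in product(BASES, repeat=3)]

# Hit class = number of positions at which source and destination codons differ.
# This 64 x 64 table depends only on the codons, so one copy can be shared.
hit_class = [
    [sum(a != b for a, b in zip(src, dst)) for dst in CODONS]
    for src in CODONS
]


def dense_offset_bytes(n_sequences, codons_per_sequence, bytes_per_value=4):
    """Size of a dense 64 x 64 offset stored per site per sequence."""
    return n_sequences * codons_per_sequence * 64 * 64 * bytes_per_value


# Hypothetical dataset: 100,000 sequences of 150 codons each, float32 values.
print(dense_offset_bytes(100_000, 150) / 1e9, "GB")
```

Under these assumed sizes the dense offsets would take about 246 GB, whereas the hit-class table itself is a single 64 by 64 array plus one multiplier per hit class.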

So we have decided to apply the multihit correction using the forward function of the corresponding object, which is exactly what it's meant to do.
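The object-based design can be sketched as follows, in plain Python rather than torch. `ToyMultihitModel`, `ToyDataset`, and `maybe_correct` are hypothetical names invented for this example, and the rule that the unmutated codon absorbs the leftover probability is an assumption rather than a description of the netam code.

```python
class ToyMultihitModel:
    """Hypothetical stand-in for a multihit model object."""

    def __init__(self, multipliers):
        # multipliers[k - 1] scales child codons that are k hits from the parent.
        self.multipliers = list(multipliers)

    def forward(self, hit_classes, codon_probs):
        """Scale mutated codons by hit class; expects exactly one class-0 entry."""
        scaled = [
            p if k == 0 else p * self.multipliers[k - 1]
            for k, p in zip(hit_classes, codon_probs)
        ]
        mutated_mass = sum(p for k, p in zip(hit_classes, scaled) if k > 0)
        # Assumption: the unmutated codon absorbs the remaining probability.
        return [
            1.0 - mutated_mass if k == 0 else p
            for k, p in zip(hit_classes, scaled)
        ]

    __call__ = forward


class ToyDataset:
    """Hypothetical dataset that carries the model object alongside its data."""

    def __init__(self, branch_lengths, multihit_model=None):
        self.branch_lengths = branch_lengths
        self.multihit_model = multihit_model


def maybe_correct(dataset, hit_classes, codon_probs):
    """Apply the stored model's forward function if one is present."""
    if dataset.multihit_model is None:
        return list(codon_probs)
    return dataset.multihit_model(hit_classes, codon_probs)


dataset = ToyDataset([0.05], ToyMultihitModel([1.0, 1.5, 2.0]))
hit_classes = [0, 1, 1, 2, 3]  # toy subset of child codons for one parent
codon_probs = [0.90, 0.04, 0.03, 0.02, 0.01]
corrected = maybe_correct(dataset, hit_classes, codon_probs)
print(corrected)
```

Carrying the model on the dataset means the likelihood code only needs a callable, and a dataset without a multihit model falls back to the uncorrected probabilities.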

In any case, all of this work happens only in the branch length optimization phase of making the DNSMs, and not the transformer backprop.

Add multihit model in a few more places

make format

fix shape issue

fix rebase issue, but tests still fail

force_spawn in tests

reformat

switch to serial branch length optimization

multihit works with threading

WIP-- nothing works now

comment test+fake multihit working

re-enable multihit

perhaps working, fixed device mismatch and switched to nonlog correction application

I think this might have worked

verified working here

cleanup for PR

matsen

Some small things here to fix.

netam/hit_class.py

netam/models.py

netam/molevol.py

tests/test_dasm.py

tests/test_dnsm.py

matsen

Two last nitpicks.

netam/dnsm.py

netam/molevol.py

matsen

Great!

matsen and others added 3 commits October 22, 2024 16:24

in-person drafting

d55fed0

format and fix dasm test

f1684fa

willdumm changed the title ~~54 multihit dnsm clean~~ Integrate the multihit model into the DNSM framework Oct 23, 2024

willdumm added 2 commits October 23, 2024 09:06

respond to Erick's comments

3c46cd3

remove multihit warning

4982d4a

matsen requested changes Oct 23, 2024

willdumm added 2 commits October 23, 2024 11:01

remove unused imports, reformat

b26de87

respond to Erick's new comments

da20797

matsen requested changes Oct 23, 2024

netam/dnsm.py (resolved)

netam/molevol.py (outdated, resolved)

Respond to Erick's comments

62d3479

matsen approved these changes Oct 23, 2024

matsen merged commit a5ef50d into main Oct 23, 2024
1 check passed

matsen mentioned this pull request Oct 23, 2024

Integrate multihit models into the DNSM framework #54

Closed

5 tasks
