Integrate the multihit model into the DNSM framework #60

matsen · 2024-09-25T11:27:19Z

Here we add the multihit model to the DNSM. We chose to do so by storing the multihit model, as an object, as part of the DNSMDataset object, and bringing it into molevol.py.

This is a bit of a break from the previous work, which was able to stay in tensor land rather than object land. E.g. molevol.py now has objects from models.py.

Here is the rationale.

Previously, we could pre-compute the per-site rates and then all future calculations would be relative to those rates. To emphasize, the per-site rates don't depend on the branch lengths or anything.

Here, we have the multihit multipliers being applied after the codon level calculations. These codon calculations depend on the branch lengths and rates in a nonlinear way. Thus we can't precompute rates and then apply them in a straightforward way.

In theory we could stay more in tensor land by precomputing a 64*64 (this is codon by codon, because the hit classes depend on the source and dest codon) offset per site per sequence, and multiplying that after the codon aggregation step. However, that seems wasteful of memory and overkill.

So we have decided to apply the multihit correction using the forward function of the corresponding object, which is exactly what it's meant to do.

In any case, all of this work happens only in the branch length optimization phase of making the DNSMs, and not the transformer backprop.

matsen

Hey, this is looking almost done!

matsen · 2024-10-03T13:41:55Z

netam/dnsm.py

@@ -142,6 +150,7 @@ def clone(self):
            self.all_rates.copy(),
            self.all_subs_probs.copy(),
            self._branch_lengths.copy(),
+            multihit_model=copy.deepcopy(self.multihit_model),


This deepcopy and the one below are no longer necessary now that you are copying in the init, right?

matsen · 2024-10-03T13:43:30Z

tests/test_dnsm.py

@@ -14,6 +15,15 @@
 from netam.dnsm import DNSMBurrito, train_val_datasets_of_pcp_df


+def force_spawn():


We were going to move this into common, right?

(and remove the other instance)

…ion application

matsen and others added 5 commits September 27, 2024 16:46

in-person drafting

1aac535

initial modification of functions

a2e02e4

Add multihit model in a few more places

9d0b7bf

make format

9d48955

fix shape issue

a73a6ac

willdumm force-pushed the 54-multihit-dnsm branch from faf5239 to a73a6ac Compare September 27, 2024 23:48

willdumm and others added 5 commits September 27, 2024 17:04

fix rebase issue, but tests still fail

35e71b2

force_spawn in tests

2ff2524

reformat

10db510

switch to serial branch length optimization

9b7a186

multihit works with threading

d44a608

matsen commented Oct 3, 2024

View reviewed changes

matsen and others added 9 commits October 3, 2024 06:44

comment

e620b15

WIP-- nothing works now

75c4f52

fix merge

11a6dbe

test+fake multihit working

ae6cf4d

re-enable multihit

2f83eb6

perhaps working, fixed device mismatch and switched to nonlog correct…

4f72778

…ion application

I think this might have worked

2aaa52c

verified working here

3649665

cleanup for PR

48da873

willdumm mentioned this pull request Oct 23, 2024

Integrate the multihit model into the DNSM framework #71

Merged

willdumm closed this Oct 23, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Integrate the multihit model into the DNSM framework #60

Integrate the multihit model into the DNSM framework #60

matsen commented Sep 25, 2024 •

edited

Loading

matsen left a comment

matsen Oct 3, 2024

matsen Oct 3, 2024

matsen Oct 3, 2024

		@@ -14,6 +15,15 @@
		from netam.dnsm import DNSMBurrito, train_val_datasets_of_pcp_df


		def force_spawn():

Integrate the multihit model into the DNSM framework #60

Integrate the multihit model into the DNSM framework #60

Conversation

matsen commented Sep 25, 2024 • edited Loading

matsen left a comment

Choose a reason for hiding this comment

matsen Oct 3, 2024

Choose a reason for hiding this comment

matsen Oct 3, 2024

Choose a reason for hiding this comment

matsen Oct 3, 2024

Choose a reason for hiding this comment

matsen commented Sep 25, 2024 •

edited

Loading