Add part: lstm block #66
base: main
Conversation
One minor Q.
i6_models/parts/lstm.py (outdated)

enforce_sorted: bool

@classmethod
def from_dict(cls, model_cfg_dict: Dict):
Same Q as in the other PR: why is this necessary now, and hasn't been for the other assemblies?
Co-authored-by: Albert Zeyer <[email protected]>
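For context on the from_dict question above, here is a minimal sketch of that classmethod pattern; apart from enforce_sorted, the field names are placeholders and not taken from the actual i6_models config:

```python
from dataclasses import dataclass
from typing import Dict


@dataclass
class LstmBlockConfigSketch:
    # Hypothetical fields; only enforce_sorted appears in the quoted diff.
    input_dim: int
    hidden_dim: int
    enforce_sorted: bool

    @classmethod
    def from_dict(cls, model_cfg_dict: Dict) -> "LstmBlockConfigSketch":
        # Rebuild the config from a plain dict, e.g. one coming from a serialized setup.
        return cls(**model_cfg_dict)
```

Whether such a helper is needed here, or the dict could simply be unpacked into the constructor at the call site, is exactly the question raised above.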
if seq_len.get_device() >= 0:
    seq_len = seq_len.cpu()
Suggested change:

- if seq_len.get_device() >= 0:
-     seq_len = seq_len.cpu()
+ seq_len = seq_len.cpu()
)

def forward(self, x: torch.Tensor, seq_len: torch.Tensor) -> Tuple[torch.Tensor, torch.Tensor]:
    if not torch.jit.is_scripting() and not torch.jit.is_tracing():
Why only when not scripting? Don't you want seq_len to always be on the CPU?
I followed the example in the blstm part.
if not torch.jit.is_scripting() and not torch.jit.is_tracing():
    # during graph mode we have to assume all Tensors are on the correct device,
    # otherwise move lengths to the CPU if they are on GPU
    if seq_len.get_device() >= 0:
        seq_len = seq_len.cpu()
I did not copy the comment over... I have not yet had a chance to look into why this is necessary.
@JackTemaki you implemented the BLSTM, IIRC. Do you remember why it was done this way?
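For reference, a self-contained sketch of the pattern under discussion (the jit guard plus the CPU move as quoted from the BLSTM part); the module name, layer sizes, and packing/unpacking details are illustrative, not the actual i6_models implementation:

```python
import torch
from torch import nn
from torch.nn.utils.rnn import pack_padded_sequence, pad_packed_sequence
from typing import Tuple


class LstmSketch(nn.Module):
    """Illustrative module, not the actual i6_models LSTM part."""

    def __init__(self, input_dim: int = 80, hidden_dim: int = 512):
        super().__init__()
        self.lstm = nn.LSTM(input_dim, hidden_dim, batch_first=True)

    def forward(self, x: torch.Tensor, seq_len: torch.Tensor) -> Tuple[torch.Tensor, torch.Tensor]:
        if not torch.jit.is_scripting() and not torch.jit.is_tracing():
            # pack_padded_sequence expects the lengths on the CPU; in eager mode they may
            # arrive on the GPU together with the features, so move them over first.
            # get_device() is -1 for CPU tensors and the device index (>= 0) for CUDA tensors.
            if seq_len.get_device() >= 0:
                seq_len = seq_len.cpu()
        packed = pack_padded_sequence(x, seq_len, batch_first=True, enforce_sorted=False)
        out, _ = self.lstm(packed)
        out, _ = pad_packed_sequence(out, batch_first=True)
        return out, seq_len


# Usage sketch:
model = LstmSketch()
x = torch.randn(3, 5, 80)          # [batch, time, feature]
seq_len = torch.tensor([5, 3, 2])  # on CUDA this is what the guard moves to the CPU
out, seq_len = model(x, seq_len)   # out: [3, 5, 512]
```

In eager mode this works whether seq_len arrives on the CPU or the GPU, since torch.nn.utils.rnn.pack_padded_sequence requires its lengths argument on the CPU; under scripting or tracing the guard is skipped and the lengths are assumed to already be on the right device.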
Co-authored-by: Albert Zeyer <[email protected]>
Adds LSTM Block