Hi! Great code, thanks for sharing!
I noticed something a little odd in this code: is there a reason you chose to apply SiLU() right after the sinusoidal embedding?
It seems unnatural, since it may change the desired properties of the embedding.
Maybe you meant to use a learnable projection of the embedding instead, e.g. by adding this to the U-Net:
self.time_embed = nn.Sequential(
    nn.Linear(time_dim, time_dim),
    nn.SiLU(),
    nn.Linear(time_dim, time_dim),
)
And changing forward accordingly:
def forward(self, x, t):
    t = t.unsqueeze(-1).type(torch.float)
    t = self.pos_encoding(t, self.time_dim)
    t = self.time_embed(t)
Under these conditions, the SiLU() activation in each block's projection makes sense, since it is then simply the activation of the learned embedding.
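To make the suggestion concrete, here is a minimal self-contained sketch of the proposed pattern: a fixed sinusoidal encoding with no activation applied directly to it, followed by the learnable Linear-SiLU-Linear projection. The class name `TimeEmbedding`, the `pos_encoding` implementation, and the default `time_dim=256` are illustrative assumptions, not code from the original repository.

```python
import torch
import torch.nn as nn


class TimeEmbedding(nn.Module):
    """Sinusoidal timestep encoding followed by a learnable MLP projection.

    Illustrative sketch only: the sinusoidal formula and dimensions here
    are assumptions, not taken from the repository under discussion.
    """

    def __init__(self, time_dim: int = 256):
        super().__init__()
        self.time_dim = time_dim
        # Learnable projection of the fixed embedding (the suggestion above).
        self.time_embed = nn.Sequential(
            nn.Linear(time_dim, time_dim),
            nn.SiLU(),
            nn.Linear(time_dim, time_dim),
        )

    def pos_encoding(self, t: torch.Tensor, channels: int) -> torch.Tensor:
        # Standard sinusoidal encoding: half sine, half cosine features.
        inv_freq = 1.0 / (
            10000 ** (torch.arange(0, channels, 2).float() / channels)
        )
        sin_part = torch.sin(t.repeat(1, channels // 2) * inv_freq)
        cos_part = torch.cos(t.repeat(1, channels // 2) * inv_freq)
        return torch.cat([sin_part, cos_part], dim=-1)

    def forward(self, t: torch.Tensor) -> torch.Tensor:
        t = t.unsqueeze(-1).type(torch.float)    # (batch,) -> (batch, 1)
        t = self.pos_encoding(t, self.time_dim)  # fixed sinusoidal features
        return self.time_embed(t)                # learned projection


emb = TimeEmbedding(time_dim=256)
out = emb(torch.arange(4))  # embed four integer timesteps
print(out.shape)  # torch.Size([4, 256])
```

Note that no activation touches the raw sinusoidal features: the SiLU sits between the two learned linear layers, so it acts on a learned representation rather than distorting the fixed embedding, which is the point of the issue.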