why is the in_train_phase not working #9

Open
alecGraves opened this issue Oct 26, 2019 · 4 comments
Labels
bug Something isn't working

Comments

@alecGraves
Owner

return K.in_train_phase(reparameterization_trick, mean + 0*logvar, training=training) # TODO figure out why this is not working in the specified tf version???

in_train_phase should call the reparameterization function when the Keras backend is in its training phase... It does not appear to be running at all.
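For reference, a minimal standalone sketch of the behavior K.in_train_phase is supposed to have when the training flag is passed explicitly (this is not the repository's layer code, just the documented semantics):

import tensorflow as tf
from tensorflow.keras import backend as K

x = tf.constant([1.0, 2.0])

# the first argument may be a callable; it should only be evaluated in the training phase
out_train = K.in_train_phase(lambda: x * 10.0, x, training=True)   # -> [10., 20.]
out_eval  = K.in_train_phase(lambda: x * 10.0, x, training=False)  # -> [1., 2.]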

@alecGraves alecGraves added the bug Something isn't working label Oct 26, 2019
@alecGraves
Owner Author

It appears to work in later tf versions......

@Brandt-J

I am experiencing the same issue, directly in your repository with tensorflow 1.4, but also in my own implementation (using Dense instead of convolutional layers) running with tensorflow 2.3.0.
Apparently, it mostly calls the SamplingLayer with tensors of shape (None, latentSize), which fully bypasses the sampling and reparametrization function because of this line:

if mean.shape[0].value == None or logvar.shape[0].value == None:

However, when the length of my input tensor is a multiple of the batch size, the SamplingLayer is called with tensors of shape (batchsize, latentSize); THEN it goes into the sampling, but then I get different errors there.
But before investigating those further, shouldn't it mostly go into the sampling and reparametrization trick section?
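For what it's worth, the static batch dimension of a symbolic Keras input is generally None at graph-construction time, regardless of the batch_size later passed to fit(), so a check on the static shape tends to take the bypass branch. A small illustration (TF2-style shape access; the repository's line uses the TF1 .value attribute, but the idea is the same):

import tensorflow as tf

mean = tf.keras.Input(shape=(8,))  # symbolic tensor; batch dimension is unknown
print(mean.shape)                  # (None, 8)

# a check on the static batch dimension therefore usually takes the bypass branch
if mean.shape[0] is None:
    print("bypass: return mean + 0*logvar, no sampling")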

@sokrypton

sokrypton commented Jun 10, 2021

I believe that if you wrap the K.in_train_phase() call in a keras.layers.Lambda() layer, that should fix the bug.

I noticed that in newer tensorflow versions, when in eager mode, the training mode is only evaluated inside explicit layers. Wrapping the function in a Lambda layer should do the trick.
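If I understand the suggestion correctly, something like this (a sketch only; mean and logvar are assumed to be the symbolic encoder outputs, and the names are illustrative, not the repository's code):

from tensorflow.keras import layers, backend as K

def sample(args):
    mean, logvar = args

    def sampled():
        # draw eps ~ N(0, 1) with the same (dynamic) shape as mean
        eps = K.random_normal(shape=K.shape(mean))
        return mean + K.exp(0.5 * logvar) * eps

    # the learning phase is resolved inside this explicit layer
    return K.in_train_phase(sampled, mean + 0 * logvar)

z = layers.Lambda(sample)([mean, logvar])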

@Brandt-J

Hmm, that changes what happens a bit, but it does not solve my initial issue; let me rephrase. It could be that I am just using it wrong, I am pretty new to tensorflow :/

The data I want to train on consists of a set of 1-dimensional spectral data with, e.g., 1024 spectral frequencies, so my input_shape is (1024) - no additional dimension, as I am just working with some densely connected layers. So I might have, for example, 800 spectra for training, which gives me an 800x1024 input tensor. When I now want to fit the model to that data, different things happen depending on the batch_size, or, more precisely, the relation between batch_size and dataset size (number of spectra). If the dataset size is, e.g., (320 x 1024), it is a multiple of the batch_size (32) and the sampling layer actually performs the random sampling (as it receives a 32x1024 tensor as input). However, for all the cases where my spectra number (i.e. training dataset size) is NOT a multiple of the batch_size, the shape of the tensors going through the network is (None, 1024). Then in the sampling layer, the "# trick to allow setting batch at train/eval time" with the above line gets triggered and the sampling layer just returns (mean + 0*logvar), i.e. without doing the random sampling(?).

So, as said, probably I am just doing it wrong and should always pass in datasets whose first dimension is a multiple of the batch_size? In other cases with simple feed-forward networks that didn't seem to make any difference, but obviously it does here.
I don't really understand in which situations the "# trick to allow setting batch at train/eval time" branch should be executed and when not.
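For completeness, the usual way to make such a sampling layer independent of whether the batch dimension is statically known is to build the noise from the dynamic shape (K.shape) rather than the static one (mean.shape), so no branch on the dataset/batch_size relation is needed. A sketch of that idea, not the repository's code:

from tensorflow.keras import backend as K

def reparameterize(mean, logvar):
    # K.shape() is the runtime shape, so this also works when mean.shape[0] is None
    eps = K.random_normal(shape=K.shape(mean))
    return mean + K.exp(0.5 * logvar) * eps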
