
make_trainable() does not freeze weights #10

Open
embanner opened this issue Mar 6, 2017 · 5 comments

embanner commented Mar 6, 2017

You define a function make_trainable() which sets every layer's trainable attribute to either True or False and call this repeatedly during training. However, setting keras.layers.Layer.trainable doesn't have any effect unless you follow it up with recompiling the model. So I'm pretty sure that your layers are unfrozen during the entire training process since you only compile once.

I'll take a stab at verifying this shortly.
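A minimal sketch of the recompile-to-freeze behavior described above (assuming TensorFlow's Keras; the model, layer names, and data are illustrative, not taken from this repo):

```python
import numpy as np
from tensorflow import keras

# Two-layer toy model; only the first layer is frozen so the
# optimizer still has trainable variables to update.
model = keras.Sequential([
    keras.Input(shape=(4,)),
    keras.layers.Dense(3, name="frozen_dense"),
    keras.layers.Dense(2, name="trained_dense"),
])
model.compile(optimizer="sgd", loss="mse")

# Flip the flag, then recompile so the new trainable set takes effect.
model.get_layer("frozen_dense").trainable = False
model.compile(optimizer="sgd", loss="mse")

before = [w.copy() for w in model.get_layer("frozen_dense").get_weights()]
model.train_on_batch(np.ones((2, 4)), np.ones((2, 2)))
after = model.get_layer("frozen_dense").get_weights()

# With the recompile in place, the frozen layer's weights are untouched.
frozen = all(np.array_equal(b, a) for b, a in zip(before, after))
```

Dropping the second `compile()` call is exactly the bug this issue describes: the training function built by the first `compile()` still treats every layer as trainable.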

embanner changed the title from "Does freezing/unfreezing layers work at all with keras?" to "Are you actually freezing/unfreezing layers?" on Mar 6, 2017

embanner commented Mar 6, 2017

Confirmed that make_trainable(discriminator, False) does not actually freeze the weights.

>>> discriminator.predict(X)
array([[ 0.52295244,  0.47704756],
       [ 0.54938567,  0.45061436]], dtype=float32)
>>> make_trainable(discriminator, False)
>>> discriminator.train_on_batch(X, y)
>>> discriminator.predict(X)
array([[ 0.4992643 ,  0.50073564],
       [ 0.64071965,  0.35928035]], dtype=float32)

embanner changed the title from "Are you actually freezing/unfreezing layers?" to "make_trainable() does not freeze weights" on Mar 6, 2017

li-js commented Mar 22, 2017

I think you are right: it is the re-compilation that actually freezes the weights.
Do you go further and re-compile the model inside the make_trainable() function?
I do something similar, and the program incrementally consumes more GPU memory on every iteration until an OOM error occurs.
Have you had a similar experience? Any help is appreciated.

embanner commented

Yes, I make a call to compile() inside make_trainable(), and indeed it slows things down quite a bit. I find it interesting that even without freezing the weights, the generator still produces good-quality images.
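A sketch of that pattern, with the recompile folded into the helper (the optimizer and loss here are illustrative assumptions, not necessarily what the repo uses):

```python
from tensorflow import keras

def make_trainable(model, val, optimizer="adam", loss="binary_crossentropy"):
    """Toggle every layer's trainable flag, then recompile so it takes effect."""
    for layer in model.layers:
        layer.trainable = val
    # The recompile is what applies the new flags -- and also what makes
    # this helper slow when called on every batch.
    model.compile(optimizer=optimizer, loss=loss)

# Toy model to exercise the helper.
model = keras.Sequential([
    keras.Input(shape=(4,)),
    keras.layers.Dense(2, activation="sigmoid"),
])
make_trainable(model, False)
```

After the call, `model.trainable_weights` is empty; the per-batch recompilation cost is the slowdown mentioned above.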

vforvinay commented

One thing I noticed, though I am not totally sure of it, is that by adding this line, the discriminator part of the GAN is made untrainable before the GAN is compiled. This means the discriminator model itself is trainable, but the discriminator part of the GAN is not, which is exactly what we want.


5agado commented Jul 28, 2017

With Keras 2.0.4 I tried make_trainable() as defined here, and checking with summary() I can see parameters switching from trainable to non-trainable without needing to call compile() again. I suggest checking this on your setup first.
Otherwise I also hit OOM errors when recompiling every time.
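One way to run that check without reading a full summary() table: count the parameters Keras currently reports as trainable, before and after toggling the flag (toy model, illustrative):

```python
import numpy as np
from tensorflow import keras

model = keras.Sequential([
    keras.Input(shape=(5,)),
    keras.layers.Dense(3),
])

def n_trainable(m):
    # Sum parameter counts over the weights currently flagged as trainable.
    return int(sum(np.prod(w.shape) for w in m.trainable_weights))

assert n_trainable(model) == 5 * 3 + 3  # kernel + bias

for layer in model.layers:
    layer.trainable = False

# summary() would now list these 18 parameters as non-trainable; whether
# training honors that without recompiling is version-dependent.
count_after = n_trainable(model)
```

The flag flips immediately in the model's bookkeeping, which is what summary() reflects; that is a separate question from whether an already-compiled training function respects it.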
