Add a batch normalizer that has test and train hooks #11
Conversation
A suggested way to do this is to make execution of the
model optimizer force it, e.g., by:

update_assignments = tf.group(bn1.get_assigner(),
I almost get this part but not quite 😜 How does it relate to the optimizer?
Do you think there's a mistake in this example here? The line optimizer = tf.group(update_assignments)
doesn't make much sense to me, especially considering this is the second grouping of the updates. Also, I assume we'd just run
# skipping quite a bit
loss = create_loss_graph()
train = optimizer.minimize(loss)
with sess.as_default():
    # get the update assignments and batch normalize
    ...
    update_assignments.run()
    train.run(batch_feed_stuff)
I know the documentation for the moving average also calls tf.group twice for some reason. Am I missing something?
So, basically, what this is saying is that you should set things up so the optimizer has to run before the assigners do; then, instead of running the optimizer, you run the assigners, which automatically run the optimizer. I think the second group can probably be replaced with an identity, but I'm not 100% sure (see below).
From the doc https://www.tensorflow.org/versions/r0.8/api_docs/python/framework.html#Graph.control_dependencies
# WRONG
def my_func(pred, tensor):
  t = tf.matmul(tensor, tensor)
  with tf.control_dependencies([pred]):
    # The matmul op is created outside the context, so no control
    # dependency will be added.
    return t

# RIGHT
def my_func(pred, tensor):
  with tf.control_dependencies([pred]):
    # The matmul op is created in the context, so a control dependency
    # will be added.
    return tf.matmul(tensor, tensor)
Oh, so optimizer is just a reference to the optimizer, and now that it's run by control_dependencies, it's ok to re-assign it to the updater ops?
So, basically what it's saying is
opt = trainer.minimize(loss)
with tf.control_dependencies([opt]):
    new_opt = tf.group(<all batch norm updates>)
So, new_opt is actually the batch norm updates, but it calls the minimizer when you run it.
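To make the ordering concrete, here is a minimal, self-contained sketch of that pattern. It substitutes a plain ExponentialMovingAverage update for the BatchNormalizer assigners and makes up a toy variable and loss, so none of these names come from the PR; the point is just that running train_op forces the minimize step to run first.

import tensorflow as tf

# Toy variable and loss; stand-ins for the real model graph.
w = tf.Variable(1.0)
loss = tf.square(w - 3.0)

# Stand-in for the BatchNormalizer's moving-average assigners.
ema = tf.train.ExponentialMovingAverage(decay=0.99)

opt = tf.train.GradientDescentOptimizer(0.1).minimize(loss)
with tf.control_dependencies([opt]):
    # The update op is created inside the context (the "RIGHT" case from
    # the docs above), so running train_op runs the minimize step and only
    # then the moving-average update.
    train_op = tf.group(ema.apply([w]))

with tf.Session() as sess:
    sess.run(tf.initialize_all_variables())
    for _ in range(100):
        sess.run(train_op)
    print(sess.run(ema.average(w)))  # the shadow value trails w toward 3.0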
Nice code. But it seems like it requires the user to know things, and I wish it didn't. Maybe the assignment stuff, which I don't quite understand, makes this hard. I like the class interface, but instead of the user doing:

ewma = tf.train.ExponentialMovingAverage(decay=0.99)
bn = BatchNormalizer(input, 0.001, ewma, True)
update_assignments = bn.get_assigner()
x = bn.normalize(y, train=training)

can the user do:

x, update_assignments = batch_normalize(input, phrase_train, name)
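Not to put words in the author's mouth, but a one-call interface like that could probably just wrap the existing class. A rough sketch, where the function itself is hypothetical and the argument names and 0.99/0.001 constants are simply carried over from the snippet above:

def batch_normalize(input, phrase_train, name):
    # Hypothetical wrapper, not part of this PR: builds the BatchNormalizer
    # internally and returns both the normalized tensor and the assigner op,
    # so the caller never has to touch the EWMA object.
    ewma = tf.train.ExponentialMovingAverage(decay=0.99)
    bn = BatchNormalizer(input, 0.001, ewma, name)
    update_assignments = bn.get_assigner()
    x = bn.normalize(input, train=phrase_train)
    return x, update_assignments

The caller would still need to run update_assignments (or group it with the train op as discussed above), so the hook doesn't disappear; it just gets surfaced in one place.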
ewma = tf.train.ExponentialMovingAverage(decay=0.99)
bn = BatchNormalizer(input, 0.001, ewma, True)
update_assignments = bn.get_assigner()
x = bn.normalize(y, train=training?)
There is a stray ? on this line.
Is it usual to include weight decay for bn parameters?
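For what it's worth (my understanding, not something settled in this PR), weight decay is usually not applied to the batch-norm scale and shift parameters. A sketch of one way to leave them out, assuming the variables are created with 'beta'/'gamma' in their names and that model_loss is the task loss built elsewhere in the graph:

# Sketch only: exclude batch-norm parameters from L2 weight decay by name.
decayed = [v for v in tf.trainable_variables()
           if 'beta' not in v.name and 'gamma' not in v.name]
l2_penalty = 1e-4 * tf.add_n([tf.nn.l2_loss(v) for v in decayed])
total_loss = model_loss + l2_penalty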
""" | ||
|
||
def __init__(self, input, epsilon, ewma_trainer, name): | ||
rank = len(input.get_shape().as_list()) |
Why not rank = len(input.get_shape())?
I tried it, works fine.
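That works because get_shape() returns a TensorShape, and TensorShape supports len() directly when the rank is known; as_list() just adds an extra list round-trip. A quick check (the shape here is made up):

import tensorflow as tf

x = tf.placeholder(tf.float32, [None, 32, 32, 3])
shape = x.get_shape()            # a TensorShape with known rank
print(len(shape))                # 4
print(len(shape.as_list()))      # 4 -- same value, extra list conversion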
Force-pushed from a1eb434 to 0cbf633
Conflicts: aiutils/tftools/batch_normalizer.py
So I'm pretty happy with the final interface, but I wish the class wasn't so big 😿. I guess we can leave that as an exercise for future motivated collaborators. TODO: Shrink the implementation while maintaining the same behavior and interface. So LGTM 🎉 (once you get the tests to pass).