Added Batch Normalization Layer modules #157

Draft
wants to merge 9 commits into main
Conversation

Spnetic-5
Collaborator

Addresses #155

@milancurcic, I've included the structure of the batch normalization layer. Could you please review it and confirm whether I'm headed in the right direction?

@milancurcic
Member

Thank you @Spnetic-5, great. Yes, this is certainly in the right direction. I haven't yet reviewed the forward and backward algorithms; let me know when it's ready for review.

Also, independently of #156, we should be able to test this layer implementation on its own, without integrating it with the network. In other words, while #156 will be necessary for full integration, we can work on this implementation as a standalone layer before #156 is done.

@Spnetic-5
Collaborator Author

@milancurcic Please review the forward and backward pass implementations I've added based on my interpretation of the paper. Also, could you guide me on how we can test this layer?

@milancurcic
Member

Thanks! Let's review it briefly on the call today.

We can test this layer independently by passing some small, known input and comparing the result with the corresponding known output. This should be straightforward since the batchnorm operation is relatively simple (just a normalization of the data). The backward pass is the inverse operation, so, as I understand it, we can pass the expected output back through it to recover the expected input.
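For illustration, here is a minimal standalone sketch of that round-trip idea, using plain arrays rather than any neural-fortran type; the epsilon value and the gamma = 1, beta = 0 choice are assumptions made for simplicity:

program batchnorm_roundtrip_sketch
  implicit none
  integer, parameter :: n = 4
  real, parameter :: eps = 1e-5
  real :: x(n), y(n), x_back(n), mu, var

  x = [1.0, 2.0, 3.0, 4.0]
  mu = sum(x) / n
  var = sum((x - mu)**2) / n

  ! Forward: normalize to zero mean and unit variance (gamma = 1, beta = 0 assumed)
  y = (x - mu) / sqrt(var + eps)

  ! Backward as the inverse operation: undo the normalization to recover the input
  x_back = y * sqrt(var + eps) + mu

  if (all(abs(x_back - x) < 1e-6)) then
    print '(a)', 'round-trip recovered the input'
  else
    print '(a)', 'round-trip failed'
  end if
end program batchnorm_roundtrip_sketch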

@milancurcic
Member

@Spnetic-5 I just saw your message on Discourse, no problem; we'll proceed with work on this PR as usual.

@milancurcic
Member

See, for example, a program that tests the forward and backward passes of the maxpool2d layer using known inputs and expected outputs:

https://github.com/modern-fortran/neural-fortran/blob/main/test/test_maxpool2d_layer.f90

We'd use the same approach to test a batchnorm layer.
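As a sketch of what such a test could look like, the program below checks a small known input against the expected properties of the normalized output (zero mean and unit variance per feature). It computes the normalization inline rather than through the PR's batchnorm_layer, whose final interface is still being worked out, so none of the names below come from the library:

program test_batchnorm_sketch
  implicit none
  integer, parameter :: num_features = 2, batch_size = 4
  real, parameter :: eps = 1e-5
  real :: input(num_features, batch_size), output(num_features, batch_size)
  real :: mu(num_features), var(num_features)
  integer :: j
  logical :: ok = .true.

  ! Small, known input: feature 1 is [1, 3, 5, 7], feature 2 is [2, 4, 6, 8]
  input = reshape([1.0, 2.0, 3.0, 4.0, 5.0, 6.0, 7.0, 8.0], [num_features, batch_size])

  ! Per-feature batch statistics over the batch dimension
  mu = sum(input, dim=2) / batch_size
  var = sum((input - spread(mu, dim=2, ncopies=batch_size))**2, dim=2) / batch_size

  do j = 1, batch_size
    output(:, j) = (input(:, j) - mu) / sqrt(var + eps)
  end do

  ! Expected output properties: zero mean and unit variance per feature
  if (any(abs(sum(output, dim=2) / batch_size) > 1e-5)) ok = .false.
  if (any(abs(sum(output**2, dim=2) / batch_size - 1) > 1e-3)) ok = .false.

  if (ok) then
    print '(a)', 'test_batchnorm_sketch: all checks passed.'
  else
    print '(a)', 'test_batchnorm_sketch: one or more checks failed.'
  end if
end program test_batchnorm_sketch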

@Spnetic-5
Collaborator Author

Hello, @milancurcic. Sorry for the lack of activity over the past few days; this was my final week of internship in Canada, and I'll be returning to India on Monday.

I added a test module for the batch norm layer; however, it has some errors, and I believe I will need your assistance with this.

@milancurcic
Member

No worries at all, thanks for all the work. I'll review it tomorrow.

Comment on lines +17 to +18
allocate(res % input(num_features, num_features))
allocate(res % output(num_features, num_features))
Member

I don't think the shape for these should be (num_features, num_features), but rather (batch_size, num_features). The batch_size also won't be known until the first forward pass, so we should defer the allocation until then. In the forward pass, we could have a simple allocated check: if they haven't been allocated yet, allocate them to the shape of the input.

Member

Sorry, I meant (num_features, batch_size).
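For what it's worth, here is a self-contained sketch of that deferred-allocation pattern, using a stripped-down stand-in type rather than the actual batchnorm_layer from this PR; mold= picks up the (num_features, batch_size) shape from the first input:

module deferred_alloc_sketch
  implicit none

  type :: bn_sketch
    real, allocatable :: input(:,:)   ! (num_features, batch_size)
    real, allocatable :: output(:,:)
  contains
    procedure :: forward
  end type bn_sketch

contains

  subroutine forward(self, input)
    class(bn_sketch), intent(in out) :: self
    real, intent(in) :: input(:,:)

    ! Defer allocation until the batch size is known from the first input
    if (.not. allocated(self % input)) then
      allocate(self % input, mold=input)
      allocate(self % output, mold=input)
    end if

    self % input = input
    ! ... normalization would follow here ...
  end subroutine forward

end module deferred_alloc_sketch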

Comment on lines +51 to +52
normalized_input => (input - reshape(self % running_mean, shape(input, 1))) &
/ sqrt(reshape(self % running_var, shape(input, 1)) + self % epsilon) &
Member

running_mean and running_var are not yet updated anywhere, only initialized.
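For reference, the textbook exponential-moving-average update looks roughly like the sketch below; the momentum value and the variable names are placeholders for illustration, not anything defined in this PR:

program running_stats_sketch
  implicit none
  integer, parameter :: num_features = 2, batch_size = 3
  real, parameter :: momentum = 0.1
  real :: input(num_features, batch_size)
  real :: mu(num_features), var(num_features)
  real :: running_mean(num_features), running_var(num_features)

  input = reshape([1., 2., 3., 4., 5., 6.], [num_features, batch_size])
  running_mean = 0.0
  running_var = 1.0

  ! Per-feature batch statistics over the batch dimension
  mu = sum(input, dim=2) / batch_size
  var = sum((input - spread(mu, dim=2, ncopies=batch_size))**2, dim=2) / batch_size

  ! Exponential moving average update of the running statistics (textbook rule)
  running_mean = (1 - momentum) * running_mean + momentum * mu
  running_var = (1 - momentum) * running_var + momentum * var

  print *, 'running_mean =', running_mean
  print *, 'running_var  =', running_var
end program running_stats_sketch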

sample_input = 1.0
gradient = 2.0

!TODO run forward and backward passes directly on the batchnorm_layer instance
Member

I'll add a simple test directly on the batchnorm_layer instance rather than the high-level layer_type instance so you get the idea of how it will work.
