Dice function with batch_size>1 #16
Comments
It seems it is solved by re-writing `top[0].data[0] = np.sum(dice) / float(bottom[0].data.shape[0])`. I still have some problems... I will investigate and let you know.
Have you tried changing the learning rate when you change the batch size? This conversation says that this paper indicates the learning rate should be changed: if you are increasing the batch size, the learning rate should be decreased. Their relationship indicates that if you double the batch size, the learning rate should decrease to ~0.7 of the original. This likely won't fix everything you are describing, but it might help. I also thought there was another comment in the V-Net issues indicating that V-Net worked with batch size > 1 out of the box. That issue says the Dice would be reported as the sum over the 2 volumes in the batch; if I'm not mistaken, this could result in Dice scores up to 2.0, but it shouldn't make a difference: you can just divide by 2 to get the average Dice of the 2 volumes.
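The ~0.7 factor for a doubled batch corresponds to scaling the learning rate by the inverse square root of the batch-size ratio (1/sqrt(2) ≈ 0.71). A minimal sketch of that rule, assuming this is the relationship meant above; the function name is hypothetical:

```python
import math

def scaled_lr(base_lr, base_batch_size, new_batch_size):
    """Scale the learning rate by 1/sqrt(batch-size ratio).

    Doubling the batch size gives a factor of 1/sqrt(2) ~= 0.71,
    matching the ~0.7 mentioned above.
    """
    return base_lr * math.sqrt(base_batch_size / float(new_batch_size))

# e.g. base_lr 0.01 tuned at batch_size 1, moving to batch_size 10:
print(scaled_lr(0.01, 1, 10))  # ~0.0032
```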
@getta, thanks a lot for the suggestion and the links. I tried the 0.7 decrease and it actually helps: with this trick, the results with batch_size 1 and batch_size 10 are now comparable, nice! However, I was hoping that a larger batch size would act as a data-balancing trick, since I have very unbalanced data (90% class A, 10% class B), but I still don't get better results using batch_size > 1... Regarding the V-Net Dice out of the box: it is true that it can work, but with batch_size > 1 the Dice won't be normalized, so it cannot be used together with other normalized loss functions. This is the main reason it is a good idea to normalize it by the batch_size, so the Dice loss stays ≤ 1.
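A minimal sketch of that batch-normalized forward pass, written as a standalone NumPy function rather than the repository's actual Caffe layer (the function name and epsilon term are assumptions):

```python
import numpy as np

def batch_dice_forward(pred, gt, eps=1e-8):
    """Per-volume soft Dice averaged over the batch.

    pred, gt: arrays of shape (batch, ...) holding predicted
    probabilities and ground-truth labels. Averaging (instead of
    summing) keeps the score in [0, 1] for any batch_size.
    """
    batch_size = pred.shape[0]
    dice = np.zeros(batch_size)
    for i in range(batch_size):
        p = pred[i].ravel()
        g = gt[i].ravel()
        # V-Net style soft Dice with squared terms in the denominator
        dice[i] = 2.0 * np.dot(p, g) / (np.sum(p**2) + np.sum(g**2) + eps)
    return np.sum(dice) / float(batch_size)

# two volumes of 4 voxels each; result is a single score <= 1
pred = np.array([[0.9, 0.8, 0.1, 0.0], [0.2, 0.7, 0.9, 0.1]])
gt = np.array([[1.0, 1.0, 0.0, 0.0], [0.0, 1.0, 1.0, 0.0]])
print(batch_dice_forward(pred, gt))
```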
Glad it helped!
Hi,
I tried to correct the Dice function to work for batch_size > 1, but didn't have much success, in particular with the backward function...
Any chance you could consider updating your implementation for batch_size > 1?
That can really help :)
Thanks
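For reference, a minimal sketch of how the backward pass could handle batch_size > 1 under the V-Net soft-Dice formulation (squared terms in the denominator): compute each volume's gradient exactly as in the single-volume case, then scale by 1/batch_size so it matches a forward pass that averages the per-volume Dice scores. Function names and the epsilon term are assumptions, not the repository's code:

```python
import numpy as np

def dice_gradient(p, g, eps=1e-8):
    """Gradient of one volume's soft Dice w.r.t. its predictions p:
    d Dice / d p_j = 2 * (g_j * U - 2 * p_j * I) / U**2,
    with I = sum(p * g) and U = sum(p**2) + sum(g**2)."""
    union = np.sum(p**2) + np.sum(g**2) + eps
    intersect = np.sum(p * g)
    return 2.0 * (g * union - 2.0 * p * intersect) / union**2

def batch_dice_backward(pred, gt):
    """Per-volume gradients scaled by 1/batch_size, matching a
    forward pass that averages the per-volume Dice scores."""
    batch_size = pred.shape[0]
    grads = np.zeros_like(pred)
    for i in range(batch_size):
        # negative sign: maximizing Dice == minimizing -Dice
        grads[i] = -dice_gradient(pred[i], gt[i]) / float(batch_size)
    return grads

pred = np.array([[0.9, 0.8, 0.1, 0.0], [0.2, 0.7, 0.9, 0.1]])
gt = np.array([[1.0, 1.0, 0.0, 0.0], [0.0, 1.0, 1.0, 0.0]])
print(batch_dice_backward(pred, gt))  # gradients of shape (2, 4)
```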