Adding optional parameter activation_dtype to models #327

phborba · 2020-04-23T19:17:44Z

As I mentioned on issue #323 , when using mixed_precision, the training does not converge when softmax conversion is also converted. So, to overcome this problem, I've create a parameter activation_dtype that allows user to set the data type of the activations.

I've modified all models and also the EfficientNet ones (I'll pull request the changes to the other repository).

I've tested mixed precision training with the proposed changes and all went well. I've also tested training the normal way and my changes did not break anything, also all unittests passed.

I've also updated the requirements to include noisy students weights to segmentation_models using EfficientNet.

Please feel free to suggest any changes to this pull request, I just want to contribute to this great project.

phborba added 7 commits April 18, 2020 17:41

update on requirements to get noisy students weights

ba9b183

activation_dtype on inception_resnet and inception_v3

766be51

fpn and linknet with activation_dtype

a43d081

unet and pspnet

e52c86d

setting restrictions only to softmax activations

cda0ec1

removing parameter where it does not make sense

f342915

bug fix

f0b0727

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Adding optional parameter activation_dtype to models #327

Adding optional parameter activation_dtype to models #327

phborba commented Apr 23, 2020

Adding optional parameter activation_dtype to models #327

Are you sure you want to change the base?

Adding optional parameter activation_dtype to models #327

Conversation

phborba commented Apr 23, 2020