Missing argument error in trainer.py when setting lr_scheduler #3

Open · bc-bytes opened this issue Apr 27, 2023 · 3 comments
@bc-bytes

In trainer.py (line 101) there is a missing argument. Here's how I define the optimizer and lr_scheduler in train.py:

import torch
from torch.optim import lr_scheduler

optimizer = torch.optim.Adam(params, lr=0.001, weight_decay=0.0001)
scheduler = lr_scheduler.ReduceLROnPlateau(
    optimizer, factor=0.1, patience=6, verbose=True, min_lr=0.000001
)
trainer = Trainer(
    model,
    criterion=BCEDiceLoss(),
    optimizer=optimizer,
    lr_scheduler=scheduler,
    epochs=500,
)

That throws the following error:

File "/home/seg/backbones_unet/utils/trainer.py", line 127, in _train_one_epoch
    self.lr_scheduler.step()
TypeError: step() missing 1 required positional argument: 'metrics'

If I then change line 101 to self.lr_scheduler.step(loss), that seems to fix the error. However, when I start training I get this:

Training Model on 500 epochs:   0%|                            | 0/500 [00:00<?, ?it/s
Epoch 00032: reducing learning rate of group 0 to 1.0000e-04.  | 31/800 [00:03<00:54, 14.03 training-batch/s, loss=1.1]
Epoch 00054: reducing learning rate of group 0 to 1.0000e-05.  | 53/800 [00:04<00:51, 14.38 training-batch/s, loss=1.09]
Epoch 00062: reducing learning rate of group 0 to 1.0000e-06.  | 61/800 [00:05<00:54, 13.45 training-batch/s, loss=1.06]
^Epoch 1:  13%|██████████████████                              | 104/800 [00:08<00:58, 11.84 training-batch/s, loss=1.05]

I haven't seen that before when training models with code from other repos. If that is normal, then all is OK; I just wanted to report the missing argument error in trainer.py.
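
For reference, here is a minimal, self-contained sketch of the conventional ReduceLROnPlateau pattern (the tiny model and synthetic data below are placeholders, not code from this repo): the optimizer steps on every batch, while the scheduler steps once per epoch with a monitored metric. Stepping the scheduler on every batch, which the log above suggests is happening, drives the learning rate down to min_lr within the first epoch.

import torch
from torch import nn
from torch.optim.lr_scheduler import ReduceLROnPlateau

model = nn.Linear(4, 1)        # placeholder model
criterion = nn.MSELoss()       # placeholder loss
optimizer = torch.optim.Adam(model.parameters(), lr=0.001, weight_decay=0.0001)
scheduler = ReduceLROnPlateau(optimizer, factor=0.1, patience=6, min_lr=0.000001)

x, y = torch.randn(32, 4), torch.randn(32, 1)    # synthetic data
for epoch in range(20):
    for xb, yb in [(x, y)]:                      # stand-in for a real DataLoader
        optimizer.zero_grad()
        loss = criterion(model(xb), yb)
        loss.backward()
        optimizer.step()                         # optimizer: every batch
    val_loss = criterion(model(x), y).item()     # monitored metric, per epoch
    scheduler.step(val_loss)                     # scheduler: once per epoch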

@bc-bytes (Author)

Just to add to this, even with the lr_scheduler set, the learning rate never seems to change during training.

@mberkay0 (Owner)

Hello @bc-bytes, firstly, thank you for bringing up the issue. The library does not currently support the ReduceLROnPlateau and CosineAnnealingWarmRestarts lr schedulers; all other schedulers are supported, so receiving this error message is expected. If you use either of these two schedulers, the learning rate will not change. I intend to add support for them soon, making all lr scheduler classes available. Until then, you may use any scheduler except these two (see the workaround sketched after the list below).

Supported Schedulers:

  • LambdaLR
  • MultiplicativeLR
  • StepLR
  • MultiStepLR
  • ConstantLR
  • LinearLR
  • ExponentialLR
  • PolynomialLR
  • CosineAnnealingLR
  • ChainedScheduler
  • SequentialLR
  • CyclicLR
  • OneCycleLR
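
A minimal sketch of one possible workaround, assuming the trainer tracks an epoch-level loss it can hand to the scheduler (the step_scheduler helper and epoch_loss name are hypothetical, not part of backbones_unet):

from torch.optim.lr_scheduler import ReduceLROnPlateau

def step_scheduler(scheduler, epoch_loss):
    """Step any scheduler, passing a metric only to those that require one."""
    if scheduler is None:
        return
    if isinstance(scheduler, ReduceLROnPlateau):
        scheduler.step(epoch_loss)  # plateau schedulers need the monitored metric
    else:
        scheduler.step()            # the schedulers listed above take no argument

Calling a helper like this once at the end of each epoch, rather than inside the batch loop, would avoid both the TypeError and the rapid learning-rate decay shown in the log above.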

@bc-bytes (Author)

Many thanks for the suggestions. However, I am currently producing a set of baseline results from other repositories, and I trained all of those models with ReduceLROnPlateau, so changing the scheduler type would make the new results incomparable to my previous ones.
