When `allennlp train` is used to fine-tune a pretrained model (`model A`) via `from_archive(path_to_A)`, the fine-tuned model (`model B`) is saved with a config that still contains `from_archive`. This means that if you then try to fine-tune `model B`, you need the original `model A` at the exact `path_to_A` in addition to `model B` itself. In the normal use case, this will fail if the user does not have access to the original `model A`. On Beaker, depending on how the code is set up, if the path to the pretrained model stays the same across `experiment A -> B` and `experiment B -> C`, it causes a `maximum recursion depth` error.
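For illustration, a minimal sketch of the kind of training config that hits this (the `from_archive` model type with an `archive_file` parameter is the usual way to wrap a pretrained archive in AllenNLP; the reader, data paths, and archive path below are hypothetical):

```jsonnet
// experiment_B.jsonnet -- fine-tunes pretrained model A (paths are hypothetical)
{
  "dataset_reader": { "type": "my_reader" },   // assumed reader for the task
  "train_data_path": "data/train_B.jsonl",
  "model": {
    "type": "from_archive",
    // model B's saved config keeps this reference to model A's archive
    "archive_file": "path_to_A/model.tar.gz"
  },
  "trainer": { "num_epochs": 3, "optimizer": "adam" }
}
```

When `model B` is archived, this same `from_archive` block is written into B's saved config, so loading B later re-resolves the archive path; if `experiment B -> C` reuses the same path for its pretrained model, the loader presumably keeps following `from_archive` references back into itself, which would explain the recursion depth error.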
A potential solution is to store the original configuration when saving a fine-tuned model (i.e., in the `from_archive` case).