
Does it make sense to change the # hidden layers of a pretrained transformer? #12264

Hi @bhartm3, no, it doesn't make sense to do that. Once the model has been pretrained, its architecture (including the number of hidden layers) is fixed by the checkpoint.
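For concreteness, here is a minimal sketch of why the layer count is effectively fixed. It assumes the Hugging Face `transformers` library and the `roberta-base` checkpoint, neither of which is mentioned in this thread; they stand in for whatever pretrained transformer is being used.

```python
# Minimal sketch (assumes the Hugging Face `transformers` library and the
# `roberta-base` checkpoint; both are illustrative stand-ins, not taken from
# this thread).
from transformers import AutoConfig, AutoModel

# The number of hidden layers is recorded in the pretrained checkpoint's
# config and mirrored exactly by its weights.
config = AutoConfig.from_pretrained("roberta-base")
print(config.num_hidden_layers)  # 12 for roberta-base

# Overriding the value at load time only changes the config object; the
# checkpoint still ships weights for exactly 12 layers, so the extra layers'
# weights are simply discarded (transformers warns about unused weights).
truncated = AutoModel.from_pretrained("roberta-base", num_hidden_layers=6)
print(truncated.config.num_hidden_layers)  # 6, but only layers 0-5 keep pretrained weights
```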

Answer selected by bhartm3
Labels
feat / training (Feature: Training utils, Example, Corpus and converters)
feat / transformer (Feature: Transformer)
2 participants