This project describes the methods of our arXiv paper "Effective Domain Knowledge Transfer with Soft Fine-tuning".
The paper makes one simple proposal: instead of following the traditional fine-tuning schedule, learning together with the source domain may improve generalization.
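As a rough illustration of what "learning with the source domain" could look like, the sketch below adds a weighted source-domain loss to the target-domain loss. This is only an assumption about the general shape of such an objective, not the paper's actual formulation. The names `cross_entropy`, `joint_loss`, and `source_weight` are hypothetical and chosen for this example.

```python
import math


def cross_entropy(logits, label):
    """Softmax cross-entropy for a single example (logits: list of floats)."""
    # Subtract the max logit before exponentiating for numerical stability.
    m = max(logits)
    log_sum_exp = m + math.log(sum(math.exp(z - m) for z in logits))
    return log_sum_exp - logits[label]


def joint_loss(target_logits, target_label,
               source_logits, source_label,
               source_weight=0.5):
    """Target loss plus a weighted source loss.

    Assumption for illustration only: with source_weight = 0 this reduces
    to plain fine-tuning on the target domain alone.
    """
    target_term = cross_entropy(target_logits, target_label)
    source_term = cross_entropy(source_logits, source_label)
    return target_term + source_weight * source_term


if __name__ == "__main__":
    # One target example and one source example with made-up logits.
    loss = joint_loss([2.0, 0.5], 0, [0.1, 1.2, -0.3], 1, source_weight=0.5)
    print(f"joint loss: {loss:.4f}")
```

Setting `source_weight` to zero recovers ordinary fine-tuning, so the weight controls how strongly the source domain keeps influencing training in this sketch.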
If you are interested in this idea, you are welcome to discuss it with me.