
The reproduction result is not good on the Overall indicator. #10

Open
TracyYannn opened this issue Jun 7, 2024 · 3 comments
@TracyYannn

I cannot reproduce the reported results on the Overall metric. I ran the code on a V100; here are my parameter settings and experimental results. What could be the reason, and how should I reproduce the results correctly? Thank you!
```shell
python main.py --token_level word-level \
    --model_type roberta \
    --model_dir dir_base \
    --task mixatis \
    --data_dir data \
    --attention_mode label \
    --do_train \
    --do_eval \
    --num_train_epochs 100 \
    --intent_loss_coef 0.5 \
    --learning_rate 1e-5 \
    --train_batch_size 32 \
    --num_intent_detection \
    --use_crf
```

```shell
python main.py --token_level word-level \
    --model_type roberta \
    --model_dir misca \
    --task mixatis \
    --data_dir data \
    --attention_mode label \
    --do_train \
    --do_eval \
    --num_train_epochs 100 \
    --intent_loss_coef 0.5 \
    --learning_rate 1e-5 \
    --num_intent_detection \
    --use_crf \
    --base_model dir_base \
    --intent_slot_attn_type coattention
```

[screenshot: not_good_overall]

@BillKiller

I cannot reproduce the performance either. I hope the authors can provide more detailed information. Same issue here.

@thinhphp
Collaborator

We have checked and updated the instructions with more detail. In general, for the model with a PLM, after obtaining the “base” model, we load it and freeze the PLM encoder (simply add .detach() after the encoder output). The final stage is fine-tuning the full model; remember to perform a grid search to make sure it achieves the best performance. In our experiments, we use this checkpoint for MixATIS and this checkpoint for MixSNIPS as the base model. In the case of MixATIS, you could try a learning rate of 3e-5 (while frozen) and 3e-6 (after unfreezing).
Hope this helps. Should you have any further questions, do not hesitate to contact me at [email protected], where I check the inbox more often.
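The "freeze via .detach()" trick described above can be sketched in PyTorch as follows. This is a minimal, hypothetical illustration, not the repo's actual code: the class and parameter names (`Model`, `freeze_plm`, `head`) are invented for the example, and a plain `nn.Linear` stands in for the PLM encoder.

```python
import torch
import torch.nn as nn

class Model(nn.Module):
    """Toy model: a (stand-in) PLM encoder followed by a task head."""

    def __init__(self, encoder: nn.Module, hidden: int, n_labels: int, freeze_plm: bool):
        super().__init__()
        self.encoder = encoder      # in the real setup: the pretrained LM
        self.freeze_plm = freeze_plm
        self.head = nn.Linear(hidden, n_labels)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        h = self.encoder(x)
        if self.freeze_plm:
            # Detaching cuts the autograd graph here: no gradients flow back
            # into the encoder, so only the head (and other non-PLM modules)
            # receive updates during this stage.
            h = h.detach()
        return self.head(h)
```

With `freeze_plm=True`, a backward pass leaves `encoder` parameters without gradients while the head still trains; flipping the flag to `False` for the final stage fine-tunes the full model, matching the two-stage recipe described in the comment.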

@TracyYannn
Author

> We have checked and updated the instructions with more detail. In general, for the model with a PLM, after obtaining the “base” model, we load it and freeze the PLM encoder (simply add .detach() after the encoder output). The final stage is fine-tuning the full model; remember to perform a grid search to make sure it achieves the best performance. In our experiments, we use this checkpoint for MixATIS and this checkpoint for MixSNIPS as the base model. In the case of MixATIS, you could try a learning rate of 3e-5 (while frozen) and 3e-6 (after unfreezing). Hope this helps. Should you have any further questions, do not hesitate to contact me at [email protected], where I check the inbox more often.

Thank you for the update. May I ask which graphics card the experiment was conducted on? Thanks, and have a great day!
