This repo contains preliminary code of the AAAI2020 paper named "Learning from Easy to Complex: Adaptive Multi-curricula Learning for Neural Dialogue Generation".
This codebase is built upon the ParlAI project.
Check parlai/agents/adaptive_learning
for experimental models implementation.
RL-based multi-curriculum learning lies in parlai/tasks/adaptive_learning
.
Running scripts can be found in projects/adaptive_learning
.
- Python3
- Pytorch 1.2 or newer
Dependencies of the core modules are listed in requirement.txt.
The datasets used in the paper can be download from here.
Put it in data/
and unzip it using tar -xzvf AdaptiveLearning.tar.gz
git clone [email protected]:hengyicai/Adaptive_Multi-curricula_Learning_for_Dialog.git ~/Adaptive_Multi-curricula_Learning_for_Dialog
cd ~/Adaptive_Multi-curricula_Learning_for_Dialog; python setup.py develop
echo "export PARLAI_HOME=~/Adaptive_Multi-curricula_Learning_for_Dialog" >> ~/.bashrc; source ~/.bashrc
cd ~/Adaptive_Multi-curricula_Learning_for_Dialog
bash projects/adaptive_learning/shell/run.sh
The last line of projects/adaptive_learning/shell/run.sh
specifies preliminary arguments for the training:
# train_model MODEL_NAME TASK_NAME SUB_TASK T VALIDATION_EVERY_N_SECS VALIDATION_EVERY_N_EPOCHS NUM_EPOCHS
train_model seq2seq personachat_h3 combine 11000 -1 0.2 30
This run will apply the multi-curriculum learning framework on Seq2seq
model using dataset PersonaChat
. The duration of curriculum learning is 11000
steps.
Applying the single specificity
curriculum dialogue learning on model CVAE
using dataset DailyDialog
, with curriculum learning duration 8000
:
train_model cvae daily_dialog specificity 8000 -1 0.2 30
See projects/adaptive_learning/shell/run.sh
for details.
If you find our code/models or ideas useful in your research, please consider citing the paper:
@InProceedings{Hengyi_2020_AAAI,
author={Hengyi Cai and Hongshen Chen and Cheng Zhang and Yonghao Song and Xiaofang Zhao and Yangxi Li and Dongsheng Duan and Dawei Yin},
title={Learning from Easy to Complex: Adaptive Multi-curricula Learning for Neural Dialogue Generation},
booktitle = {Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence (AAAI)},
year = {2020}
}