Thank you for your interest in our work! This is the original implementation of "Continual Dialogue State Tracking via Reason-of-Select Distillation", accepted to Findings of ACL 2024.
```bash
conda create -n CDST python=3.8
conda activate CDST
pip install -r requirements.txt
```
The preprocessed SGD dataset is provided in the "/data" folder. You can then employ different teacher models to generate RoS reasoning.
- Get ChatGPT's rationales:

  ```bash
  ./scripts/run_ChatGPT_reasoning.sh
  ```

- Get LLaMA-2-70B's rationales:

  ```bash
  ./scripts/run_LLaMA2_70B_reasoning.sh
  ```
This step generates the teachers' RoS rationales for the training data.
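For illustration, the snippet below sketches how a ChatGPT-style teacher could be prompted for a single RoS rationale. The prompt template, model name, and the `get_ros_rationale` helper are assumptions made for the sketch; the actual prompting logic lives in the scripts above.

```python
# Sketch only: the prompt template, model choice, and helper below are illustrative
# assumptions; the real prompting logic is in scripts/run_ChatGPT_reasoning.sh.
from typing import List
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment


def get_ros_rationale(dialogue: str, slot: str, candidates: List[str]) -> str:
    """Ask the teacher to reason over the candidate values for one slot."""
    prompt = (
        "Given the dialogue and the candidate values for the slot, explain step by step "
        "which value should be selected.\n"
        f"Dialogue: {dialogue}\n"
        f"Slot: {slot}\n"
        f"Candidates: {', '.join(candidates)}\n"
        "Reasoning:"
    )
    response = client.chat.completions.create(
        model="gpt-3.5-turbo",
        messages=[{"role": "user", "content": prompt}],
        temperature=0.0,
    )
    return response.choices[0].message.content
```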
To ensure faithful teaching, we exploit semantic similarity to select the optimal reasoning for each sample.
```bash
./scripts/run_contrastive_selection.sh
```
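Conceptually, this step compares the candidate rationales against a reference text and keeps the most similar one. The sketch below assumes a sentence-transformers encoder and cosine similarity; the encoder name and the choice of reference text are illustrative, and the exact contrastive criterion is implemented in the script above.

```python
# Sketch only: encoder choice and reference text are assumptions; the exact
# contrastive selection criterion is implemented in run_contrastive_selection.sh.
from sentence_transformers import SentenceTransformer, util

encoder = SentenceTransformer("all-MiniLM-L6-v2")


def select_rationale(candidates, reference):
    """Return the candidate rationale most semantically similar to the reference."""
    cand_emb = encoder.encode(candidates, convert_to_tensor=True)
    ref_emb = encoder.encode(reference, convert_to_tensor=True)
    scores = util.cos_sim(ref_emb, cand_emb)[0]  # one score per candidate
    return candidates[int(scores.argmax())]


# e.g., pick between the ChatGPT and LLaMA-2-70B rationales for one sample:
# best = select_rationale([chatgpt_rationale, llama_rationale], dialogue_with_gold_state)
```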
Finally, the selected reasoning data is added to the original training dataset to construct the new reasoning-augmented dataset for model fine-tuning.
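As a rough illustration, the merging step could look like the snippet below; the JSON field names and the `[STATE]` separator are assumptions for the sketch, not the exact format produced by our preprocessing.

```python
# Sketch only: field names and the "[STATE]" separator are illustrative assumptions.
import json


def build_reasoning_dataset(train_path, rationale_path, out_path):
    """Prepend each selected rationale to the corresponding training target."""
    with open(train_path) as f_train, open(rationale_path) as f_rat, open(out_path, "w") as f_out:
        for train_line, rat_line in zip(f_train, f_rat):
            example = json.loads(train_line)
            rationale = json.loads(rat_line)["rationale"]
            example["output"] = f"{rationale} [STATE] {example['output']}"
            f_out.write(json.dumps(example) + "\n")
```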
We conducted experiments on four different student models:
```bash
./scripts/run_train_LLaMA7B.sh
./scripts/run_train_FlanT5XL.sh
./scripts/run_train_T5base.sh
./scripts/run_train_T5small.sh
```
For LLaMA-7B and FlanT5-XL, we use LoRA to speed up fine-tuning. At the end of training, the student's fine-tuned weights are stored in $checkpoint_files. We provide all the fine-tuned weights in the Checkpoint_files folder for reproducibility.
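For reference, a typical LoRA setup with the PEFT library looks like the sketch below; the rank, alpha, dropout, target modules, and model path are illustrative assumptions, and the actual configuration is defined in the training scripts.

```python
# Sketch only: hyperparameters, target modules, and the model path are assumptions;
# the actual LoRA configuration is set in the training scripts.
from transformers import AutoModelForCausalLM
from peft import LoraConfig, get_peft_model

base_model = AutoModelForCausalLM.from_pretrained("path/to/llama-7b")  # placeholder path

lora_config = LoraConfig(
    r=16,                                  # low-rank dimension
    lora_alpha=32,                         # scaling factor
    lora_dropout=0.05,
    target_modules=["q_proj", "v_proj"],   # attention projections in LLaMA-style models
    task_type="CAUSAL_LM",
)

model = get_peft_model(base_model, lora_config)
model.print_trainable_parameters()  # only the small adapter matrices are trainable
```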
We use three metrics to measure the continual learning performance of our model: average JGA, FWT, and BWT. (You can directly load the weights we provide from the checkpoint folder and run inference.)
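For a quick sanity check, a provided T5-style checkpoint can be loaded as sketched below; the checkpoint path and input format are illustrative assumptions, and the generation scripts below run the full evaluation pipeline.

```python
# Sketch only: the checkpoint path and prompt format are illustrative assumptions;
# use the generation scripts for the full evaluation pipeline.
from transformers import AutoTokenizer, AutoModelForSeq2SeqLM

ckpt = "checkpoint/t5-small"  # placeholder path to a provided checkpoint
tokenizer = AutoTokenizer.from_pretrained(ckpt)
model = AutoModelForSeq2SeqLM.from_pretrained(ckpt)

inputs = tokenizer("track the dialogue state: <dialogue history>", return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=128)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```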
```bash
./scripts/run_generate_avgJGA.sh
./scripts/run_generate_FWT.sh
./scripts/run_generate_BWT.sh
```
After inference, the generated prediction results will be stored in the output folder.
Then we can calculate these metrics by running:
```bash
./scripts/eval_avgJGA.sh
./scripts/eval_FWT.sh
./scripts/eval_BWT.sh
```
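For intuition, the sketch below computes the three metrics from a task-accuracy matrix using the standard continual-learning definitions (without a random-initialization baseline term for FWT); the exact computation used for our results is in the evaluation scripts above.

```python
# Sketch only: standard continual-learning definitions over acc[i][j] = JGA on task j
# after training on task i (0-indexed, T tasks); the exact computation is in the eval scripts.
import numpy as np


def continual_metrics(acc: np.ndarray):
    T = acc.shape[0]
    avg_jga = acc[T - 1].mean()                                        # average JGA after the last task
    fwt = np.mean([acc[i - 1, i] for i in range(1, T)])                # forward transfer
    bwt = np.mean([acc[T - 1, i] - acc[i, i] for i in range(T - 1)])   # backward transfer
    return avg_jga, fwt, bwt
```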