Paper summary

Task

Answer selection is the task of choosing, from a set of candidate answers, the one that best satisfies the user's information need.

Model

Architecture of our proposed intent-calibrated self-training (ICAST) framework. The dashed and solid lines represent the workflows of the teacher model and the student model, respectively. The blue and green solid lines represent the intent-aware and context-aware workflows, respectively. The intent-calibrated pseudo labeling module estimates the intent confidence gain to select samples with high-quality intent labels, and calibrates the answer labels by incorporating the selected intent labels as an extra input for answer selection.
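
The selection step described in the caption can be illustrated with a minimal sketch. It is not the implementation in this repository: it assumes that the intent confidence gain is the increase in the teacher's answer confidence when the predicted intent label is supplied as an extra input, and the names `Candidate`, `select_by_confidence_gain` and `min_gain` are hypothetical. The paper's actual estimator may differ.

```python
from dataclasses import dataclass


@dataclass
class Candidate:
    """One unlabeled sample scored by the teacher (all fields are hypothetical)."""
    sample_id: str
    intent_label: str            # intent predicted by the teacher
    conf_without_intent: float   # answer confidence from the context only
    conf_with_intent: float      # answer confidence with the intent as extra input


def select_by_confidence_gain(candidates, min_gain=0.0):
    """Keep samples whose predicted intent raises the answer confidence.

    Assumption: gain = conf_with_intent - conf_without_intent.
    Returns (sample_id, intent_label, intent-aware confidence) tuples.
    """
    selected = []
    for c in candidates:
        gain = c.conf_with_intent - c.conf_without_intent
        if gain > min_gain:
            # The intent-aware confidence stands in for the calibrated answer label.
            selected.append((c.sample_id, c.intent_label, c.conf_with_intent))
    return selected
```

Samples whose intent label does not help are simply dropped in this sketch; how the actual framework treats them is not stated here.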

Running

Requirements

Python == 3.9.7
torch == 1.11.0
apex == 0.1
scipy == 1.8.0
transformers == 4.17.0
accelerate == 0.9.0
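
Before training, it may help to confirm that the installed packages match the pinned versions. The following is a small sketch using only the standard library; the helper name `check_versions` is hypothetical, and it assumes each package is installed under the distribution name listed above.

```python
from importlib.metadata import PackageNotFoundError, version

# Versions pinned in the requirements list above.
PINNED = {
    "torch": "1.11.0",
    "apex": "0.1",
    "scipy": "1.8.0",
    "transformers": "4.17.0",
    "accelerate": "0.9.0",
}


def check_versions(pinned, get_version=version):
    """Return {package: (expected, found)} for each mismatch; found is None if missing."""
    mismatches = {}
    for name, expected in pinned.items():
        try:
            found = get_version(name)
        except PackageNotFoundError:
            found = None
        # A local build tag such as "1.11.0+cu113" is treated as matching "1.11.0".
        if found is None or found.split("+")[0] != expected:
            mismatches[name] = (expected, found)
    return mismatches


if __name__ == "__main__":
    for name, (expected, found) in check_versions(PINNED).items():
        print(f"{name}: expected {expected}, found {found}")
```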

Datasets

We use the MSDIALOG and MANTIS datasets for training and testing.
The datasets are provided as four subfolders: teacher/MSDIALOG, student/MSDIALOG, teacher/MANTIS and student/MANTIS.
After downloading the datasets, place each subfolder into the datasets folder of the matching code directory: teacher/MS-dialog, student/MS-dialog, teacher/Multi-domain-IS and student/Multi-domain-IS, respectively.
Please download the pre-trained model BERT-base-uncased into prev_trained_model/bert-base-uncased.
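
A quick way to check the layout before training is sketched below. It assumes that each code directory holds its data in a subfolder named `datasets`, which is one reading of the instructions above; the helper `missing_paths` is hypothetical. The BERT folder is left out because its parent directory is not stated.

```python
from pathlib import Path

# Code directories named above; the "datasets" subfolder inside each is an assumption.
CODE_DIRS = [
    "teacher/MS-dialog",
    "student/MS-dialog",
    "teacher/Multi-domain-IS",
    "student/Multi-domain-IS",
]


def missing_paths(repo_root):
    """Return the expected dataset folders (relative to repo_root) that do not exist."""
    root = Path(repo_root)
    expected = [root / code_dir / "datasets" for code_dir in CODE_DIRS]
    return [p.relative_to(root).as_posix() for p in expected if not p.is_dir()]


if __name__ == "__main__":
    for path in missing_paths("."):
        print(f"missing: {path}")
```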

Training and testing

Training the teacher model

## MSDIALOG dataset

cd teacher/MS-dialog
# Training
bash scripts/run_training.sh
# Testing
bash scripts/run_testing.sh

## MANTIS dataset

cd teacher/Multi-domain-IS
# Training
bash scripts/run_training.sh
# Testing
bash scripts/run_testing.sh

Training the student model

## MSDIALOG dataset

cd student/MS-dialog
# Training
bash scripts/run_training.sh
# Testing
bash scripts/run_testing.sh

## MANTIS dataset

cd student/Multi-domain-IS
# Training
bash scripts/run_training.sh
# Testing
bash scripts/run_testing.sh

To use our code, first train the teacher model and select the best teacher checkpoint on the development set. Then put that checkpoint into the folder of the pre-trained model. Next, train the student model on the labeled and unlabeled datasets and select the best student model on the development set. Finally, select the probability threshold for each experimental setting.
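
The final threshold selection step can be sketched as a simple sweep over the development set. This is an illustration only: it assumes binary relevance labels and F1 as the selection metric, neither of which is specified above, and the function names are hypothetical.

```python
def f1_at_threshold(probs, labels, threshold):
    """F1 of the positive class when predicting positive for prob >= threshold."""
    tp = fp = fn = 0
    for prob, label in zip(probs, labels):
        predicted = prob >= threshold
        if predicted and label == 1:
            tp += 1
        elif predicted and label == 0:
            fp += 1
        elif not predicted and label == 1:
            fn += 1
    denominator = 2 * tp + fp + fn
    return 2 * tp / denominator if denominator else 0.0


def select_threshold(probs, labels, candidates=None):
    """Return (threshold, f1) for the lowest candidate that maximizes F1."""
    if candidates is None:
        candidates = [i / 100 for i in range(1, 100)]
    best_threshold, best_f1 = None, -1.0
    for threshold in candidates:
        score = f1_at_threshold(probs, labels, threshold)
        if score > best_f1:  # strict comparison keeps the first best candidate
            best_threshold, best_f1 = threshold, score
    return best_threshold, best_f1
```

The sweep would be run once per experimental setting on that setting's development predictions, and the chosen threshold then applied unchanged to the test set.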

dengwentao99/ICAST
