DoubleTransfer at MEDIQA 2019:
Multi-Source Transfer Learning for Natural Language Understanding in the Medical Domain

This PyTorch package implements DoubleTransfer for the MEDIQA 2019 competition, as described in:

Yichong Xu, Xiaodong Liu, Chunyuan Li, Hoifung Poon and Jianfeng Gao
DoubleTransfer at MEDIQA 2019: Multi-Source Transfer Learning for Natural Language Understanding in the Medical Domain
The BioNLP workshop, ACL 2019.
arXiv version

Please cite the above paper if you use this code.

Results

We report results produced by this package as follows.

Task	Score(%)	Rank
Question Answering (QA)	78.0 (Accuracy), 81.91 (Precision)	1st
Medical Natural Language Inference (MedNLI)	93.8	3rd
Recognizing Question Entailment (RQE)	66.2	7th

Quickstart

Use docker:

pull docker:
> docker pull yichongx/doubletransfer_mediqa2019
run docker
> docker run -it --rm --runtime nvidia yichongx/doubletransfer_mediqa2019 bash
Please refer to the following link if you first use docker: https://docs.docker.com/

Train a DoubleTransfer Model

Download the data using links in the MEDIQA 2019 website.
Prepare MNLI data as well as pretrained BERT models. > ./download.sh
preprocess data with BERT and SciBERT vocabularies
> ./prepro.sh
train a model using train.py.
> python train.py --train_datasets mednli,rqe,mediqa,medquad --save_last --save_best --mediqa_score adjusted --mediqa_score_offset -2.0 --freeze_bert_first --batch_size 16 --max_seq_len 384 --data_dir ../data/mediqa_processed/mt_dnn_mediqa_384_v2/ --init_checkpoint /path/to/pretrained/model/ --float_medquad --external_datasets mnli --mtl_opt 0 --output_dir /output/path See example codes in run.sh
To ensemble predictions:
> python ensemble_preds.py /path/to/file1/ /path/to/file2/
All the input files will be ensembled.

Notes and Acknowledgments

The code is developed based on the original MT-DNN code: https://github.com/namisan/mt-dnn

Related: MultiTask-MRC MT-DNN

by [email protected]

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
bert		bert
config		config
data_utils		data_utils
docker		docker
module		module
mt_dnn		mt_dnn
.gitignore		.gitignore
LICENSE.txt		LICENSE.txt
README.md		README.md
download.sh		download.sh
get_mediqa_gt.py		get_mediqa_gt.py
get_mediqa_gt_processed.py		get_mediqa_gt_processed.py
prepro.sh		prepro.sh
prepro_mediqa.py		prepro_mediqa.py
run.sh		run.sh
train.py		train.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

DoubleTransfer at MEDIQA 2019:
Multi-Source Transfer Learning for Natural Language Understanding in the Medical Domain

Results

Quickstart

Use docker:

Train a DoubleTransfer Model

Notes and Acknowledgments

About

Releases

Packages

Languages

License

xycforgithub/DoubleTransfer_MEDIQA2019

Folders and files

Latest commit

History

Repository files navigation

DoubleTransfer at MEDIQA 2019: Multi-Source Transfer Learning for Natural Language Understanding in the Medical Domain

Results

Quickstart

Use docker:

Train a DoubleTransfer Model

Notes and Acknowledgments

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

DoubleTransfer at MEDIQA 2019:
Multi-Source Transfer Learning for Natural Language Understanding in the Medical Domain

Packages