Skip to content

EMNLP 2020 paper (Constrained Fact Verification for FEVER)

Notifications You must be signed in to change notification settings

murali1996/constrained-fact-verification

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Constrained Fact Verification

This repository contains the code, data and models corresponding to the EMNLP 2020 paper, Constrained Fact Verification for FEVER.

Datasets

To download the RuleTaker-CWA, RuleTaker-Skip-fact and anonymized FEVER datasets, please refer to the link.

Alternatively, to prepare these datasets from scratch, refer to code/build_datasets.

Pretrained models

To reproduce the results presented in the paper, first download the checkpoints as below,

cd checkpoints
python download_checkpoints.py
cd ..

Download the three evaluation sets, Standard FEVER, Symmetric FEVER and Anonymized FEVER from the link.

For evaluating the FEVER models, please refer to code/bert-concat. To evaluate the pretrained RuleTaker-CWA and RuleTaker-Skip-fact models, refer to code/ruletaker_pretraining.

Alternatively, if you wish to train new models or experiment further, please follow the below steps.

Training

RuleTaker pretraining

This involves two steps,

  1. Pretraining on RACE (Lai et al. 2017) (refer to code/race_pretraining)
  2. Further fine-tune on the RuleTaker dataset. (refer to code/ruletaker_pretraining).

FEVER finetuning

After the above RuleTaker pretraining, we experiment with three networks for training on FEVER. We use the fine-tuned BERT weights from previous step to initialize the encoder.

  1. BERT-concat (refer to code/bert-concat).
  2. KGAT (Liu et al. 2020) (refer to code/kgat).
  3. Transformer-XH (Zhao et al. 2020) (refer to code/transformer-xh).

Alternatively, you can use the fine-tuned BERT weights from previous step (RuleTaker pretraining) to further train on any other fact verification dataset of your choice.

Other Resources

  • For more information about the FEVER 1.0 shared task, please refer to fever.ai.
  • For more information about the original RuleTaker dataset (Clark et al. 2020), please refer to RuleTaker.
  • To experiment with other transformer-based encoders like RoBERTa, checkout huggingface.

Citation

If you find these resources helpful in your research, consider citing the paper,

@inproceedings{pratapa-etal-2020-constrained,
    title = "{C}onstrained {F}act {V}erification for {FEVER}",
    author = "Pratapa, Adithya and Jayanthi, Sai Muralidhar and Nerella, Kavya",
    booktitle = "Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)",
    month = nov,
    year = "2020",
    address = "Online",
    publisher = "Association for Computational Linguistics",
    url = "https://www.aclweb.org/anthology/2020.emnlp-main.629",
    pages = "7826--7832",
}

For any questions/issues, please feel free to create an issue.

About

EMNLP 2020 paper (Constrained Fact Verification for FEVER)

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 94.9%
  • Shell 5.1%