GitHub - morningmoni/EDE: Code for paper "Extract, Denoise and Enforce: Evaluating and Improving Concept Preservation for Text-to-Text Generation" EMNLP 2021 and "Constrained Abstractive Summarization: Preserving Factual Consistency with Constrained Generation" arXiv 2020

Code

[update] Code for Constrained Abstractive Summarization is in folder CAS/

The code is based on the seq2seq examples of huggingface transformers (3.0 <= version < 4.0)

The most important files are as follows:

DDBA.py: core functions for constrained generation, including a PyTorch implementation of DBA [1], adapted from the official MXNet implementation

finetune.py: model training

run_eval.py: model inference with or without constraints

transformers_local/generation_utils.py: modified the functions related to model decoding for enforcing constraints

transformers_local/modeling_bart.py: implemented BART+copy mechanism and other side functions

[1] "Fast Lexically Constrained Decoding with Dynamic Beam Allocation for Neural Machine Translation", NAACL 2018

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
CAS		CAS
transformers_local		transformers_local
DDBA.py		DDBA.py
README.md		README.md
callbacks.py		callbacks.py
conf.py		conf.py
finetune.py		finetune.py
lightning_base.py		lightning_base.py
my_utils.py		my_utils.py
run_eval.py		run_eval.py
sentence_splitter.py		sentence_splitter.py
utils.py		utils.py