PyTorch Implementation of Monotonic Chunkwise Attention

Requirements

PyTorch 0.4

TODOs

Soft MoChA
Hard MoChA
Linear Time Decoding
Experiment with Real-world dataset

Model figure

Linear Time Decoding

It's not clear if authors' TF implementation supports decoding in linear time. They calculate energies for whole encoder outputs instead of scanning from previously attended encoder output.

References

Colin Raffel, Minh-Thang Luong, Peter J. Liu, Ron J. Weiss and Douglas Eck. Online and Linear-Time Attention by Enforcing Monotonic Alignments (ICML 2017)
Chung-Cheng Chiu and Colin Raffel. Monotonic Chunkwise Attention (ICLR 2018)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Readme.md

Readme.md

PyTorch Implementation of Monotonic Chunkwise Attention

Requirements

TODOs

Model figure

Linear Time Decoding

References

Files

Readme.md

Latest commit

History

Readme.md

File metadata and controls

PyTorch Implementation of Monotonic Chunkwise Attention

Requirements

TODOs

Model figure

Linear Time Decoding

References