Skip to content

Latest commit

 

History

History
21 lines (16 loc) · 925 Bytes

Readme.md

File metadata and controls

21 lines (16 loc) · 925 Bytes

PyTorch Implementation of Monotonic Chunkwise Attention

Requirements

  • PyTorch 0.4

TODOs

  • Soft MoChA
  • Hard MoChA
  • Linear Time Decoding
  • Experiment with Real-world dataset

Model figure

Model figure 1

Linear Time Decoding

It's not clear if authors' TF implementation supports decoding in linear time. They calculate energies for whole encoder outputs instead of scanning from previously attended encoder output.

References