This repository has been archived by the owner on Jan 15, 2024. It is now read-only.
[Refactor] Add a switch for attention to return an unnormalized weight matrix. Move _get_attention_cell function position #1007
Open
fierceX wants to merge 5 commits into dmlc:v0.x from fierceX:tinybert
+115 -64