Commit
FLAN-T5 has bad performance when using regional compilation (#1744)
chaojun-zhang authored Feb 5, 2025
1 parent d15132c commit 9b59b75
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion optimum/habana/transformers/models/t5/modeling_t5.py
@@ -406,7 +406,7 @@ def gaudi_T5Stack_forward(

             if self.gradient_checkpointing and self.training:
                 layer_outputs = self._gradient_checkpointing_func(
-                    layer_module.forward,
+                    layer_module.__call__,
                     hidden_states,
                     extended_attention_mask,
                     position_bias,
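
Note: the change swaps `layer_module.forward` for `layer_module.__call__` as the callable handed to the gradient-checkpointing function. A plausible reading, given the commit title, is that regional compilation attaches its compiled wrapper at the module-call level, so invoking `forward` directly bypasses it. The sketch below is a toy model of that dispatch in plain Python. `ToyModule`, `_compiled_call`, and `checkpoint` are hypothetical names for illustration, not the PyTorch or optimum-habana implementation.

```python
class ToyModule:
    """Toy stand-in for a module whose call path may be replaced by a compiled wrapper."""

    def __init__(self):
        self._compiled_call = None  # hypothetical slot for the compiled region
        self.trace = []             # records which path actually ran

    def forward(self, x):
        self.trace.append("eager forward")
        return x + 1

    def compile(self):
        # Stand-in for compiling this module as one region (assumption).
        def compiled(x):
            self.trace.append("compiled region")
            return x + 1

        self._compiled_call = compiled

    def __call__(self, x):
        # The call path prefers the compiled wrapper when one is attached.
        if self._compiled_call is not None:
            return self._compiled_call(x)
        return self.forward(x)


def checkpoint(fn, *args):
    # Stand-in for a gradient-checkpointing function: it simply invokes
    # whatever callable it is given.
    return fn(*args)


layer = ToyModule()
layer.compile()

checkpoint(layer.forward, 1)   # old behavior: skips the compiled wrapper
checkpoint(layer.__call__, 1)  # new behavior: goes through the compiled wrapper

print(layer.trace)  # ['eager forward', 'compiled region']
```

Both calls return the same value. Only the path taken differs, which matches a slowdown that shows up in performance rather than in results.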