Add unit tests for shared prefix masked attention with torch.FlexAttention
#427
Job | Run time |
---|---|
6s | |
7m 58s | |
8m 4s |
torch.FlexAttention
#427
Job | Run time |
---|---|
6s | |
7m 58s | |
8m 4s |