Skip to content

Can TMA support copy part of data? like copy_if. #1601

Closed Answered by thakkarV
defei-coder asked this question in Q&A
Discussion options

You must be logged in to vote

the full shape is 6464, but I want to load 3264 elements in block0 and load 12*64 elements in block1 and others in block2.

Okay, create an affine layout to represent your tensors. you clearly have some layout in mind that is not 2D. For TMA, the gmem layout of your tensor must be affine. If it is not, then TMA cannot help you. If you have OOB elements, they will be zero filled by TMA but each CTA must have a consistent view of the smem "vector" it is going to load from gmem

Replies: 1 comment 2 replies

Comment options

You must be logged in to vote
2 replies
@defei-coder
Comment options

@thakkarV
Comment options

Answer selected by defei-coder
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants