-
Notifications
You must be signed in to change notification settings - Fork 32
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Required Arguments to Herd Function #669
Comments
Some actions items from this list that I think make sense after further discussion:
|
Adding here for "documentation": Multiple herds can exist in a segment and are placed to fit and the segment must have enough resources for that placement, the herds are guaranteed to run/exist at the same time within the spatial fabric the segment reserves. Same goes for segments within a launch, they are guaranteed to exist on the device at the same time or the launch cannot be executed. Therefore both herds and segments should have a shape. Though the shape of a segment is much more physical and must account for the structure of a target device's dataflow units (DUs), where a DU is the smallest unit of reservation/configuration. Launch has an "iteration space" for invocations of that launch and those can exist in space or time. It is up for debate whether the iteration space is a simple count or a shape. |
Here are a few points from discussion today:
From this, the favored solution would be:
Outstanding implementation question: where should validation happen?
|
I am not sure why a Segment corresponds to an iteration space? It is a reservation of physical resources needed for the herds and memory allocations it encapsulates.
This is a bit more nuanced, it could be either. But there is a physicality to herds that each worker is physically co-located in spatial resources. |
The sizes correspond to physical cores which defines the spatial iteration space. We do not try to map a virtual iteration space onto the physical iteration space.
IMHO we should only allow segment size = [1] until shape is well defined for segment. The most common thinking is that it's a spatial unroll factor, or equivalently, a resource reservation. It is generally not thought of as an iteration space like in a scf.parallel or air.launch.
Maybe, but for now herd size == physical shape == iteration space. We do let the compiler to physically place the herd in the segment.
Agreed for future work, but for now it is a physical description onto a 2d grid. |
Sounds good. Let's go with that. |
Thank you for chiming in @fifield and @jgmelber! I do think I need a bit more clarification for how to map this to actionable implementation changes. From talking with @jgmelber, a segment may be meant to map to the data unit (which, to my understanding, is limited to 1 column). That would mean that only herds of specific dimensions could fit into a segment if we pin the segment size to [1] and map the herd size to a direct physical representation. I don't have any examples currently working with segments size != [1], so that part doesn't much change how I am constructing examples. However, currently, there are quite a few (working) examples I've made that have herd size [2,2] in a singular segment ([1]) - and, if we map segment size to DUs - this wouldn't currently be correct/legal. How should I change my examples to be correct mlir-air if this is true? I could change the herd size to be 1x4 in the examples. That would remove example code showing two dimensional indexing within the herd, which is functionally okay but from an instructional standpoint would perhaps be a bit of a loss. I could change the examples to use multiple segments but then they would cease to work (and cease to function as pseudo "unit tests" for the python bindings) until there is support for multiple segments. I was also under the impression that sometimes a [2,2] herd is interpreted as [1,4] physically. If that's true, then there is also a departure between our intended API and our implementation which would need to be addressed. Any guidance would be appreciated! |
A dataflow unit (DU) is 1 column. A segment is a reservation of 1 or more DUs/.
For this example the segment should be of size [2] in order to have enough resources for a [2,2] herd to be placed.
This is a possibility we are not supporting currently after @fifield 's suggestions above, the size/shape is physical not logical. If a pass transforms the [2,2] to [1,4] explicitly then it would be mapped as such physically. A [2,2] herd is mapped to a physical 2 x 2 square grid of compute cores. |
Agreed, if @fifield means that segment size is 1 dimension for now until shape is well defined. The segment is a reservation of 1 or more "dataflow units" (DUs), which for most spatial devices we target now is 1 column of compute/memory/shim tiles. |
I mean [1], i.e. one dim of size one. DU is not defined or used anywhere in mlir-air. The segment op supports 2d anchor and size attributes like a herd, in terms of tiles, but that is the only place resources are mentioned (as currently implemented). The attributes get used in the herd placement pass. The size parameter (not attribute) was intended to be used slightly differently, as a physical unroll factor (iteration space) of the segment. That is, a [2,2] segment would be physically duplicated in a 2x2 grid and all resources must be available to start the [four copies of the] segment. We could introduce something like mlir-aie targetmodel which talks about DUs for a given device/platform/whatever but mlir-air doesn't have anything like that yet. |
There are transformations to achieve this, but it is not the default behavior. |
I think I have some better understanding of However, I am still feeling fuzzy on what implementation changes should be made to make this more clear through the API. To focus that more general question specifically on what a valid herd
|
While I was writing some of the python examples, I realized that there is some inconsistency between the number of arguments and the
sizes
parameter between segment and herd. I haven't fully explored what is allowed (syntactically) in launch because multi-launches are a bit up in the air (pun intended) in terms of implementation/meaning.Anyways, both herd and segments have a
sizes
argument. If sizes in not specified, it defaults to[1, 1]
.I think the segment usage makes sense to me. The following are valid python snippets for segment:
However, herd MUST have the arguments below, even when sizes is not specified. E.g., these are valid ways to define your herd:
(but I couldn't figure out how to get
sizes=[1]
to be valid)I was wondering if I could get some help understanding what should be valid in terms of specifying sizes for a herd, and also if it's a good idea or not to figure out how to hide the arguments when the herd is 1x1 the same way the segment works (because why would one need size or indices in that case??)
I'd appreciate some feedback on this issue.
The text was updated successfully, but these errors were encountered: