-
Notifications
You must be signed in to change notification settings - Fork 1.8k
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
[AMD] [FrontEnd] Check is_within_2gb only when buffer ops on (#5898)
Updates the AMD compiler backend to optimize the implementation of `is_within_2gb` and gate its invocation behind buffer ops being enabled. From internal testing we found that this check can be expensive and since as part of the cache key it will be executed on every kernel invocation. Therefore I believe it makes sense to add a fast path that avoids it when buffer ops related optimizations are disabled. This also may have an added benefit that if a kernel cannot benefit from the buffer operations then the kernel won't need to be recompiled because we have unnecessarily changed the keys based on the 2GB threshold and therefore caused a cache miss. --------- Co-authored-by: Nick Riasanovsky <[email protected]> Co-authored-by: Nick Riasanovsky <[email protected]>
- Loading branch information
1 parent
3f2fb59
commit 37ff43c
Showing
2 changed files
with
56 additions
and
28 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters