Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Dev][AMD] Implement LDS Async Copy for CDNA Arch #246

Merged
merged 37 commits into from
Nov 15, 2024

Conversation

LeiWang1999
Copy link
Contributor

This pull request includes several changes to update submodule references, fix installation scripts, and add new test scripts for matrix multiplication and layout testing. The most important changes include updating the TVM submodule, fixing the LLVM installation script, and adding new test scripts for matrix multiplication and layout configuration.

Submodule Update:

  • 3rdparty/tvm: Updated submodule commit to the latest version.

Installation Script Fix:

  • install_amd.sh: Fixed the command to append LLVM repository URLs to sources.list instead of overwriting it.

New Test Scripts:

  • integration/ComposableKernel/test_block_gemm.py: Added a comprehensive test script for matrix multiplication using TVM's tensor language. This includes functions for matrix multiplication, running GEMM, and comparing results with PyTorch.
  • integration/ComposableKernel/test_layout.py: Added a script to test different layout configurations for GEMM fragments. This script dynamically modifies a source file, builds the project, and logs the results for various layout configurations.

@LeiWang1999 LeiWang1999 merged commit fd5f06e into microsoft:main Nov 15, 2024
6 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants