[tune gemm v3.4] Add xcd-based pid remapping and change back to rocprofv1 #630

Merged (5 commits) on Aug 19, 2024

Commits on Aug 17, 2024

  1. Change to rocprofv1

    zhanglx13 committed Aug 17, 2024
    05aead8
  2. improve post processing of rocprof results

    - Set --iters=200 as the default. This is enough because the kernel
    time is stable after the first few runs.
    - Filter out kernel times that are too large, using the first kernel
    time as the threshold. Something must be wrong with a kernel run
    if its elapsedTime is larger than that of the first run. The cause
    still needs to be investigated; for now, these runs are filtered out.
    zhanglx13 committed Aug 17, 2024
    e21d43c
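
The filtering rule described in the commit message above can be sketched roughly as follows. This is an illustration only, not the code from this PR: the function names and the plain list of per-run elapsed times are hypothetical, and it assumes the first run itself is kept in the average (the commit message does not say whether it is).

```python
from statistics import mean


def filter_kernel_times(elapsed):
    """Split per-run kernel times into (kept, dropped).

    Hypothetical helper: the first run's time is the threshold, and any
    run strictly slower than it is treated as an outlier and dropped.
    """
    if not elapsed:
        return [], []
    threshold = elapsed[0]  # first kernel time is the threshold
    kept = [t for t in elapsed if t <= threshold]
    dropped = [t for t in elapsed if t > threshold]
    return kept, dropped


def average_kernel_time(elapsed):
    """Average the runs that survive the first-run threshold filter."""
    kept, _ = filter_kernel_times(elapsed)
    if not kept:
        raise ValueError("no kernel timings to average")
    return mean(kept)


# Made-up timings for illustration; the 500.0 run exceeds the first run.
kept, dropped = filter_kernel_times([120.0, 100.0, 101.0, 500.0, 99.0])
```

With the made-up timings above, only the 500.0 sample is dropped. Returning the dropped samples alongside the kept ones makes it easy to log how many runs were discarded, which may help with the investigation the commit message mentions.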

Commits on Aug 18, 2024

  1. Add xcd-based pid remapping

    zhanglx13 committed Aug 18, 2024
    e355a42
  2. cba3d19

Commits on Aug 19, 2024

  1. Update readme

    zhanglx13 committed Aug 19, 2024
    907605a