Fix counter collection inconsistency with rocprofv3 #589

feizheng10 · 2025-03-02T21:13:50Z

In some cases, we noticed that the total number of collected rows is not the same across all perfmon CSV files:
pmc_perf.csv, SQ_IFETCH_LEVEL.csv, SQ_INST_LEVEL_LDS.csv, SQ_INST_LEVEL_SMEM.csv, SQ_INST_LEVEL_VMEM.csv.
The quick fix is to use an inner join when the collected CSV files do not have matching row counts.
We may still need to dig in and find the root cause.
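To make the mismatch described above easier to spot, here is a minimal sketch that counts data rows per CSV and flags a disagreement. The helper names `count_data_rows` and `row_counts_differ` are hypothetical and not part of this PR, and the in-memory CSV contents (including the `counter_a` and `counter_b` columns) are made up for illustration; only the file names come from the description.

```python
import csv
import io


def count_data_rows(fh):
    """Count rows in an open CSV stream, excluding the header line."""
    reader = csv.reader(fh)
    next(reader, None)  # skip the header row if present
    return sum(1 for _ in reader)


def row_counts_differ(counts):
    """Return True if the per-file row counts are not all identical."""
    return len(set(counts.values())) > 1


# Illustrative stand-ins for two perfmon outputs (contents are invented).
samples = {
    "pmc_perf.csv": "Dispatch_ID,counter_a\n0,1\n1,2\n2,3\n",
    "SQ_IFETCH_LEVEL.csv": "Dispatch_ID,counter_b\n0,4\n1,5\n",
}

counts = {name: count_data_rows(io.StringIO(text)) for name, text in samples.items()}
mismatch = row_counts_differ(counts)
```

With real output, the same functions could be applied to each file opened with `open(path, newline="")`.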

* Fix post analysis gui in standalone binary
* Add post analysis gui assets and required server libraries for GUI server and web page
* Add port forwarding to docker test compose
* Update README to use `docker compose up` instead of `docker compose run` to run containers with port forwarding and to leverage other functionalities of docker compose

vedithal-amd

LGTM, just some minor comments

vedithal-amd · 2025-03-03T14:06:06Z

src/rocprof_compute_soc/soc_base.py

 def using_v3():
-    return "ROCPROF" in os.environ.keys() and os.environ["ROCPROF"] == "rocprofv3"
+    return "ROCPROF" in os.environ.keys() and "rocprofv3" in os.environ["ROCPROF"]


Just curious: why does this hack do a substring match?

I get it now: the ROCPROF env. var. will contain the path to rocprofv3. In that case, should we use `os.environ["ROCPROF"].endswith("rocprofv3")`?
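The difference between the substring match in the diff and the suffix match suggested here can be sketched as follows. This is only an illustration of the review suggestion, not necessarily what was merged; the function names and the example paths are hypothetical, and both functions take the environment as a parameter so they can be exercised without touching `os.environ`.

```python
import os


def using_v3_substring(environ=None):
    """Substring check, as in the diff above: true if 'rocprofv3' appears anywhere."""
    env = os.environ if environ is None else environ
    return "ROCPROF" in env and "rocprofv3" in env["ROCPROF"]


def using_v3_suffix(environ=None):
    """Suffix check, as suggested in review: true only if the value ends with 'rocprofv3'."""
    env = os.environ if environ is None else environ
    return env.get("ROCPROF", "").endswith("rocprofv3")


# Hypothetical values for the ROCPROF environment variable.
bare_name = {"ROCPROF": "rocprofv3"}
full_path = {"ROCPROF": "/opt/rocm/bin/rocprofv3"}
misleading = {"ROCPROF": "/opt/rocprofv3_tools/bin/rocprof"}

results = {
    "bare_name": (using_v3_substring(bare_name), using_v3_suffix(bare_name)),
    "full_path": (using_v3_substring(full_path), using_v3_suffix(full_path)),
    "misleading": (using_v3_substring(misleading), using_v3_suffix(misleading)),
}
```

Both checks accept a bare name or a full path; only the substring check also accepts a path that merely contains `rocprofv3` in a directory name.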

vedithal-amd · 2025-03-10T22:00:47Z

src/utils/file_io.py

@@ -209,7 +209,8 @@ def create_single_df_pmc(raw_data_dir, node_name, kernel_verbose, verbose):
                    dfs.append(tmp_df)
                    coll_levels.append(f[:-4])

-        final_df = pd.concat(dfs, keys=coll_levels, axis=1, copy=False)
+        # TODO: double check the case if all tmp_df.shape[0] are not on the same page
+        final_df = pd.concat(dfs, keys=coll_levels, axis=1, join="inner", copy=False)


As discussed in the meeting, an inner join on the index is the only option we have, since kernel names and dispatch IDs might be non-deterministic.
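A small sketch of what `join="inner"` changes in `pd.concat(..., axis=1)` when the per-file frames have different row counts. The frames and the column names `counter_a` and `counter_b` are invented for illustration; only the `keys`, `axis`, and `join` arguments mirror the diff.

```python
import pandas as pd

# Invented stand-ins for two per-file frames with different row counts.
pmc_perf = pd.DataFrame({"Dispatch_ID": [0, 1, 2], "counter_a": [10, 20, 30]})
sq_ifetch = pd.DataFrame({"Dispatch_ID": [0, 1], "counter_b": [5, 6]})

dfs = [pmc_perf, sq_ifetch]
coll_levels = ["pmc_perf", "SQ_IFETCH_LEVEL"]

# Default (outer) join keeps every index label and pads the shorter frame with NaN.
outer_df = pd.concat(dfs, keys=coll_levels, axis=1)

# Inner join keeps only the index labels present in every frame.
inner_df = pd.concat(dfs, keys=coll_levels, axis=1, join="inner")
```

Because the join is on the positional row index, this keeps rows 0 and 1 and drops row 2. It assumes that rows at the same position in each file describe the same dispatch, which is the case the TODO in the diff asks to double check.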

inner join if collected csv are not on the same page

5362e56

feizheng10 requested a review from koomie as a code owner March 2, 2025 21:13

feizheng10 requested review from vedithal-amd and removed request for koomie March 2, 2025 21:18

feizheng10 added 2 commits March 2, 2025 14:54

format code

20f58fa

fix potential rocprofv3 env path hack

c1c3f30

feizheng10 requested review from coleramos425 and dgaliffiAMD as code owners March 3, 2025 00:42

feizheng10 removed the request for review from dgaliffiAMD March 3, 2025 00:48

vedithal-amd and others added 4 commits March 3, 2025 16:57

Fix rocprofv1 output processing. (ROCm#588)

f2d6150

Merge branch 'develop' into fix_collection_inconsistency

37c08fa

Merge branch 'develop' into fix_collection_inconsistency

30afef2

ywang103-amd approved these changes Mar 10, 2025

vedithal-amd approved these changes Mar 10, 2025

feizheng10 added 2 commits March 10, 2025 17:47

improve detecting rocprofv3

bc9cf07

format code

ce5c803
