Add some default NVTX ranges #7633

Kh4L · 2023-06-23T09:08:34Z

This PR uses torch's NVTX API to add default NVTX ranges that help when profiling with nsys. The ranges are enabled by setting the NVIDIA_NVTX_RANGES environment variable to 1.

How to use:

NVIDIA_NVTX_RANGES=1 nsys profile -s none -t cuda,nvtx,osrt,cudnn,cublas --force-overwrite=true python train.py

Result

It adds ranges for the following functions:

DataLoader's collate_fn
Dataset's __getitem__
MessagePassing's main member functions
NeighborSampler's
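
For readers unfamiliar with the pattern, here is a minimal sketch of how a function could be wrapped in an NVTX range behind the same environment variable. It is not the code from this PR: the `nvtx_range` decorator and the `get_item` function are hypothetical names chosen for illustration, and the sketch checks the variable on every call, which may differ from what the PR does. The only torch calls used are `torch.cuda.is_available()`, `torch.cuda.nvtx.range_push()` and `torch.cuda.nvtx.range_pop()`. Without torch or CUDA, the wrapper simply calls the function.

```python
import functools
import os

try:
    import torch
    _HAS_CUDA = torch.cuda.is_available()
except ImportError:  # torch is optional for this sketch
    torch = None
    _HAS_CUDA = False


def nvtx_enabled():
    # Same gate as in the PR description: opt-in via the environment
    # variable, and only when CUDA is actually available.
    return os.environ.get("NVIDIA_NVTX_RANGES", "0") == "1" and _HAS_CUDA


def nvtx_range(name):
    """Hypothetical decorator: wrap each call in an NVTX range `name`."""
    def decorator(func):
        @functools.wraps(func)
        def wrapper(*args, **kwargs):
            if not nvtx_enabled():
                return func(*args, **kwargs)
            torch.cuda.nvtx.range_push(name)
            try:
                return func(*args, **kwargs)
            finally:
                # Always close the range, even if func raises.
                torch.cuda.nvtx.range_pop()
        return wrapper
    return decorator


@nvtx_range("Dataset.__getitem__")
def get_item(data, idx):
    # Stand-in for a real dataset lookup.
    return data[idx]
```

When the variable is unset, the decorated function behaves exactly like the undecorated one, so the ranges add no profiling calls to normal runs.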

codecov · 2023-06-23T09:15:26Z

Codecov Report

Merging #7633 (b01f5ef) into master (0c0d9c1) will decrease coverage by 0.69%.
Report is 14 commits behind head on master.
The diff coverage is 50.00%.

@@            Coverage Diff             @@
##           master    #7633      +/-   ##
==========================================
- Coverage   90.44%   89.75%   -0.69%     
==========================================
  Files         455      456       +1     
  Lines       26334    26406      +72     
==========================================
- Hits        23818    23702     -116     
- Misses       2516     2704     +188

| Files Changed | Coverage Δ | |
|---|---|---|
| torch_geometric/nn/conv/message_passing.py | `94.68% <ø> (ø)` | |
| torch_geometric/loader/__init__.py | `61.90% <20.00%> (-38.10%)` | ⬇️ |
| torch_geometric/data/dataset.py | `94.70% <100.00%> (+0.02%)` | ⬆️ |
| torch_geometric/nn/conv/pna_conv.py | `100.00% <100.00%> (ø)` | |
| torch_geometric/sampler/neighbor_sampler.py | `93.47% <100.00%> (+0.02%)` | ⬆️ |

... and 33 files with indirect coverage changes



puririshi98 · 2023-08-02T15:43:15Z

LGTM!

akihironitta · 2023-08-04T12:22:51Z

torch_geometric/nn/conv/message_passing.py

+                          "0") == "1" and torch.cuda.is_available():
+            self._nvtx_handles = dict()
+
+            def get_hooks_for(func_name):


I think this is not an issue if NVIDIA_NVTX_RANGES is not meant to be used by the community, but I prefer to define the hook at the top-level to avoid PicklingError in a multiprocessing setting. https://docs.python.org/3/library/pickle.html#what-can-be-pickled-and-unpickled
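
To illustrate the pickling concern, here is a small standalone sketch using only the standard library. The names `make_local_hook`, `top_level_hook` and `can_pickle` are hypothetical and not part of the PR. A function defined inside another function cannot be pickled, while a module-level function bound with `functools.partial` can, when the code is run as a normal script or module. The exact exception type raised for the nested function depends on the Python version, so the sketch catches both.

```python
import functools
import pickle


def make_local_hook(func_name):
    # Nested definition: the returned function is a local object,
    # which pickle cannot look up by qualified name.
    def hook(module, inputs):
        return func_name
    return hook


def top_level_hook(func_name, module, inputs):
    # Module-level definition: pickle stores a reference to it by name.
    return func_name


def can_pickle(obj):
    try:
        pickle.dumps(obj)
        return True
    except (pickle.PicklingError, AttributeError):
        return False


local_hook = make_local_hook("propagate")
# Bind the function name up front instead of capturing it in a closure.
partial_hook = functools.partial(top_level_hook, "propagate")

print("nested hook picklable:", can_pickle(local_hook))
print("top-level partial picklable:", can_pickle(partial_hook))
```

This matters when a module holding such hooks is sent to worker processes, since multiprocessing uses pickle to transfer objects under the spawn start method.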

rusty1s assigned Kh4L Jun 24, 2023

rusty1s added 0 - Priority P0 feature benchmark labels Jun 24, 2023

Kh4L force-pushed the default_nvtx_pyg branch from b39b36b to da89da4 Compare June 28, 2023 10:52

Kh4L force-pushed the default_nvtx_pyg branch from 84c7a6d to 1827f62 Compare July 13, 2023 09:43

Kh4L marked this pull request as ready for review July 13, 2023 09:49

Kh4L requested review from mananshah99, a team and EdisonLeeeee as code owners July 13, 2023 09:49

Kh4L and others added 5 commits July 27, 2023 19:39

Add some default NVTX ranges

9f65ca7

Signed-off-by: Serge Panev <[email protected]>

[pre-commit.ci] auto fixes from pre-commit.com hooks

44f51c5

for more information, see https://pre-commit.ci

Typo fix

7ec1b67

Signed-off-by: Serge Panev <[email protected]>

Fix

4eed050

Signed-off-by: Serge Panev <[email protected]>

Fix

0061570

Signed-off-by: Serge Panev <[email protected]>

Kh4L force-pushed the default_nvtx_pyg branch from 1827f62 to 0061570 Compare July 27, 2023 10:43

Merge branch 'master' into default_nvtx_pyg

9a0731d

Fix line length

b01f5ef

akihironitta reviewed Aug 4, 2023
