Speed up mask sampling with rejection sampling #2585

Merged

Conversation

anc2001 (Contributor) commented Nov 6, 2023

This PR speeds up mask sampling and avoids OOM errors for large images and masks by replacing torch.nonzero with rejection sampling (see the sketch after the screenshots below). I found that this solves many of the issues related to inefficient sampling with masks.

Example throughput comparison (screenshots):
[screenshot: rays / sec without masks]
[screenshot: rays / sec with the current mask sampling]
[screenshot: rays / sec with the new mask sampling]
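
For readers following along, here is a minimal sketch of the rejection-sampling idea described in this PR. It is not the exact code that was merged: the function name and retry budget are illustrative, and `mask` is assumed to be a `(num_images, H, W, 1)` boolean tensor.

```python
import torch


def sample_masked_indices_rejection(
    mask: torch.Tensor, batch_size: int, max_iterations: int = 100
) -> torch.Tensor:
    """Draw batch_size (image, row, col) indices that land inside the mask."""
    num_images, height, width = mask.shape[:3]
    device = mask.device
    bounds = torch.tensor([num_images, height, width], device=device)

    indices = torch.zeros((batch_size, 3), dtype=torch.long, device=device)
    valid = torch.zeros(batch_size, dtype=torch.bool, device=device)

    for _ in range(max_iterations):
        if bool(valid.all()):
            break
        # Slots that still need a valid sample. This torch.nonzero call is cheap:
        # it runs over a batch_size-long vector, not over the full mask stack.
        slots = torch.nonzero(~valid).squeeze(-1)
        # Draw fresh uniform candidates for those slots only.
        candidates = torch.floor(
            torch.rand((slots.shape[0], 3), device=device) * bounds
        ).long()
        # Keep only candidates whose pixel lies inside the mask.
        hits = mask[candidates[:, 0], candidates[:, 1], candidates[:, 2], 0].bool()
        indices[slots[hits]] = candidates[hits]
        valid[slots[hits]] = True

    if not bool(valid.all()):
        # Very sparse masks can exhaust the retry budget; a caller would fall back
        # to the original torch.nonzero-based sampling here.
        raise RuntimeError("Rejection sampling did not fill the batch.")
    return indices
```

The key point is that the only torch.nonzero call operates on a batch-sized boolean vector rather than on the full stack of image masks, which is what causes the slowdown and OOM errors for large images.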

KevinXu02 (Contributor) commented Nov 7, 2023

The speedup is really significant, but with sparse data such as lidar it will mostly fail. I think it should be added as an optional method instead of replacing the original code.

blacksino (Contributor) commented

Alternatively, you could cache the nonzero indices as a member variable, for example:
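
For illustration, a tiny sketch of that caching idea (the class and attribute names are hypothetical, not nerfstudio API): torch.nonzero runs once per mask and its result is reused for every batch.

```python
import torch


class CachedMaskSampler:
    """Sketch: pay the torch.nonzero cost once, then sample from the cached indices."""

    def __init__(self, mask: torch.Tensor):
        # Expensive for large masks, but computed a single time instead of every batch.
        self._nonzero_indices = torch.nonzero(mask[..., 0])  # (num_valid, 3)

    def sample(self, batch_size: int) -> torch.Tensor:
        choice = torch.randint(
            0,
            self._nonzero_indices.shape[0],
            (batch_size,),
            device=self._nonzero_indices.device,
        )
        return self._nonzero_indices[choice]
```

This trades memory for speed: the cached index tensor can itself be large for dense masks, which is part of what the rejection-sampling approach avoids.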

akristoffersen (Contributor) commented

I recently relied on this PR to dramatically speed up training with masks. @anc2001, would you be interested in getting this PR over the finish line? Perhaps with an optional flag as @KevinXu02 suggests, though I think it may make more sense for this to be the default behavior and to revert to the slower behavior with an optional flag.

anc2001 (Contributor, Author) commented Jan 8, 2024

Yes @akristoffersen, I'd love to push this PR over the finish line! I've just been a little busy recently and have had some issues setting up the dev environment.

machenmusik (Contributor) commented Jan 8, 2024

Do we have any usage data on how often this is used with sparse data such as lidar? My impression is that sparse data usage is less common, and so enabling this option by default would be better for the majority.

anc2001 (Contributor, Author) commented Jan 10, 2024

A better solution than an optional flag might be some kind of adaptive thresholding (sketched below): if the fraction of valid pixels in the mask is below a certain threshold, revert to the original behavior; if it is above, use rejection sampling. This might be better for datasets with a mix of sparse and dense masks, but I'm not sure that's necessary, so I'll leave it to future PRs if people want that kind of functionality.
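
A hypothetical sketch of that adaptive-threshold idea; the threshold value and function names are illustrative and not part of this PR, and `sample_masked_indices_rejection` refers to the helper sketched earlier in this thread.

```python
import torch


def sample_masked_indices_adaptive(
    mask: torch.Tensor, batch_size: int, valid_fraction_threshold: float = 0.01
) -> torch.Tensor:
    # Fraction of pixels that are valid across the whole mask stack.
    valid_fraction = float(mask.float().mean())
    if valid_fraction < valid_fraction_threshold:
        # Sparse mask (e.g. lidar): enumerate the valid pixels once and sample from them.
        nonzero = torch.nonzero(mask[..., 0])  # (num_valid, 3) of (image, row, col)
        choice = torch.randint(0, nonzero.shape[0], (batch_size,), device=mask.device)
        return nonzero[choice]
    # Dense mask: rejection sampling converges quickly and avoids running
    # torch.nonzero over the full mask entirely.
    return sample_masked_indices_rejection(mask, batch_size)
```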

akristoffersen self-requested a review on January 11, 2024.
akristoffersen (Contributor) left a review:

LGTM, small comment but otherwise looks good to merge :D

Inline comment on the diff, at the excerpt:

    ).long()
    indices[~chosen_indices_validity] = replacement_indices

    if num_valid != batch_size:

Instead of just raising a warning, I think it would make sense to default back to the slow non-rejection sampling if this occurs. I would still issue a warning, but it would suck for training to fail on the off chance that not enough valid indices are generated in time.

akristoffersen (Contributor) left a follow-up review:

Small nit: with the most recent change, if the first iteration fails with rejection sampling, a warning will be generated, but the returned indices will not necessarily contain only valid locations within the mask. Every other sampling call will be fine, but the first one will still have invalid indices.

I think instead we should throw away the currently generated indices and start from scratch with the non-rejection sampling method, as well as change the config flag to use the non-rejection sampling method for all future iterations (see the sketch below).
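
A minimal sketch of that suggested fallback, assuming a boolean flag on the sampler config; the flag and helper names here are illustrative rather than the exact nerfstudio API, and `sample_masked_indices_rejection` is the helper sketched earlier in this thread.

```python
import warnings

import torch


def sample_with_fallback(config, mask: torch.Tensor, batch_size: int) -> torch.Tensor:
    if config.rejection_sample_mask:  # assumed config flag enabling rejection sampling
        try:
            return sample_masked_indices_rejection(mask, batch_size)
        except RuntimeError:
            warnings.warn(
                "Rejection sampling could not fill the batch; "
                "falling back to torch.nonzero-based sampling from now on."
            )
            # Flip the flag so all future iterations skip rejection sampling.
            config.rejection_sample_mask = False
    # Non-rejection path: discard any partial result, enumerate valid pixels,
    # and sample uniformly from them.
    nonzero = torch.nonzero(mask[..., 0])
    choice = torch.randint(0, nonzero.shape[0], (batch_size,), device=mask.device)
    return nonzero[choice]
```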

akristoffersen (Contributor) commented

LGTM, thanks again @anc2001! feel free to merge :D

anc2001 (Contributor, Author) commented Jan 18, 2024

Hey @akristoffersen, I don't have write access to merge this PR! Do you know who we would need to @ to get this merged?

akristoffersen enabled auto-merge (squash) on January 18, 2024.
akristoffersen (Contributor) commented

That would be me! It should merge once all checks are done.

akristoffersen merged commit a78ca29 into nerfstudio-project:main on Jan 18, 2024 (4 checks passed).
ArpegorPSGH pushed a commit to ArpegorPSGH/nerfstudio that referenced this pull request on Jun 22, 2024:
* change masked pixel sampling to use rejection sampling instead of torch.nonzero

* black reformat code

* pyright unbound variable num_valid

* pyright type issues with num_valid

* add configuration settings for rejection sampling masks

* black reformat

* maybe this fixes it?

* revert behavior if mask sampling failed, still raise warning

* on iteration failure, use non-rejection sampling to generate indices

* ruff

---------

Co-authored-by: adrian_chang <[email protected]>
Co-authored-by: Alexander Kristoffersen <[email protected]>
nepfaff (Contributor) commented Sep 26, 2024

@anc2001, did you get these timing results with --pipeline.datamanager.masks-on-gpu True --pipeline.datamanager.images-on-gpu True, or without these options? Not including them still seems to be slow for me, but maybe I have a bug somewhere.

I only seem to have these problems when using depth with depth-nerfacto
