manifest: add range annotations #3759

anish-shanbhag · 2024-07-15T16:04:45Z

manifest: add range annotations

This change adds a "range annotation" feature to Annotators , which are computations that aggregate some value over a specific key range within a level. Range annotations use the same B-tree caching behavior as regular annotations, so queries remain fast even with thousands of tables because they avoid a sequential iteration over a level's files.

This PR only sets up range annotations without changing any existing behavior. See #3793 for some potential use cases.

BenchmarkNumFilesRangeAnnotation shows that range annotations are significantly faster than using version.Overlaps to aggregate over a key range:

pkg: github.com/cockroachdb/pebble/internal/manifest
BenchmarkNumFilesRangeAnnotation/annotator-10         	  306010	      4015 ns/op	      48 B/op	       6 allocs/op
BenchmarkNumFilesRangeAnnotation/overlaps-10          	    2223	    513519 ns/op	     336 B/op	       8 allocs/op

cockroach-teamcity · 2024-07-15T16:04:52Z

This change is

jbowens

Do you mind extracting the conversion to generics into its own commit/PR so that it can be reviewed independently?

Reviewable status: 0 of 11 files reviewed, all discussions resolved (waiting on @itsbilal)

anish-shanbhag

Definitely - that PR is now at #3760. I'll rebase this one to just include the range annotation changes once that one is merged.

Reviewable status: 0 of 11 files reviewed, all discussions resolved (waiting on @itsbilal)

cockroachdb#3760 contained a bug which causes Annotator values to be incorrectly aggregated when pointer values should be overwritten. This is because the `vtyped` variable was not being modified due to being on the stack. This change fixes this and adds a unit test for `PickFileAggregator` to catch this issue in the future. cockroachdb#3759 should already not be affected by this due to the different way it handles aggregation.

#3760 contained a bug which causes Annotator values to be incorrectly aggregated when pointer values should be overwritten. This is because the `vtyped` variable was not being modified due to being on the stack. This change fixes this and adds a unit test for `PickFileAggregator` to catch this issue in the future. #3759 should already not be affected by this due to the different way it handles aggregation.

jbowens

Cool!

Reviewable status: 0 of 13 files reviewed, 1 unresolved discussion (waiting on @anish-shanbhag and @itsbilal)

internal/manifest/annotator.go line 104 at r3 (raw file):

	lowerBound []byte,
	// upperBound is a UserKeyBoundary that may be inclusive or exclusive.
	upperBound *base.UserKeyBoundary,

what impact does this have on the existing annotation benchmarks without any ranges?

I expect there is some overhead to combining these two routines, and we might be better off duplicating the function (using mutual recursion when an entire subtree is contained within the bounds). We're also less likely to accidentally begin caching an annotation that does not apply across the node's width.

I think we could also avoid adding a new scratch field to every annotation, and have the caller pass in a *T into which they want the value accumulated. Callers can avoid an allocation by allocating the T with their own data types.

anish-shanbhag

Reviewable status: 0 of 13 files reviewed, 1 unresolved discussion (waiting on @itsbilal and @jbowens)

internal/manifest/annotator.go line 104 at r3 (raw file):

Previously, jbowens (Jackson Owens) wrote…

what impact does this have on the existing annotation benchmarks without any ranges?

I expect there is some overhead to combining these two routines, and we might be better off duplicating the function (using mutual recursion when an entire subtree is contained within the bounds). We're also less likely to accidentally begin caching an annotation that does not apply across the node's width.

I think we could also avoid adding a new scratch field to every annotation, and have the caller pass in a *T into which they want the value accumulated. Callers can avoid an allocation by allocating the T with their own data types.

You're right, there was some extra overhead introduced:

goos: darwin
goarch: arm64
pkg: github.com/cockroachdb/pebble/internal/manifest
                     │     old     │                new                 │
                     │   sec/op    │   sec/op     vs base               │
NumFilesAnnotator-10   1.536µ ± 1%   1.664µ ± 1%  +8.33% (p=0.000 n=10)

                     │    old     │                new                │
                     │    B/op    │    B/op     vs base               │
NumFilesAnnotator-10   536.0 ± 0%   537.0 ± 0%  +0.19% (p=0.000 n=10)

                     │    old     │              new               │
                     │ allocs/op  │ allocs/op   vs base            │
NumFilesAnnotator-10   7.000 ± 0%   7.000 ± 0%  ~ (p=1.000 n=10) ¹
¹ all samples are equal

Agreed that it's better to separate the two functions - I've updated with that change, and the logic is definitely looking cleaner.

I realized that we can just use a single scratch field for the entire Annotator, rather than one for each annotation. My updated implementation takes that route, but let me know if you think there's still a benefit to allowing callers to pass a custom *T.

This change adds a "range annotation" feature to Annotators , which are computations that aggregate some value over a specific key range within a level. Range annotations use the same B-tree caching behavior as regular annotations, so queries remain fast even with thousands of tables because they avoid a sequential iteration over a level's files. This PR only sets up range annotations without changing any existing behavior. See cockroachdb#3793 for some potential use cases. `BenchmarkNumFilesRangeAnnotation` shows that range annotations are significantly faster than using `version.Overlaps` to aggregate over a key range: ``` pkg: github.com/cockroachdb/pebble/internal/manifest BenchmarkNumFilesRangeAnnotation/annotator-10 306010 4015 ns/op 48 B/op 6 allocs/op BenchmarkNumFilesRangeAnnotation/overlaps-10 2223 513519 ns/op 336 B/op 8 allocs/op ```

jbowens

Nice!

Reviewed 1 of 13 files at r2, 1 of 2 files at r4, 1 of 1 files at r5, all commit messages.
Reviewable status: 3 of 13 files reviewed, all discussions resolved (waiting on @itsbilal)

anish-shanbhag

TFTR!

Reviewable status: 3 of 13 files reviewed, all discussions resolved (waiting on @itsbilal)

anish-shanbhag force-pushed the range-annotator branch 3 times, most recently from d1f0f5b to 35aa22e Compare July 15, 2024 18:15

anish-shanbhag marked this pull request as ready for review July 15, 2024 18:23

anish-shanbhag requested a review from a team as a code owner July 15, 2024 18:23

anish-shanbhag requested review from jbowens and itsbilal July 15, 2024 18:23

jbowens reviewed Jul 15, 2024

View reviewed changes

anish-shanbhag commented Jul 15, 2024

View reviewed changes

anish-shanbhag marked this pull request as draft July 15, 2024 20:34

anish-shanbhag changed the title ~~manifest: refactor Annotator with generics and add range annotations~~ manifest: add range annotations Jul 15, 2024

anish-shanbhag force-pushed the range-annotator branch from 35aa22e to 79bdbc1 Compare July 26, 2024 16:24

anish-shanbhag mentioned this pull request Jul 26, 2024

improve performance of aggregations over a key range using range annotations #3793

Open

anish-shanbhag force-pushed the range-annotator branch 3 times, most recently from b046563 to 282a3b1 Compare July 26, 2024 18:57

anish-shanbhag marked this pull request as ready for review July 26, 2024 19:05

anish-shanbhag mentioned this pull request Jul 30, 2024

manifest: fix incorrect Annotator pointer aggregation #3808

Merged

anish-shanbhag force-pushed the range-annotator branch from 282a3b1 to 2d16b80 Compare August 8, 2024 15:37

anish-shanbhag requested a review from jbowens August 8, 2024 17:56

jbowens reviewed Aug 9, 2024

View reviewed changes

anish-shanbhag force-pushed the range-annotator branch from 2d16b80 to 5761b73 Compare August 13, 2024 15:45

anish-shanbhag commented Aug 13, 2024

View reviewed changes

anish-shanbhag force-pushed the range-annotator branch from 5761b73 to e375244 Compare August 13, 2024 16:06

anish-shanbhag requested a review from jbowens August 13, 2024 16:15

anish-shanbhag force-pushed the range-annotator branch from e375244 to 4776fa3 Compare August 13, 2024 18:35

jbowens approved these changes Aug 14, 2024

View reviewed changes

anish-shanbhag commented Aug 14, 2024

View reviewed changes

anish-shanbhag merged commit a6d2952 into cockroachdb:master Aug 14, 2024
11 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

manifest: add range annotations #3759

manifest: add range annotations #3759

anish-shanbhag commented Jul 15, 2024 •

edited

Loading

cockroach-teamcity commented Jul 15, 2024

jbowens left a comment

anish-shanbhag left a comment

jbowens left a comment

anish-shanbhag left a comment

jbowens left a comment

anish-shanbhag left a comment

manifest: add range annotations #3759

manifest: add range annotations #3759

Conversation

anish-shanbhag commented Jul 15, 2024 • edited Loading

cockroach-teamcity commented Jul 15, 2024

jbowens left a comment

Choose a reason for hiding this comment

anish-shanbhag left a comment

Choose a reason for hiding this comment

jbowens left a comment

Choose a reason for hiding this comment

anish-shanbhag left a comment

Choose a reason for hiding this comment

jbowens left a comment

Choose a reason for hiding this comment

anish-shanbhag left a comment

Choose a reason for hiding this comment

anish-shanbhag commented Jul 15, 2024 •

edited

Loading