Add custom comparators #1182

matthew-mcallister · 2025-02-24T22:14:57Z

Closes #1180.

The diff is large due to having to add a type parameter C and update constraints in numerous places, but the logic changes are not large.

This trait adds the Comparator and Equivalator traits, which generalize the Comparable and Equivalent traits by taking a self parameter in addition to the two keys to compare, making it possible to change the comparison function used without changing the key type.

Summary

Adds trait Comparator<L, R> and trait Equivalator<L, R>.
Adds struct OrdComparator, which is a zero-sized type that implements Comparator using Borrow and Ord.
Adds a type parameter C to SkipList, SkipMap, and SkipSet.
- C is defaulted to OrdComparator. This preserves the existing behavior of functions like get(), insert(), etc.
Adds SkipList::with_comparator(), SkipSet::with_comparator(), and SkipMap::with_comparator() constructors.
- SkipList::new(), SkipMap::new(), and SkipSet::new() always use C = OrdComparator. It would be nice if it worked for any C that implements Default, but that would break type inference. This is exactly the reason std::collections::HashMap::new() always uses S = RandomState for its defaulted parameter even if S implements Default.
Adds a test using a wrapper around Box<dyn Fn(&[u8], &[u8]) -> Ordering> as the comparator.

Hashing

Because the goal of these traits is to support totally different comparison operations from the default for the underlying key type, Equivalator explicitly does not require two equivalent keys to have the same hash. If someone, someday wants to support Equivalator on their hash map, they will have to generalize the Hash trait as well.

Safety

My one minor question is whether unsafe impl<Q, R, K, V, C> Send for RefRange<'_, Q, R, K, V, C> should require C: Sync. I don't think so because it doesn't require K or V to be sync, but it was enough to make me second-guess myself.

taiki-e · 2025-02-25T17:29:09Z

crossbeam-skiplist/tests/base.rs

+    impl<'a> Equivalator<Foo> for FooComparator {
+        fn equivalent(&self, lhs: &Foo, rhs: &Foo) -> bool {
+            lhs == rhs
+        }
+    }
+
+    impl<'a> Equivalator<Foo, FooRef<'a>> for FooComparator {
+        fn equivalent(&self, foo: &Foo, key: &FooRef<'a>) -> bool {
            let a = u64::from_be_bytes(key.data[..8].try_into().unwrap());
            let b = u32::from_be_bytes(key.data[8..].try_into().unwrap());
-            a == self.a && b == self.b
+            a == foo.a && b == foo.b
        }
    }

-    impl<'a> Comparable<FooRef<'a>> for Foo {
-        fn compare(&self, key: &FooRef<'a>) -> std::cmp::Ordering {
+    impl<'a> Comparator<Foo> for FooComparator {
+        fn compare(&self, lhs: &Foo, rhs: &Foo) -> std::cmp::Ordering {
+            Ord::cmp(lhs, rhs)
+        }
+    }
+
+    impl<'a> Comparator<Foo, FooRef<'a>> for FooComparator {
+        fn compare(&self, foo: &Foo, key: &FooRef<'a>) -> std::cmp::Ordering {
            let a = u64::from_be_bytes(key.data[..8].try_into().unwrap());
            let b = u32::from_be_bytes(key.data[8..].try_into().unwrap());
-            Foo { a, b }.cmp(self)
+            Foo { a, b }.cmp(foo)
        }
    }

-    let s = SkipList::new(epoch::default_collector().clone());
+    let s = SkipList::with_comparator(epoch::default_collector().clone(), FooComparator);


This seems like an unfortunate increase in complexity for users who want to make stateless comparisons.

It's about a dozen lines of new boilerplate. Regardless, it would be very easy to support Equivalent/Comparable on the default comparator. Rename OrdComparator to BasicComparator and define it like this:

struct BasicComparator; impl<K: ?Sized, Q: ?Sized> Equivalator<K, Q> for BasicComparator where K: Equivalent<Q>, { #[inline] fn equivalent(&self, lhs: &K, rhs: &Q) -> bool { <K as Equivalent<Q>>::equivalent(lhs, rhs) } } impl<K: ?Sized, Q: ?Sized> Comparator<K, Q> for BasicComparator where K: Comparable<Q>, { #[inline] fn compare(&self, lhs: &K, rhs: &Q) -> Ordering { <K as Comparable<Q>>::compare(lhs, rhs) } }

In some sense this is the best of both worlds. The only downside would be that there are now two sets of traits (Equivalent/Comparable, Equivalator/Comparator), which might be mildly confusing at first sight.

IMO, two sets of traits are reasonable. Actually, it is just one set of traits for stateless comparison, because, for stateless comparison, users just use the BasicComparator (the default one) and only need to implement Equivalent and Comparable. They do not need to care about Equivalentor and Comparator.

taiki-e · 2025-02-25T17:29:57Z

cc @al8n

Add custom comparators

9e2c8fc

taiki-e reviewed Feb 25, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add custom comparators #1182

Add custom comparators #1182

matthew-mcallister commented Feb 24, 2025

taiki-e Feb 25, 2025

matthew-mcallister Feb 26, 2025

al8n Feb 27, 2025

taiki-e commented Feb 25, 2025

Add custom comparators #1182

Are you sure you want to change the base?

Add custom comparators #1182

Conversation

matthew-mcallister commented Feb 24, 2025

Summary

Hashing

Safety

taiki-e Feb 25, 2025

Choose a reason for hiding this comment

matthew-mcallister Feb 26, 2025

Choose a reason for hiding this comment

al8n Feb 27, 2025

Choose a reason for hiding this comment

taiki-e commented Feb 25, 2025