Time dimension in keyspace #15

adammck · 2022-06-06T03:22:17Z

The keyspace currently only has a single dimension. Ranges have a start key and an end key and that's it. We do however store the full range history, and include the relevant parts of it with PrepareAddRange commands, so when rebalancing, storage services can choose whether to (a) move existing/historical state around, so that queries can fetch old+new writes from a single replica (and benefit from predicate push-down) or (b) to simply send future writes to their new home, and stitch old+new writes back together at query time. I haven't made an example implementation of the latter yet, but I've tried to be careful to support it.

Both state-moving and time-stitching have their benefits. The former allows reads and writes to be balanced more precisely based on actual utilization, but is complex to implement, and increases load on the destination node (and possibly the source node) while moving things around. The latter allows write load to be quickly and cheaply rebalanced, at the cost of more complexity and constraints at query time; queries must be supported across multiple storage nodes, which can't always be implemented efficiently. Services must pick their poison.

However! Wouldn't it be splendid if services could have it both ways, by splitting ranges as of right now to quickly balance writes, and moving around historical data to balance reads? There is currently no way for Ranger to tell nodes "split range 19 at time: 2022-06-01", leaving two new ranges covering the same key ranges for different time ranges (presumably [-inf, 2022-06-01] and (2022-06-01, +inf]). Much like #14, the current implementation would be a strict subset of this API, since t would simply always be infinity on both ends. So nodes not wanting this could simply tell Ranger not to split on time, and use a Rangelet interface not including that option.

The text was updated successfully, but these errors were encountered:

adammck added the maybe label Jun 6, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Time dimension in keyspace #15

Time dimension in keyspace #15

adammck commented Jun 6, 2022

Time dimension in keyspace #15

Time dimension in keyspace #15

Comments

adammck commented Jun 6, 2022