Optimize recompression for non-segmentby chunks #7632

kpan2034 · 2025-01-28T23:00:45Z

Enables the segmentwise recompression flow to be used for chunks without segmentby columns.

This should be more performant than doing a full recompression.

Enables the segmentwise recompression flow to be used for chunks without segmentby columns. This should be more performant than doing a full recompression.

codecov · 2025-01-28T23:21:37Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 81.30%. Comparing base (59f50f2) to head (5ab9b12).
Report is 718 commits behind head on main.

Additional details and impacted files

@@            Coverage Diff             @@
##             main    #7632      +/-   ##
==========================================
+ Coverage   80.06%   81.30%   +1.23%     
==========================================
  Files         190      240      +50     
  Lines       37181    44696    +7515     
  Branches     9450    11159    +1709     
==========================================
+ Hits        29770    36340    +6570     
- Misses       2997     3967     +970     
+ Partials     4414     4389      -25

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

antekresic · 2025-01-30T11:59:37Z

tsl/src/compression/recompress.c

@@ -168,6 +168,10 @@ recompress_chunk_segmentwise_impl(Chunk *uncompressed_chunk)

 	CompressedSegmentInfo *current_segment = palloc0(sizeof(CompressedSegmentInfo) * n_keys);

+	// For chunks with no segmentby settings, we can still do segmentwise recompression
+	// The entire chunk is treated as a single segment
+	elog(ts_guc_debug_compression_path_info ? INFO : DEBUG1, "using non-segmentby index for recompression") ;


This will log every time that you are using non-segmentby index but thats not true. You should log the index name you are using instead (its easy to check what index is being used that way).

antekresic · 2025-01-30T12:01:01Z

tsl/test/expected/recompress_chunk_segmentwise.out

@@ -483,7 +483,7 @@ select compressed_chunk_name as compressed_chunk_name_after_recompression from c
 select :'compressed_chunk_name_before_recompression' as before_recompression, :'compressed_chunk_name_after_recompression' as after_recompression;
    before_recompression    |    after_recompression     
 ----------------------------+----------------------------
- compress_hyper_13_14_chunk | compress_hyper_13_15_chunk
+ compress_hyper_13_14_chunk | compress_hyper_13_14_chunk


Comment above on line 475 needs updating since it is incorrect with this change.

antekresic · 2025-01-30T12:02:03Z

tsl/test/sql/recompress_chunk_segmentwise.sql

@@ -291,6 +291,23 @@ insert into nullseg_many values (:'start_time', 1, NULL, NULL);
 SELECT compress_chunk(:'chunk_to_compress');
 select * from :compressed_chunk_name;

+-- Test behaviour when no segmentby columns are present
+SET timescaledb.debug_compression_path_info TO ON;


Lets enable this GUC for the complete test so we can verify that the correct index is being used for each recompression.

Optimize recompression for non-segmentby chunks

5ab9b12

Enables the segmentwise recompression flow to be used for chunks without segmentby columns. This should be more performant than doing a full recompression.

kpan2034 requested review from antekresic and svenklemm January 28, 2025 23:00

github-actions bot assigned kpan2034 Jan 28, 2025

antekresic reviewed Jan 30, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Optimize recompression for non-segmentby chunks #7632

Optimize recompression for non-segmentby chunks #7632

kpan2034 commented Jan 28, 2025

codecov bot commented Jan 28, 2025

antekresic Jan 30, 2025

antekresic Jan 30, 2025

antekresic Jan 30, 2025

Optimize recompression for non-segmentby chunks #7632

Are you sure you want to change the base?

Optimize recompression for non-segmentby chunks #7632

Conversation

kpan2034 commented Jan 28, 2025

codecov bot commented Jan 28, 2025

Codecov Report

antekresic Jan 30, 2025

Choose a reason for hiding this comment

antekresic Jan 30, 2025

Choose a reason for hiding this comment

antekresic Jan 30, 2025

Choose a reason for hiding this comment