Implement S3 partitioned reads #618

arthurpassos · 2025-02-11T13:34:43Z

Describe the new feature

Being able to read from a partitioned S3 table.

Writing to a partitioned table is already supported, but reading is not. I suspect the reason is that it is not clearly defined how to determine whether a file belongs to the table. Should the `{_partition_id}` placeholder be treated as a star wildcard? Or do we need to build a regular expression for the partition ID?
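To make the two options concrete, here is a minimal Python sketch of how each would decide whether an object key belongs to the table. It is not ClickHouse code: the helper `path_to_regex`, the sample keys, and the digits-only partition ID format are all hypothetical, and the wildcard is assumed to behave like a glob `*` that does not cross `/`.

```python
import re

PLACEHOLDER = "{_partition_id}"
TEMPLATE = "aws-s3-bucket/{_partition_id}/file.parquet"


def path_to_regex(template: str, partition_pattern: str) -> re.Pattern:
    """Turn a path template into a regex (hypothetical helper).

    Everything outside the placeholder is matched literally; the placeholder
    is replaced by the given pattern and captured as `partition_id`.
    """
    prefix, suffix = template.split(PLACEHOLDER, 1)
    return re.compile(
        re.escape(prefix)
        + "(?P<partition_id>" + partition_pattern + ")"
        + re.escape(suffix)
    )


# Option 1: treat the placeholder as a star wildcard.
# Assumption: '*' matches any run of characters that does not contain '/'.
wildcard = path_to_regex(TEMPLATE, r"[^/]*")

# Option 2: a regex specific to the partition ID.
# Assumption: partition IDs are digits only (purely illustrative).
strict = path_to_regex(TEMPLATE, r"\d+")

# Hypothetical object keys found under the bucket prefix.
keys = [
    "aws-s3-bucket/20250211/file.parquet",
    "aws-s3-bucket/staging/file.parquet",
    "aws-s3-bucket/20250211/extra/file.parquet",
]


def matching(pattern: re.Pattern, candidates: list[str]) -> list[str]:
    """Return the keys that match the whole pattern."""
    return [k for k in candidates if pattern.fullmatch(k)]


wildcard_hits = matching(wildcard, keys)  # accepts any single path segment
strict_hits = matching(strict, keys)      # accepts only digit-only segments
```

Under these assumptions the wildcard reading also picks up `staging/file.parquet`, which was never written by a partitioned insert, while the stricter pattern rejects it. Which behavior is correct is exactly the open question above.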

This would be useful for multi-file tables.

Use case

CREATE TABLE s3 (event_date DateTime64, val String) ENGINE=S3('aws-s3-bucket/{_partition_id}/file.parquet') PARTITION BY event_date SETTINGS s3_create_new_file_on_insert=1;

SELECT * FROM s3;

arthurpassos · 2025-02-11T13:35:00Z

This should go upstream.

arthurpassos mentioned this issue Feb 11, 2025

Add random and uuid macros to S3 table engine path #619

Open
