Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Implement S3 partitioned reads #618

Open
arthurpassos opened this issue Feb 11, 2025 · 1 comment
Open

Implement S3 partitioned reads #618

arthurpassos opened this issue Feb 11, 2025 · 1 comment

Comments

@arthurpassos
Copy link
Collaborator

Describe the new feature

Being able to read from S3 partitioned table.

Writes to partitioned table is already supported, but reading not. I suspect the reason is that it is not clearly defined how to determine if a file belongs to the table or not. Should it be treated as a star wildcard? Or we need to create a regex expression for the partition id?

The implementation will be useful for multi file tables

Use case

CREATE TABLE s3 (event_date DateTime64, val str) ENGINE=S3('aws-s3-bucket/{_partition_id}/file.parquet') PARTITION BY event_date SETTINGS s3_create_new_file_on_insert=1;

select * from s3;

Alternative solutions

Are there other ways to solve the problem this new feature addresses? Are any of them possible now? If so, how would the solution you're proposing be better?

Additional context

Add any other context about your proposed feature here.

@arthurpassos
Copy link
Collaborator Author

This should go to upstream

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant