Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Iceberg Overwrite partition mode #2606

Closed
danikv opened this issue Jan 11, 2024 · 1 comment · Fixed by #2632
Closed

Iceberg Overwrite partition mode #2606

danikv opened this issue Jan 11, 2024 · 1 comment · Fixed by #2632
Assignees
Labels
enhancement New feature or request

Comments

@danikv
Copy link

danikv commented Jan 11, 2024

Is your idea related to a problem? Please describe.
We currently use the functionality of wrangler.s3.to_parquet(mode="overwrite_partitions") to write our data to glue tables, that is because we run daily pipelines and write it to a daily partition, and if we want to backfill the table data with new columns we overwrite those partitions.
we want to start using iceberg as our format for schema evolution but we cannot achieve that functionality with wrangler.athena.to_iceberg()

Describe the solution you'd like
wrangler.athena.to_iceberg(mode="overwrite_partition" or "update") would be nice or maybe some other solution we can do now, I have been thinking about executing "INSERT OVERWRITE" statment and filter the overwrite to the partition I want to overwrite

EDIT

After looking at the source code I saw the merge_cols option in to_iceberg, this seems like a good fit but it lacks the ability to delete, it either update or insert but what happened when there are rows in the original tables which are not in the new table(we want to delete those rows).

@jaidisido
Copy link
Contributor

Thanks for raising this, it's being worked on in PR #2632

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

Successfully merging a pull request may close this issue.

3 participants