Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

wizard: feature request - add dataset.non_redistributable: true as default to private steps #3989

Open
spoonerf opened this issue Feb 17, 2025 · 0 comments
Assignees
Labels
priority 2 - important wizard Issues related to wizard tool

Comments

@spoonerf
Copy link
Contributor

One-liner

Let's make it a bit simpler to ensure private datasets are non-redistributable.

Context & details

Currently, I believe there are three things you need to do to ensure a dataset is non-redistributable in the ETL:

  • In the Snapshot .dvc have is_public: false
  • Use data-private and snapshot-private in the dag
  • Set the dataset metadata as non_redistributable: true, typically in garden

The first two are done automatically in Wizard when you select that the dataset is private. I think it would make sense to also make the third one automatic, unless there is some complication I'm not thinking of?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
priority 2 - important wizard Issues related to wizard tool
Projects
None yet
Development

No branches or pull requests

4 participants