Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Support for adding an "internal note" to a dataset #3821

Open
lucasrodes opened this issue Jan 9, 2025 · 5 comments
Open

Support for adding an "internal note" to a dataset #3821

lucasrodes opened this issue Jan 9, 2025 · 5 comments
Labels

Comments

@lucasrodes
Copy link
Member

One-liner

Option to add an internal note to datasets.

Description

Sometimes, datasets in Admin could benefit from having an internal description. E.g. in the case that the dataset is for external use, for experimenting, etc.

In the dataset edit page, it looks as if we had thought of it, since there is the "Internal notes" field.

Image

The help text for this field suggests that its content came from the dataset.description. However, I've been trying to see if this works (see #3784), but it doesn't seem to.

@lucasrodes lucasrodes added the enhancement New feature or request label Jan 9, 2025
@lucasrodes lucasrodes changed the title Add option to add internal note to a dataset Support for adding an "internal note" to a dataset Jan 9, 2025
@Marigold
Copy link
Collaborator

That field has essentially been deprecated. Internal notes were taken from the first source’s description.

The question is how much "internal" should it be. If we're fine with having it in ETL repository, we can add a new field internal_notes to DatasetMeta and create a new column in datasets table in MySQL. If we want it to be private, we'd have to make it work through metadata editing (that no one uses).

@larsyencken
Copy link
Contributor

Hey @lucasrodes, what would you use them for, vs having comments in the code, or notes at the chart level?

Note that even "internal" notes are public.

Is this about caveats, anomalies, or judgements of data quality, for example?

@lucasrodes
Copy link
Member Author

lucasrodes commented Jan 23, 2025

hey @larsyencken, it is mostly about caveats and minor details, that would be nice to make easily accessible to authors via the Admin. I think it is generally OK to have these be public. Otherwise we can think of something that can just be added through the admin.

More context:
Some datasets may be in production for a particular use other than just creating charts. E.g., specific collaboration with external providers, specific projects from an author, etc.

This issue has appeared when doing housekeeping and checking at datasets with no chart associated. See the following issues:

In general, if a dataset is kept for a good reason (even if not used by charts), it'd be nice for anyone in the team to know that reason. This can help doing housekeeping tasks.

Also, brief thread in slack →

@pabloarosado
Copy link
Contributor

We could have a new ETL dataset metadata field, but it could land on the grapher admin dataset "Internal notes" (which currently is not used).
Maybe it's something we could easily edit from the ETL dashboard, and display it there.
This would give us the possibility to have notes to remember on our upcoming update.

But maybe we'd need to have a few examples to justify how much we need this. @lucasrodes can you think of any example? Thanks.

@lucasrodes
Copy link
Member Author

lucasrodes commented Feb 6, 2025

Thanks for weighing in, @pabloarosado.

On your point:

But maybe we'd need to have a few examples to justify how much we need this

  • As I mentioned, it would have been extremely helpful to me when doing housekeeping tasks. When archiving datasets, is often not clear if the dataset is there for a reason, even if it doesn't have any chart associated. See work from https://github.com/owid/owid-issues/issues/1740 or https://github.com/owid/owid-issues/issues/1798. Some examples:
  • I've asked the team in case they have other instances where this could be helpful (see slack thread):
    • Hannah:

      Probably yes. In the past I have used it to flag to others if it’s a test dataset, or one that’s not yet ready to be promoted. I’d add something like “Do not share”.
      Not sure if others actually found this useful.

      I’ve also — on occasion — added notes like “Used for XXX, do not delete”.
      Although that’s less common recently.

    • Ed:

      Yes, definitely! We've used this in the past and would use it again if it was editable.

I'll keep adding examples as they appear.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

4 participants