Explorers: move to ETL as export steps #4072

Open
lucasrodes opened this issue Mar 4, 2025 · 4 comments

Comments

@lucasrodes
Member

lucasrodes commented Mar 4, 2025

Our landscape of explorers is very diverse. This issue studies how to bring all 45 of our public explorers into ETL as export steps, so that they can leverage our latest tooling.

One idea is to start by backporting some explorers to ETL.

Explorer types

There are three general categories:

  1. Some explorers rely on CSVs.
  2. Some are built using charts from our site.
  3. Some are based on references to indicators.

Within each category there are further distinguishing features (a single explorer may mix several of these):

  1. CSV-based
     • Is it in ETL?
     • Is it from GitHub? owid/owid-datasets, owid/importers, or another?
     • Is there processing in owid-content?
  2. Indicator-based
     • Uses variable IDs
     • Uses catalog paths
     • Is it from ETL (export://) or manually created?
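The checklist above could be captured as a small record per explorer, which might make the classification easier to query. This is only a sketch: the names `ExplorerKind`, `ExplorerProfile`, and every field name are hypothetical and do not come from ETL or the spreadsheet.

```python
from dataclasses import dataclass, field
from enum import Enum
from typing import List, Optional, Set


class ExplorerKind(Enum):
    """The three general categories listed above (names are hypothetical)."""
    CSV = "csv"
    CHART = "chart"
    INDICATOR = "indicator"


# Checklist questions per category; chart-based explorers have none listed above.
_QUESTIONS = {
    ExplorerKind.CSV: ["csv_in_etl", "csv_github_repo", "csv_processed_in_owid_content"],
    ExplorerKind.INDICATOR: ["uses_variable_ids", "uses_catalog_paths", "from_etl_export"],
}


@dataclass
class ExplorerProfile:
    name: str
    kinds: Set[ExplorerKind] = field(default_factory=set)
    # CSV-based features (None means "not checked yet")
    csv_in_etl: Optional[bool] = None
    csv_github_repo: Optional[str] = None  # e.g. "owid/owid-datasets"
    csv_processed_in_owid_content: Optional[bool] = None
    # Indicator-based features (None means "not checked yet")
    uses_variable_ids: Optional[bool] = None
    uses_catalog_paths: Optional[bool] = None
    from_etl_export: Optional[bool] = None

    def is_mixed(self) -> bool:
        """True when the explorer combines more than one category."""
        return len(self.kinds) > 1

    def unanswered(self) -> List[str]:
        """Checklist questions that apply to this explorer but are still unset."""
        return [
            question
            for kind in sorted(self.kinds, key=lambda k: k.value)
            for question in _QUESTIONS.get(kind, [])
            if getattr(self, question) is None
        ]


# Made-up example: a mixed explorer with only part of the checklist filled in.
profile = ExplorerProfile(
    name="example-explorer",
    kinds={ExplorerKind.CSV, ExplorerKind.INDICATOR},
    csv_in_etl=False,
    uses_catalog_paths=True,
)
```

Here `profile.is_mixed()` is true and `profile.unanswered()` lists the four questions that still need checking for this made-up explorer.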

I've tried classifying all our explorers in this spreadsheet. I've added the number of explorer views and number of user views, which should help us prioritise any work here.

@pabloarosado
Contributor

Thank you @lucasrodes for this exploration (on explorers!). Given that it's unclear how we'll tackle the transition, maybe we can close this issue but keep it as a reference. During the next shaping session we can decide how to move forward.

@lucasrodes
Member Author

Hi @pabloarosado, I'd rather keep this one open.

I'm currently working on TSV-to-JSON, which I think falls under the umbrella of this issue.

@pabloarosado
Contributor

All right @lucasrodes, thanks for rephrasing the description. Feel free to close it when you are done with the TSV-to-JSON, or let me know if you need any help.

@Marigold
Collaborator

Marigold commented Mar 7, 2025

@lucasrodes FYI this code by Lars could be useful (we talked about it yesterday). It parses TSV files and saves them as JSON to MySQL. It could also turn out to be useless, though, and we might want to use an entirely different structure, so don't feel any pressure.
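For illustration only, the TSV-to-JSON part might look roughly like the sketch below. It is not the code by Lars mentioned above: it assumes a plain TSV with a single header row, which the real explorer files may not follow, and it leaves out the MySQL step. The function name `tsv_to_json` and the sample columns are hypothetical.

```python
import csv
import io
import json


def tsv_to_json(tsv_text: str) -> str:
    """Convert TSV text with a header row into a JSON array of objects."""
    reader = csv.DictReader(io.StringIO(tsv_text), delimiter="\t")
    rows = [dict(row) for row in reader]
    return json.dumps(rows, indent=2)


# Hypothetical sample; the column names are made up for this example.
sample = "title\tindicator\nPopulation\t123\n"
converted = tsv_to_json(sample)
```

All values stay strings because the `csv` module does no type inference, so any numeric conversion would need a separate step.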
