Write an archiver for the EPA Priority Climate Action Plan Directory #524

Closed
3 of 10 tasks
zaneselvans opened this issue Jan 20, 2025 · 0 comments · Fixed by #544
zaneselvans commented Jan 20, 2025

Motivation and context:

EPA’s Priority Climate Action Plan (PCAP) Directory organizes data collected from 211 PCAPs submitted by states, Metropolitan Statistical Areas (MSAs), Tribes, and territories under EPA’s Climate Pollution Reduction Grants (CPRG) program. PCAPs are a compilation of each jurisdiction’s identified priority actions (or measures) to reduce greenhouse gas (GHG) emissions. The directory presents information from more than 30 data categories related to GHG inventories, GHG reduction measures, benefits for low-income and disadvantaged communities (LIDACs), and other PCAP elements.

The directory is designed to help CPRG planning grantees identify and leverage approaches within PCAPs to support the development of their Comprehensive Climate Action Plans (CCAPs) due under the CPRG program.

The PCAP Directory has three components:

  • Searchable data tables. PCAP data from states, MSAs, Tribes, and territories is presented in tables focused on PCAP elements (e.g., GHG reduction measures) that can be used to browse, filter, and extract information from the directory’s full dataset. Tables include page number citations and hyperlinks as appropriate to quickly access the primary source materials for each entry.
  • Summary tables. Summary tables highlight patterns and trends within the PCAP Directory data (e.g., economic sectors with the largest number of GHG reduction measures) across and within different jurisdictional levels.
  • Downloadable spreadsheets. The data collected by EPA from state, MSA, Tribal, and territorial PCAPs is available to download. This includes data featured in the searchable tables, data used to create the summary tables, and other PCAP data that does not appear in the directory web pages.

It looks like the "complete" source data is actually just the 3 downloadable spreadsheets, and they do not appear to have historical updates.

Links

Include a link to the dataset webpage and any metadata documentation.

The XLSX sheets for archiving are currently at:

However, the real source data is in the PDFs published by the individual state, MSA, and Tribal jurisdictions. Those are currently hosted on the EPA site and also need to be archived. The URLs for those documents are contained in the spreadsheets, but can also be obtained from the searchable table interfaces:
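
For the spreadsheet route, a minimal sketch of collecting the PDF links might look like the following. It assumes the XLSX sheets have already been read into rows of cell values by some other tool (not shown) and that the links appear as plain text in the cells; if they are stored as spreadsheet hyperlink objects instead, they would need to be pulled out differently. The function name, the regex, and the `example.gov` URLs are hypothetical and are not taken from the archiver codebase or the EPA site.

```python
import re
from collections.abc import Iterable

# Hypothetical pattern: any http(s) URL ending in ".pdf" (case-insensitive).
PDF_URL = re.compile(r"https?://[^\s\"'<>]+?\.pdf\b", re.IGNORECASE)


def extract_pdf_urls(rows: Iterable[Iterable[object]]) -> list[str]:
    """Return the unique PDF URLs found in any text cell, in first-seen order."""
    seen: dict[str, None] = {}  # dict keys preserve insertion order
    for row in rows:
        for cell in row:
            if not isinstance(cell, str):
                continue  # skip numbers, dates, and empty cells
            for url in PDF_URL.findall(cell):
                seen.setdefault(url, None)
    return list(seen)


# Made-up rows standing in for spreadsheet contents.
rows = [
    ["State A", "https://example.gov/docs/a.pdf", 12],
    ["MSA B", "See https://example.gov/docs/b.PDF (p. 4)", None],
    ["State A again", "https://example.gov/docs/a.pdf", 3.5],
]
pdf_urls = extract_pdf_urls(rows)
```

Deduplicating before download matters here because the same PCAP document is likely to be cited from many rows.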

Requirements for archiving

To be archived on Zenodo, a dataset must be:

  • published under an open license that permits reuse and redistribution
  • less than 50 GB in size (when zipped)
  • relevant to energy modelling and research
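
The size requirement can be checked before upload with something like the sketch below. It assumes the limit is counted in decimal gigabytes and that deflate compression is a reasonable stand-in for "when zipped"; the function names and the constant are hypothetical and not part of the archiver's actual code.

```python
import zipfile
from pathlib import Path

# Assumption: the 50 GB limit is counted in decimal gigabytes.
MAX_ZIPPED_BYTES = 50 * 10**9


def zipped_size(paths: list[Path], archive: Path) -> int:
    """Write the files into a deflate-compressed zip and return its size in bytes."""
    with zipfile.ZipFile(archive, "w", compression=zipfile.ZIP_DEFLATED) as zf:
        for path in paths:
            zf.write(path, arcname=path.name)
    return archive.stat().st_size


def fits_size_limit(
    paths: list[Path], archive: Path, limit: int = MAX_ZIPPED_BYTES
) -> bool:
    """Return True if the zipped files are strictly smaller than the limit."""
    return zipped_size(paths, archive) < limit
```

PDFs are usually already compressed, so the zipped total for the PCAP documents will probably be close to the sum of the raw file sizes.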

Checklist for archive creation

Based on the README documentation on creating a new archive:

Links to published archives:

Include a link to the published sandbox archive for review.
