Skip to content

Commit

Permalink
Post v2024.10.0 release errata cleanup (#3917)
Browse files Browse the repository at this point in the history
* Consolidate timeseries row groups in VCE RARE

* Fix bad S3 download link for VCE RARE parquet file.

* Update docs/data_access.rst

Co-authored-by: E. Belfer <[email protected]>

---------

Co-authored-by: E. Belfer <[email protected]>
  • Loading branch information
zaneselvans and e-belfer authored Oct 20, 2024
1 parent fd4c0a2 commit 878b122
Show file tree
Hide file tree
Showing 4 changed files with 25 additions and 9 deletions.
16 changes: 14 additions & 2 deletions README.rst
Original file line number Diff line number Diff line change
Expand Up @@ -29,10 +29,22 @@ The Public Utility Data Liberation Project (PUDL)
:alt: Schedule a 1-on-1 chat with us about PUDL.
.. |mastodon| image:: https://img.shields.io/mastodon/follow/110855618428885893?domain=https%3A%2F%2Fmastodon.energy&style=social&color=%23000000&link=https%3A%2F%2Fmastodon.energy%2F%40catalystcoop
:target: https://mastodon.energy/@catalystcoop
:alt: Follow CatalystCoop on Mastodon
:alt: Follow Catalyst Cooperative on Mastodon
.. |linkedin| image:: https://img.shields.io/badge/LinkedIn-0077B5?style=flat&logo=linkedin&logoColor=white
:target: https://linkedin.com/company/catalyst-cooperative/
:alt: Follow Catalyst Cooperative on LinkedIn
.. |bluesky| image:: https://img.shields.io/badge/Bluesky-0285FF?logo=bluesky&logoColor=fff&style=flat
:target: https://bsky.app/profile/catalyst.coop
:alt: Follow @catalyst.coop on BlueSky
.. |kaggle| image:: https://img.shields.io/badge/Kaggle-20BEFF?style=flat&logo=Kaggle&logoColor=white
:target: https://www.kaggle.com/datasets/catalystcooperative/pudl-project
:alt: The PUDL Dataset on Kaggle
.. |aws| image:: https://img.shields.io/badge/Amazon_AWS-FF9900?style=flat&logo=amazonaws&logoColor=white
:target: https://registry.opendata.aws/catalyst-cooperative-pudl/
:alt: PUDL in the AWS Open Data Registry

|repo-status| |pytest| |codecov| |rtd| |ruff| |pre-commit-ci| |zenodo-doi|
|office-hours| |mastodon|
|office-hours| |mastodon| |linkedin| |bluesky| |kaggle| |aws|

What is PUDL?
-------------
Expand Down
2 changes: 1 addition & 1 deletion docs/data_access.rst
Original file line number Diff line number Diff line change
Expand Up @@ -130,7 +130,7 @@ so we have moved to publishing all our hourly tables using the compressed, colum
* `FERC-714 Hourly Estimated State Demand <https://s3.us-west-2.amazonaws.com/pudl.catalyst.coop/nightly/out_ferc714__hourly_estimated_state_demand.parquet>`__
* `FERC-714 Hourly Planning Area Demand <https://s3.us-west-2.amazonaws.com/pudl.catalyst.coop/nightly/out_ferc714__hourly_planning_area_demand.parquet>`__
* `GridPath RA Toolkit Hourly Available Capacity Factors <https://s3.us-west-2.amazonaws.com/pudl.catalyst.coop/nightly/out_gridpathratoolkit__hourly_available_capacity_factor.parquet>`__
* `VCE Resoruce Adequacy Renewable Energy Dataset <https://s3.us-west-2.amazonaws.com/pudl.catalyst.coop/nightly/ out_vcerare__hourly_available_capacity_factor.parquet>`__
* `VCE Resource Adequacy Renewable Energy (RARE) Dataset <https://s3.us-west-2.amazonaws.com/pudl.catalyst.coop/nightly/out_vcerare__hourly_available_capacity_factor.parquet>`__

Raw FERC DBF & XBRL data converted to SQLite
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
Expand Down
8 changes: 4 additions & 4 deletions src/pudl/metadata/resources/vcerare.py
Original file line number Diff line number Diff line change
Expand Up @@ -42,19 +42,19 @@
),
"schema": {
"fields": [
"state",
"county_or_lake_name",
"datetime_utc",
"hour_of_year",
"report_year",
"hour_of_year",
"county_id_fips",
"county_or_lake_name",
"state",
"latitude",
"longitude",
"capacity_factor_solar_pv",
"capacity_factor_onshore_wind",
"capacity_factor_offshore_wind",
],
"primary_key": ["datetime_utc", "state", "county_or_lake_name"],
"primary_key": ["state", "county_or_lake_name", "datetime_utc"],
},
"sources": ["vcerare"],
"field_namespace": "vcerare",
Expand Down
8 changes: 6 additions & 2 deletions src/pudl/transform/vcerare.py
Original file line number Diff line number Diff line change
Expand Up @@ -282,8 +282,12 @@ def out_vcerare__hourly_available_capacity_factor(
for df_name, df in raw_dict.items()
}
# Combine the data and perform a few last cleaning mechanisms
return _combine_all_cap_fac_dfs(clean_dict).pipe(
_combine_cap_fac_with_fips_df, fips_df
# Sort the data by primary key columns to produce compact row groups
return (
_combine_all_cap_fac_dfs(clean_dict)
.pipe(_combine_cap_fac_with_fips_df, fips_df)
.sort_values(by=["state", "county_or_lake_name", "datetime_utc"])
.reset_index(drop=True)
)


Expand Down

0 comments on commit 878b122

Please sign in to comment.