Skip to content

Releases: catalyst-cooperative/pudl

PUDL 0.5.0

12 Nov 05:43
v0.5.0
5404710
Compare
Choose a tag to compare

Update to include 2020 annual data

See the more extensive release notes in our documentation.

Merged Pull Requests

New Contributors

Full Changelog: v0.4.0...v0.5.0

PUDL 0.4.0

16 Aug 18:32
v0.4.0
bea27ab
Compare
Choose a tag to compare

This is our first release in more than a year and a half, and it contains lots of new data and analyses (and breaking changes...) but it doesn't yet include 2020 datasets for FERC and EIA.

See the complete v0.4.0 release notes for details.

Merged Pull Requests

Read more

v0.3.2: Integration of EIA 860 data for 2009-2010

17 Feb 18:55
v0.3.2
3c689d3
Compare
Choose a tag to compare

The primary changes in this release:

  • The 2009-2010 data for EIA 860 have been integrated, including updates
    to the data validation test cases.
  • Output tables are more uniform and less restrictive in what they
    include, no longer requiring PUDL Plant & Utility IDs in some tables.
  • This release was used to compile v1.1.0 of the PUDL Data Release, which is archived at Zenodo under this DOI: https://doi.org/10.5281/zenodo.3672068

With this release, the EIA 860 & 923 data now (finally!) cover the same span of time. We do not anticipate integrating any older EIA 860 or 923 data at this time.

v0.3.1: Bug fixes required for PUDL data release

06 Feb 00:37
v0.3.1
6e91751
Compare
Choose a tag to compare
A couple of minor bugs were found in the preparation of the first PUDL
data release:
* No maximum version of Python was being specified in setup.py. PUDL
  currently only works on Python 3.7, not 3.8.
* epacems_to_parquet conversion script was erroneously attempting to
  verify the availability of raw input data files, despite the fact that
  it now relies on the packaged post-ETL epacems data. Didn't catch this
  before since it was always being run in a context where the original
  data was lying around... but that's not the case when someone just
  downloads the released data packages and tries to load them.

v0.3.0: 2020 Q1 PUDL release in support of data archiving

30 Jan 18:58
22879b6
Compare
Choose a tag to compare
This release is mostly about getting the infrastructure in place to do
regular data releases via Zenodo, and updating ETL with 2018 data.

Added lots of data validation / quality assurance test cases in
anticipation of archiving data. See the pudl.validate module for more
details.

New data since v0.2.0 of PUDL:
* EIA Form 860 for 2018
* EIA Form 923 for 2018
* FERC Form 1 for 1994-2003 and 2018 (select tables)

We removed the FERC Form 1 accumulated depreciation table from PUDL
because it requires detailed row-mapping in order to be accurate across
all the years. It and many other FERC tables will be integrated soon,
using new row-mapping methods.

Lots of new plants and utilities integrated into the PUDL ID mapping
process, for the earlier years (1994-2003).  All years of FERC 1 data
should be integrated for all future ferc1 tables.

Command line interfaces of some of the ETL scripts have changed, see
their help messages for details.

v0.2.0: Data package based output without PostgreSQL

18 Sep 02:35
v0.2.0
fd2f867
Compare
Choose a tag to compare
This is the first release of PUDL to generate data packages as the
canonical output, rather than loading data into a local PostgreSQL
database. The data packages can then be used to generate a local SQLite
database, without relying on any software being installed outside of
the Python requirements specified for the catalyst.coop package.

This change will enable easier installation of PUDL, as well as
archiving and bulk distribution of the data products in a platform
independent format.

v0.1.0: Reference release of PUDL using PostgreSQL

12 Sep 23:37
v0.1.0
81f4692
Compare
Choose a tag to compare
This is the only release of PUDL that will be made that makes use of
PostgreSQL. It is provided for reference, in case there are users
relying on this setup who need access to a well defined release.

v0.1.0rc1: RC1 for PostgreSQL based legacy PUDL release

12 Sep 20:09
v0.1.0rc1
6ecf00b
Compare
Choose a tag to compare
v0.1 of PUDL will be the only release we make of the PostgreSQL based
system, mainly for archival / reference purposes, in case there are
users who need to be able to install this version to support their
existing systems while transitioning to the datapackage / SQLite
version, which will be released within the next couple of days as v0.2

v0.1.0a4: Fixed Windows console UTF-8 encoding issue.

12 Sep 02:46
v0.1.0a4
65a81af
Compare
Choose a tag to compare
Output from the pudl_setup script was generating an unprintable
character on the Windows console, causing a UnicodeEncodingError,
revealed by the conda-forge tests. This release fixes that output.

v0.1.0a3: Updated/simplified specification of dependencies.

11 Sep 23:16
v0.1.0a3
7b6d073
Compare
Choose a tag to compare
The previous release used extras_require in setup.py to specify a couple
of packages required for dealing with parquet files. This prevented the
parquet functionality from being available after a "vanilla" pip install
of the package, which was confusing for test users. The parquet packages
have now been added to install_requires, and compilation issues were
worked around by making their installation conditional in the
readthedocs build.