Pit templates #12

elray1 · 2024-10-22T21:04:48Z

This PR makes it so that templates for the Shaake shuffle can be constructed based on PIT values from past forecasts rather than just past observed data. This is also an intermediate step toward estimating copulas.

elray1 · 2024-10-22T21:08:16Z

tests/postpredict/dependence/conftest.py

@@ -167,6 +167,6 @@ def obs_data():
        "location": ["a"] * 20 + ["b"] * 20,
        "population": [100.0] * 10 + [150.0] * 10 + [200.0] * 10 + [250.0] * 10,
        "age_group": (["young"] * 10 + ["old"] * 10) * 2,
-        "date": [datetime.strptime("2020-01-01", "%Y-%m-%d") + timedelta(i) for i in range(10)] * 4,
+        "date": [datetime.strptime("2020-01-14", "%Y-%m-%d") + timedelta(i) for i in range(10)] * 4,


needed to update date here to get observed data that were in the same range of prediction target dates, so that I could test correct results based on PIT scores.

elray1 · 2024-10-23T00:31:59Z

src/postpredict/dependence.py

-                  reference_time_col: str = "reference_date",
-                  horizon_col: str = "horizon", pred_col: str = "value",
-                  idx_col: str = "output_type_id",


these have moved to the .fit method instead of .transform

elray1 · 2024-10-23T00:33:02Z

src/postpredict/dependence.py

-                                             horizon_col, idx_col, pred_col)
-        min_horizon = model_out[horizon_col].min()
-        max_horizon = model_out[horizon_col].max()
+        wide_model_out = self._pivot_horizon(model_out)


reference_time_col, horizon_col, idx_col, and pred_col are now saved as attributes of the object, so we no longer need to pass them around as arguments and we use self.horizon_col, etc.

elray1 · 2024-10-23T00:34:07Z

src/postpredict/dependence.py

+        if self.model_out_train is not None:
+            wide_model_out_train = self._pivot_horizon(self.model_out_train)
+        else:
+            wide_model_out_train = None


refactoring to-do: move this inside of self._build_train_X_Y. see #2

bsweger · 2024-10-23T19:55:55Z

src/postpredict/dependence.py

-        time_col: name of column in `df` that contains the time index.
-        obs_col: name of column in `df` that contains observed values.
-        feat_cols: names of columns in `df` with features
+        target_data_train: pl.DataFrame


Checked the docstring types against the types in the function parameter 👍

bsweger · 2024-10-23T19:59:42Z

tests/postpredict/dependence/test_build_train_X_Y.py

+
+
+def test_build_train_X_Y_pit_templates(obs_data, wide_model_out, monkeypatch):
+    # we use monkeypatch to remove abstract methods from the


This is a neat trick! At some point, it might be worth creating a fixture for this pattern, since it's in the code base quite a few times.

good idea, i filed issue #17 for that

bsweger

Thanks for the notes throughout--those were helpful.

The Python re-factoring and docstrings looked good. I also ran through the local setup instructions and ran the test suite--all is working as expected!

elray1 added 3 commits October 22, 2024 11:18

start outlining pit templates

be984b1

Merge branch 'main' into pit_templates

82ed5ba

templates based on PIT values

1aa203a

elray1 commented Oct 22, 2024

View reviewed changes

elray1 added 5 commits October 22, 2024 17:10

clean up some comments

72b5867

double quotes and import order

24b4d8b

rename self.df, add training set model outputs as argument to fit method

df4f1b3

_pivot_horizon uses object properties rather than named arguments

d2d1395

remove debugging print statements

b91b7ec

elray1 commented Oct 23, 2024

View reviewed changes

add pit templates results to notebook

f976d5d

bsweger reviewed Oct 23, 2024

View reviewed changes

bsweger approved these changes Oct 23, 2024

View reviewed changes

elray1 merged commit de8e7d6 into main Oct 28, 2024
1 check passed

elray1 deleted the pit_templates branch October 28, 2024 12:59

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Pit templates #12

Pit templates #12

elray1 commented Oct 22, 2024 •

edited

Loading

elray1 Oct 22, 2024 •

edited

Loading

elray1 Oct 23, 2024

elray1 Oct 23, 2024

elray1 Oct 23, 2024

bsweger Oct 23, 2024

bsweger Oct 23, 2024

elray1 Oct 28, 2024

bsweger left a comment



		def test_build_train_X_Y_pit_templates(obs_data, wide_model_out, monkeypatch):
		# we use monkeypatch to remove abstract methods from the

Pit templates #12

Pit templates #12

Conversation

elray1 commented Oct 22, 2024 • edited Loading

elray1 Oct 22, 2024 • edited Loading

Choose a reason for hiding this comment

elray1 Oct 23, 2024

Choose a reason for hiding this comment

elray1 Oct 23, 2024

Choose a reason for hiding this comment

elray1 Oct 23, 2024

Choose a reason for hiding this comment

bsweger Oct 23, 2024

Choose a reason for hiding this comment

bsweger Oct 23, 2024

Choose a reason for hiding this comment

elray1 Oct 28, 2024

Choose a reason for hiding this comment

bsweger left a comment

Choose a reason for hiding this comment

elray1 commented Oct 22, 2024 •

edited

Loading

elray1 Oct 22, 2024 •

edited

Loading