Function To Cast InferenceData Into `tidy_draws` Format #36

AFg6K7h4fhy2 · 2024-10-28T14:20:40Z

For the scope of this PR, please refer to issue #18 .

…medium

…dy_draws

…nferencedata-into-tidy_draws-format

…draws-format

AFg6K7h4fhy2 · 2025-02-04T20:44:10Z

Requested review from @dylanhmorris. Once a review is received, the author will request another review from Damon.

Beyond the tidy connections, the author is also looking for comments on the changes made in the pre-commit configuration file, along with the pre-commit related edits across other files.

dylanhmorris

Thanks @AFg6K7h4fhy2. Getting close but needs a few tweaks.

.pre-commit-config.yaml

forecasttools/__init__.py

forecasttools/daily_to_epiweekly.py

forecasttools/idata_to_tidy.py

dylanhmorris · 2025-02-04T21:11:24Z

forecasttools/idata_to_tidy.py

+                    ((pl.col(".chain") - 1) * pl.col("draws_per_chain"))
+                    + pl.col(".iteration")
+                ).alias(".draw")
+            )


Need to drop "draws_per_chain", but also it's not a given that all chains will have the same number of draws. Instead, more robust do compute this as .iteration + <n_draws_in_all_previous_chains>. Many ways to do that in polars.

forecasttools/idata_w_dates_to_df.py

dylanhmorris · 2025-02-04T21:18:41Z

tests/test_idata_to_tidy.py

+
+@pytest.fixture
+def mock_inference_data():
+    np.random.seed(42)


Running np.random.seed changes global state. Better practice to do something like this https://builtin.com/data-science/numpy-random-seed

dylanhmorris · 2025-02-04T21:19:08Z

tests/test_idata_to_tidy.py

+    posterior_predictive = xr.Dataset(
+        {
+            "observed_hospital_admissions": ("chain", np.random.randn(2, 100)),
+        },
+        coords={"chain": [0, 1]},
+    )


Why not run the test on the provided inference_data_1.nc? Or are you planning to remove it?

I am on the fence about removing it. Seems good to have a canonical pyrenew-hew .nc file on hand esp. given that forecasttools-py does / will even more so interface abundantly with pyrenew models. On the other hand, having adequate and general idata / xarray representations seems good for testing too. I do not know if the latter must exist at the cost of the former. I lean towards having both, with the .nc file perhaps being used in notebooks and the "fake" idatas being used for testing.

dylanhmorris · 2025-02-04T21:20:08Z

tests/test_idata_to_tidy.py

+        col in df.columns
+        for col in [".chain", ".draw", ".iteration", "variable", "value"]
+    )
+


Would be good to check that individual values are as expected, not just that the draws are unique and one for each row.

… readme

…nferencedata-into-tidy_draws-format

Co-authored-by: Dylan H. Morris <[email protected]>

.pre-commit-config.yaml

…nferencedata-into-tidy_draws-format

dylanhmorris · 2025-02-11T16:22:40Z

pyproject.toml

@@ -42,6 +43,8 @@ patsy = "^0.5.6"
 nbformat = "^5.10.4"
 nbclient = "^0.10.0"
 jupyter = "^1.1.1"
+pandas = "^2.2.3"
+metaflow = "^2.13.9"


Do you want this?

…nferencedata-into-tidy_draws-format

initial commit for this PR; begin skeleton experimentation file

3865001

AFg6K7h4fhy2 self-assigned this Oct 28, 2024

AFg6K7h4fhy2 linked an issue Oct 28, 2024 that may be closed by this pull request

Function to cast InferenceData into tidy_draws format #18

Open

AFg6K7h4fhy2 added feature A new tool or utility being added. High Priority A task that is of higher relative priority. labels Oct 28, 2024

AFg6K7h4fhy2 added this to the [October 28, November 8] milestone Oct 28, 2024

AFg6K7h4fhy2 added Medium Priority A task that is of medium relative priority. and removed High Priority A task that is of higher relative priority. labels Oct 28, 2024

AFg6K7h4fhy2 added 5 commits October 28, 2024 15:27

some unfinished experimentation code; priority status change high to …

763355e

…medium

add first semi-failed attempt at converting entire idata object to ti…

44e7fe2

…dy_draws

add attempt at option 2

31c7b72

slightly modify spread draws example

9a87902

more minor changes to tidy draws notebook

c632ae8

AFg6K7h4fhy2 mentioned this pull request Nov 6, 2024

Utilities Pipeline #16

Open

light edits during DHM convo

123ad51

AFg6K7h4fhy2 modified the milestones: [October 28, November 8], [November 11, November 22] Nov 8, 2024

AFg6K7h4fhy2 modified the milestones: [November 11, November 22], [November 25, December 6] Nov 22, 2024

AFg6K7h4fhy2 added 5 commits November 25, 2024 10:04

Merge remote-tracking branch 'origin/main' into 18-function-to-cast-i…

a3c2d17

…nferencedata-into-tidy_draws-format

Merge remote-tracking branch 'origin/main' into 18-function-to-cast-i…

f44a6ee

…nferencedata-into-tidy_draws-format

Merge remote-tracking branch 'origin/main' into 18-function-to-cast-i…

cb883e3

…nferencedata-into-tidy_draws-format

Merge remote-tracking branch 'origin/main' into 18-function-to-cast-i…

df922d4

…nferencedata-into-tidy_draws-format

Merge remote-tracking branch 'origin/main' into 18-function-to-cast-i…

21968be

…nferencedata-into-tidy_draws-format

AFg6K7h4fhy2 modified the milestones: [November 25, December 6], [December 9, December 20] Dec 9, 2024

AFg6K7h4fhy2 added 4 commits December 9, 2024 11:22

Merge remote-tracking branch 'origin/main' into 18-function-to-cast-i…

7dcd7d3

…nferencedata-into-tidy_draws-format

a DB conversion attempt

718ba85

Merge remote-tracking branch 'origin/main' into 18-function-to-cast-i…

7394d4d

…nferencedata-into-tidy_draws-format

begin references file; create external program folder

4a77d50

AFg6K7h4fhy2 added 10 commits February 3, 2025 14:45

pre-commit edits

2c34af7

more prudent usage of ruff; pre-commit fixes

dba6e7a

revert version edits from black error

e1cdbb9

use selectors; finally, nice to figure that out

9884aa9

fix some conflicts with daily to epiweekly

90f5484

Merge branch 'main' into 18-function-to-cast-inferencedata-into-tidy_…

683a7bf

…draws-format

fix pre-commit errors

3547ab6

add pivot to make life easier; not sure if to aggregate by first

2999ec3

further debugging edits

04d6a50

draws and iterations debugged

cdbe464

AFg6K7h4fhy2 requested a review from dylanhmorris February 4, 2025 20:43

revert versioning in test yaml

987c273

dylanhmorris requested changes Feb 4, 2025

View reviewed changes

AFg6K7h4fhy2 and others added 7 commits February 5, 2025 11:13

update location table; add united states data; update descriptions in…

8361c36

… readme

Merge remote-tracking branch 'origin/main' into 18-function-to-cast-i…

41a8136

…nferencedata-into-tidy_draws-format

remove extraneous united states parquet call

bb55bf4

Update forecasttools/idata_to_tidy.py

1cdef9d

Co-authored-by: Dylan H. Morris <[email protected]>

Update forecasttools/idata_to_tidy.py

519c048

Co-authored-by: Dylan H. Morris <[email protected]>

fix docstring; fix chain equation

1019ac0

revert ensure listlike import

92c34bd

This was referenced Feb 6, 2025

Correct Type-Checking #61

Open

Namespace Resolution #62

Open

AFg6K7h4fhy2 added 2 commits February 6, 2025 09:37

remove tab ignoral

b3f6367

switch from melt to pivot

84bb99e

damonbayer reviewed Feb 7, 2025

View reviewed changes

.pre-commit-config.yaml Outdated Show resolved Hide resolved

AFg6K7h4fhy2 added 2 commits February 10, 2025 11:18

lightweight change to dev deps

1a94da4

Merge remote-tracking branch 'origin/main' into 18-function-to-cast-i…

7a737f6

…nferencedata-into-tidy_draws-format

dylanhmorris reviewed Feb 11, 2025

View reviewed changes

Merge remote-tracking branch 'origin/main' into 18-function-to-cast-i…

740f9c8

…nferencedata-into-tidy_draws-format

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Function To Cast InferenceData Into `tidy_draws` Format #36

Function To Cast InferenceData Into `tidy_draws` Format #36

AFg6K7h4fhy2 commented Oct 28, 2024 •

edited

Loading

AFg6K7h4fhy2 commented Feb 4, 2025 •

edited

Loading

dylanhmorris left a comment

dylanhmorris Feb 4, 2025 •

edited

Loading

dylanhmorris Feb 4, 2025

dylanhmorris Feb 4, 2025

AFg6K7h4fhy2 Feb 6, 2025

dylanhmorris Feb 4, 2025

dylanhmorris Feb 11, 2025

Function To Cast InferenceData Into tidy_draws Format #36

Are you sure you want to change the base?

Function To Cast InferenceData Into tidy_draws Format #36

Conversation

AFg6K7h4fhy2 commented Oct 28, 2024 • edited Loading

AFg6K7h4fhy2 commented Feb 4, 2025 • edited Loading

dylanhmorris left a comment

Choose a reason for hiding this comment

dylanhmorris Feb 4, 2025 • edited Loading

Choose a reason for hiding this comment

dylanhmorris Feb 4, 2025

Choose a reason for hiding this comment

dylanhmorris Feb 4, 2025

Choose a reason for hiding this comment

AFg6K7h4fhy2 Feb 6, 2025

Choose a reason for hiding this comment

dylanhmorris Feb 4, 2025

Choose a reason for hiding this comment

dylanhmorris Feb 11, 2025

Choose a reason for hiding this comment

Function To Cast InferenceData Into `tidy_draws` Format #36

Function To Cast InferenceData Into `tidy_draws` Format #36

AFg6K7h4fhy2 commented Oct 28, 2024 •

edited

Loading

AFg6K7h4fhy2 commented Feb 4, 2025 •

edited

Loading

dylanhmorris Feb 4, 2025 •

edited

Loading