8 write create trends ensemble function #11

lshandross · 2024-11-14T17:16:03Z

No description provided.

lshandross · 2024-11-14T17:44:08Z

The lint check is only failing due to get_baseline_predictions() having a cyclomatic complexity of 16 after I fixed a validation. I don't feel like it's a great use of my time to refactor that function given that flu forecasting starts on Wednesday, so I filed an issue (#12) about refactoring and would like to merge this PR once everything else looks good.

elray1

I reviewed this and it basically looks good. Made 1 or 2 minor suggestions. Two bigger comments:

I had some questions around the temporal aggregation, but I thought we had discussed (or maybe I imagined discussing or planned to discuss and didn't follow through) that we were going to ditch the stuff in here that's related to temporal aggregation because the available data going forward in the near future will only be for the weekly scale. So I propose to essentially go ahead with what's here, regardless of any questions on this front. We just need to get something that handles a single temporal resolution of "weekly" in place.
My main question is actually how we're going to deal with the sampling. Two sub-questions on this:
1. I think that we should add an n_sim argument to this top level function to allow us to separate the concepts of (a) how many samples are generated from the predictive distribution for subsequent summarizing into predictive quantiles; and (b) how many samples are returned if we have a sample output type
2. Because hubEnsembles::linear_pool only handles the simplest case where the number of samples for the ensemble is unrestricted, we need to figure out a way to deal with the requirement of getting to 100 samples for the ensemble that we submit. I see two options: (a) update hubEnsembles::linear_pool to allow for specification of a target number of samples for the ensemble, doing sampling if necessary. (b) within this function, pick the number of samples to output from each baseline model so that in total you end up with 100 samples. In our setting with a target of 100 samples for the ensemble and 8 baseline models, we would have 4 baselines generate 12 samples and 4 generate 13. I'm ok with going with option (b) in the short term if it's easier, but note that we will want to do option (a) in the near future as well.

elray1 · 2024-11-14T18:27:37Z

R/aggregate_daily_to_weekly.R

Ideally, it would be good to have some tests for this function. However, I am going to file an issue for that as something we could do later because: (a) it probably works! (b) I feel like we don't even know what our data will look like and if we will use this.

R/create_trends_ensemble.R

elray1 · 2024-11-14T18:47:17Z

tests/testthat/test-create_trends_ensemble.R

+                           quantile_levels = c(.1, .5, .9),
+                           n_samples = NULL,
+                           return_baseline_predictions = FALSE) |>
+    expect_error(regex = "Currently `component_variations` may only contain one unique temporal resolution value",


Low priority -- is this true? I thought I saw stuff about splitting by the temporal resolution up above.

Yes, because the current ensembling for samples makes it so that having multiple temporal resolution values results in different numbers of samples per model, but I wanted to put the check closer to the top to avoid unnecessary calculations. This validation will be removed later on once support is added, but I figured I would just put it in until then

lshandross · 2024-11-14T20:51:11Z

I'm not sure if we discussed the topic of ditching all temporal aggregation, but I thought of implementing it for the final ensemble function so that we would have a way to regenerate the trends ensemble from past seasons. Open to having further discussion about this later.
Took care of adding the n_sim arg to all functions instead of just the lowest level. As for the other point, I may do option b for this week if I run out of time before Wednesday (currently working on a script for generating the trends ensemble and have some other functions for this package drafted), but I will put option a as my next highest priority with a hope of getting it implemented within the first few weeks of FluSight submission.

elray1

lgtm!

lshandross added 7 commits November 13, 2024 17:39

Write create_trends_ensemble() function

4df345f

Write helper aggregate_daily_to_weekly() for target data

99382e7

Update DESCRIPTION

a8e58ce

Update NAMESPACE

b8c9f3d

Write create_trends_ensemble() tests

8f6483b

Add component_variations validation

a7b94df

Fix quantiles validation

f1ad4e7

lshandross linked an issue Nov 14, 2024 that may be closed by this pull request

Write create_trends_ensemble function #8

Closed

Update DESCRIPTION

2bc003b

lshandross requested a review from elray1 November 14, 2024 17:46

elray1 requested changes Nov 14, 2024

View reviewed changes

lshandross added 2 commits November 14, 2024 15:29

Fix documentation typo

4d752b1

Update docs

d6d9094

Make n_sim a parameter for all functions

47f1c95

elray1 approved these changes Nov 14, 2024

View reviewed changes

lshandross merged commit 8c94339 into main Nov 14, 2024
5 of 6 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

8 write create trends ensemble function #11

8 write create trends ensemble function #11

lshandross commented Nov 14, 2024

lshandross commented Nov 14, 2024 •

edited

Loading

elray1 left a comment

elray1 Nov 14, 2024

elray1 Nov 14, 2024

lshandross Nov 14, 2024

lshandross commented Nov 14, 2024 •

edited

Loading

elray1 left a comment

8 write create trends ensemble function #11

8 write create trends ensemble function #11

Conversation

lshandross commented Nov 14, 2024

lshandross commented Nov 14, 2024 • edited Loading

elray1 left a comment

Choose a reason for hiding this comment

elray1 Nov 14, 2024

Choose a reason for hiding this comment

elray1 Nov 14, 2024

Choose a reason for hiding this comment

lshandross Nov 14, 2024

Choose a reason for hiding this comment

lshandross commented Nov 14, 2024 • edited Loading

elray1 left a comment

Choose a reason for hiding this comment

lshandross commented Nov 14, 2024 •

edited

Loading

lshandross commented Nov 14, 2024 •

edited

Loading