-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Ideas to simplify the generative model for variant data #252
Comments
If we go with kind of model, e.g. fit on the introduction times rather than the whole variant time series, then I would suggest not including |
Adding options: We could consider the Dirichlet-multinomial which is like the "noisey" version of the multinomial. |
Fitting on the early growth rate (rather than introduction time)Another alternative discussed f2f. Instead, of i) fitting on the full variant frequency time series or ii) just the introduction time (see above), we could connect the early exponential growth in new variant freq to growth in the hospitalisation following from the new variant. |
I really like all of these ideas (though noting I don't have a totally clear view of what is in currently). A few comments:
Fitting to the early exponential growth seems like a strong option. The caveat here is that often the early growth is higher than the long run growth due to biased sampling etc.
It sounds like possible model fit times are coming from model data mismatch. Could it also be the case that the current model is too rigid (especially if it is making hard assumptions about the link between variants and waves). Perhaps adding a little more flexibility here could help without major model changes (if this isn't the case it will just further slow things down as the model will be more complex) |
Yes, I think this is a problem which has been revealed in posterior predictive plotting (sorry I don't have a link as I've only seen them f2f). |
An extra consideration is that whilst the hospitalisation data is specific to the modelling scale (that is state level). The variant freq data is actually pooled across CDC region, I think this underlines that a more flexible generative approach would be a good idea here. |
#itssorandom |
Why this issue?
In f2f discussion, we talked about the difficulty in having a generative model for both weekly hospitalisation and bi-weekly variant frequencies simultaneously. There are a few possible reasons why this is hard to achieve, for example, the variant frequencies are from a region and might not represent the local dynamics of a state.
On the other hand, the intrinsic dynamics of the model requires new variants as the primary mechanism whereby a new wave is trigger; that is the mechanistic assumption is that new waves are primarily driven by immunity loss.
Simpler generative models for variants
We need a model for the arrival of a new variant because its intrinsic to the dynamics, but we can simplify what this model is generating.
The simplest model I can think of that gives us what we need for the dynamics, and connects to the data, has these features:
The text was updated successfully, but these errors were encountered: