Scaling of Data #34
The Student-t likelihood scales with the squared mean distance, which is non-linear w.r.t. data scaling. See bayesian_changepoint_detection/bayesian_changepoint_detection/offline_likelihoods.py, line 138 at 2dd95f5.
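One way to see the scale dependence is to write out the Student-t marginal likelihood of a segment under a Normal-Inverse-Gamma prior, which is the standard construction behind this kind of offline detector. The hyperparameters below (`mu0`, `kappa0`, `alpha0`, `beta0`) are illustrative choices, not necessarily the package's defaults. Because `beta0` is a fixed constant, multiplying the data by `c` does not simply shift the log likelihood by the Jacobian term `n*log(c)`; the prior acts as an implicit scale reference:

```python
import numpy as np
from math import lgamma, log, pi

def student_t_log_marginal(x, mu0=0.0, kappa0=1.0, alpha0=1.0, beta0=1.0):
    """Log marginal likelihood of a segment under a Normal-Inverse-Gamma
    prior (hyperparameters here are illustrative, not the package defaults)."""
    n = len(x)
    xbar = np.mean(x)
    kappan = kappa0 + n
    alphan = alpha0 + n / 2.0
    # beta0 is a fixed scale hyperparameter: it does not rescale with the data,
    # which is what breaks scale invariance.
    betan = (beta0 + 0.5 * np.sum((x - xbar) ** 2)
             + kappa0 * n * (xbar - mu0) ** 2 / (2.0 * kappan))
    return (lgamma(alphan) - lgamma(alpha0)
            + alpha0 * log(beta0) - alphan * log(betan)
            + 0.5 * (log(kappan0 := kappa0) - log(kappan))
            - (n / 2.0) * log(2 * pi))

rng = np.random.default_rng(0)
x = rng.normal(0.0, 1.0, 50)
lp1 = student_t_log_marginal(x)
lp2 = student_t_log_marginal(10 * x)
# Exact scale invariance would give lp2 == lp1 - 50*log(10);
# with a fixed beta0 the two sides differ by a data-dependent amount.
print(lp1, lp2, lp2 - (lp1 - 50 * log(10)))
```

With an improper Jeffreys-style prior (`beta0 -> 0`) the discrepancy would vanish, which is one way to locate the "implicit prior" mentioned below.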
Intuitively that also makes sense: after rescaling, the difference between your generative models is different, and thus the probability of them being the same or different should change as well.
Thanks for the quick reply. The confusion for me is that the scale is often arbitrary; for example, there may be multiple ways to make some data dimensionless, yet they could yield vastly different results. My assumption so far was that I should always normalize over the entire time series. Is there some prior used in calculating the Student-t likelihood that I should keep in mind when scaling my data, or any other way to decide the scale?
Good question. I believe that mean centering your data is probably a good idea, but w.r.t. scaling I have to think a bit more. It probably has to do with an implicit prior somewhere, but I cannot pinpoint it right now.
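One pragmatic convention (an assumption on my part, not something the package prescribes) is to z-score the whole series before detection. That both mean-centers the data and pins the scale, so any constant rescaling of the raw input produces an identical series and therefore identical likelihoods:

```python
import numpy as np

def standardize(x):
    # z-score over the entire series; one possible convention,
    # not something the package itself enforces
    return (x - x.mean()) / x.std()

rng = np.random.default_rng(1)
x = rng.normal(size=200)
x[100:] += 3.0  # a mean shift halfway through

# A constant rescaling of the raw data is removed by standardization,
# so a detector fed the standardized series sees identical inputs:
assert np.allclose(standardize(x), standardize(7.5 * x))
```

This does not answer which scale is "right" in an absolute sense, but it makes results reproducible across unit choices.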
Hi,
I've noticed that the scaling of the data can have an effect on the result, but I am not sure why it would, and I can't find any reason for it in the code or references. Below I have the CP probabilities for the same data with and without a constant factor, and they are somewhat different.
Are there some assumptions about the input data I am missing?
Thanks