-
Notifications
You must be signed in to change notification settings - Fork 59
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Correctly interpolate seasons in Grouper #2019
base: main
Are you sure you want to change the base?
Conversation
I just realised that the factor of 1/6 is assuming that all seasons have the same length which in gregorian calendars is not necessarily true but I am not sure it matters too much at least the function should be smooth. |
Warning This Pull Request is coming from a fork and must be manually tagged |
Weirdly and contrary to what I showed yesterday, today I am still getting clear transitions as if there still wasn't any linear interpolation. |
@saschahofmann We recently changed the layout of xclim to use a |
I reinstalled xclim but I am still getting very similar results to before the "fix". You have any advice on where else I could look? |
Could it be that you have obsolete |
I managed to install the environment, for some reason I only had the branch "main" when I cloned the fork yesterday
import inspect
print(inspect.getsource(sdba.base.Grouper.get_index))
I'll try to have look later. Maybe the |
I am pretty sure that the |
It's simply from xclim import sdba
QM = sdba.EmpiricalQuantileMapping.train(
ref, hist, nquantiles=15, group="time.season", kind="+"
)
scen = QM.adjust(sim, extrapolation="constant", interp="nearest")
scen_interp = QM.adjust(sim, extrapolation="constant", interp="linear")
outd = {
"Reference":ref,
"Model - biased":hist,
"Model - adjusted - no interp":scen,
"Model - adjusted - linear interp":scen_interp,
}
for k,da in outd.items():
da.groupby("time.dayofyear").mean().plot(label=k)
plt.legend() This doesn't reproduce your figure however. It seems your figure above was matching the reference very well, better than what I have even with the linear interpolation. But it does get rid of obvious discontinuities. |
@coxipi I think only mention this in the original issue: my analysis is done with |
Yes, I have seen simlilar things by playing with the choice of how |
I see your point. Maybe the sufficiently high number of quantiles (20 or 50) and the fact that you average over 15 years is enough to make this smooth. If you look directly at the time series, the "nearest" should be less smooth? Anyways, great work, thanks a lot! |
I made the same error above when I commented the extrapolate haha... maybe I influenced your reading |
Hum, in the QDM case, the linear interpolation seems to have some issues? |
Dang now I see it too somehow I was focusing on the nearest. Let's see. |
Ok here the linearly interpolated I believe this is due to the problem I mentioned in December and better summarised by you Gonna think about this tomorrow. |
Do you have any resources on someone else doing this? The originally paper Cannon et al. (2015 ) doesn't seem to look at monthly or seasonal adjustments/ the only thing I could find was them saying to use a sliding window:
|
No, unfortunately, I'm searching right now. @aulemahal , Sascha fixed one problem, in the extrapolation, it was assumed that values of season/month would start at 0, but for season with periodic condition, it can go below zero, this needed a change. But now, we have the problem I describe , e.g.:
Do you agree this is a problem? Do you know if people explored specifically the use of QDM with seasons in the litterature? |
I was also thinking the other option for a fix for the extrapolation without needing to change the extrapolate function is changing the mapping of the seasons to start at 1 (so that cyclic_bounds would add 0 and 5). Not sure which of the two you prefer. I think the current fix might be more robust to future changes because it simply uses the coordinate values. |
I don't think I have ever read such a paper (neither for QDM nor for any other QMs). Maybe @huard remembers if we had sources in mind when implementing it ? My "fear" is that we implemented it because it was possible, because |
I am moving the discussion to #2048 and will leave the issue #2014 open for now. As discussed in the development meeting I will add warnings that link to that discussion. Where should I place these warnings? I guess I could raise a Once this is done, I suggest we merge this in since all changes still apply IMO. |
I discovered another issue when running QDM with a 360_day calendar. To reproduce you can just convert ref and hist with One of the issues is related to @aulemahal PR #2038, here I am using a different approach. |
And another fix for using sdba.Scaling in utils.pyL222 |
Merged @aulemahal fix and this is now ready for merging |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looks good! Thanks!
I think we can avoid a breaking change in test_timeseries
.
@@ -475,9 +477,10 @@ def interp_on_quantiles( | |||
return out | |||
|
|||
if prop not in xq.dims: | |||
xq = xq.expand_dims({prop: group.get_coordinate()}) | |||
prop_coords = group.get_coordinate(newx) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I don't remember this code enough, but is there a possibility to have prop in xq.dims
but not prop in yq.dims
? That would make the code fail as prop_coords
is declared only in the first if
.
calendar : str or None | ||
Whether to use a calendar. If a calendar is provided, cftime is used. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I approve this change, but however this function is public... Could we avoid the breaking change by simply adding calendar
as a new argument, keeping cftime
?
To avoid a breaking change and avoid having to pass two arguments in the new state, I suggest we simply ignore cftime
when calendar
is given ?
I would also mention this change in the changelog.
Pull Request Checklist:
number
) and pull request (:pull:number
) has been addedWhat kind of change does this PR introduce?
This PR adds a line to correctly interpolate seasonal values. It also changes the test_timeseries function that now accepts a
calendar
argument instead ofcftime
. Not providing it or providingNone
is equivalent tocftime=False
andcalendar='standard
to the previouscftime=True
. This allows for testing different calendar implementations e.g. 360_day calendars