Clarification on how metrics are calculated #19

Open
elephaint opened this issue Mar 3, 2025 · 3 comments
@elephaint

If I run the following:

import pandas as pd
df_timesfm = pd.read_csv("results/timesfm_2_0_500m/all_results.csv")
print(f"TimesFM MASE: {df_timesfm['eval_metrics/MASE[0.5]'].mean():.2f} \n"
      f"TimesFM CRPS: {df_timesfm['eval_metrics/mean_weighted_sum_quantile_loss'].mean():.2f}")

the output is:

TimesFM MASE: 1.71 
TimesFM CRPS: 0.25

whereas the leaderboard here states:

[Image: screenshot of the GIFT-Eval leaderboard]

Can you explain / detail how the leaderboard is calculated or point me to where it is explained?

@cuthalionn
Contributor

cuthalionn commented Mar 3, 2025

Hi @elephaint,

The results for each model are standardized by the Seasonal Naive results, and then we take the geometric mean across datasets for each model. You can find the details in [the source code for the leaderboard](https://huggingface.co/spaces/Salesforce/GIFT-Eval/tree/main/src).
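For anyone wanting to reproduce the leaderboard numbers locally, here is a minimal sketch of that aggregation. It assumes a per-dataset Seasonal Naive results file at results/seasonal_naive/all_results.csv and a "dataset" column as the join key; both the path and the column name are assumptions, and the leaderboard source linked above has the authoritative logic.

import numpy as np
import pandas as pd

metric = "eval_metrics/MASE[0.5]"  # same column as in the snippet above

# The Seasonal Naive path and the "dataset" join key are assumptions --
# check the leaderboard source for the exact file layout and column names.
df_model = pd.read_csv("results/timesfm_2_0_500m/all_results.csv")
df_naive = pd.read_csv("results/seasonal_naive/all_results.csv")

merged = df_model.merge(df_naive, on="dataset", suffixes=("_model", "_naive"))

# Standardize each dataset's score by Seasonal Naive, then take the geometric
# mean across datasets (Seasonal Naive itself therefore lands at 1.0).
relative = merged[f"{metric}_model"] / merged[f"{metric}_naive"]
geo_mean = np.exp(np.log(relative).mean())
print(f"Standardized geometric-mean MASE: {geo_mean:.2f}")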

@elephaint
Author

Ah, thanks, I completely overlooked that. Scrolling down further it becomes obvious once you see SeasonalNaive at 1.0. Thanks!

Tiny thing: I noticed that in your Naive notebook you actually use SeasonalNaive. So I assume that what the notebook in this repo calls naive is actually SeasonalNaive on the leaderboard? (The notebook produces a results table named naive, but the results are SeasonalNaive as far as I can tell.)

@cuthalionn
Contributor

No worries, I am glad that clears up the confusion!

In the notebook we actually use Naive because the predictor is set to NaivePredictor; we use Seasonal Naive as the fallback model. But the same notebook can easily be adapted for Seasonal Naive too. One would just need to create a SeasonalNaivePredictor(StatsForecastPredictor) predictor class and set the model type accordingly (a rough sketch is below).
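A rough sketch of that adaptation, assuming the StatsForecastPredictor wrapper defined in the notebook selects its statsforecast model via a class-level attribute (the attribute name below is an assumption; mirror whatever NaivePredictor actually does):

from statsforecast.models import SeasonalNaive

# Hypothetical sketch: StatsForecastPredictor is the wrapper class defined in
# the notebook, and `ModelType` is an assumed attribute name -- copy the
# mechanism NaivePredictor uses to pick its statsforecast model.
class SeasonalNaivePredictor(StatsForecastPredictor):
    ModelType = SeasonalNaive

Note that SeasonalNaive takes a season_length argument, so the wrapper would also need to pass each dataset's seasonality through when instantiating the model.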

So the terms Naive and Seasonal Naive on the leaderboard and in the repository represent their respective models.
