-
Notifications
You must be signed in to change notification settings - Fork 386
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add LoveDADataModule to the trainer tests #966
Conversation
Looks great! I'll try to figure out the failing tests, enjoy your vacation! |
Minimum tests have a pytorchlightning warning that gets treated as an error, I tried to ignore it but messed something up |
Just noticed that our data loader made their news: https://github.com/Junjue-Wang/LoveDA#news |
Okay, so looking into this more, I think it's strange to have a datamodule where test doesn't report an accuracy. I think we should instead split val (dataset) into val/test (datamodule) and move test (dataset) to predict (datamodule). We also use test during predict in the Inria datamodule, so there is precedence for this. In terms of split sizes, we have:
I think it makes the most sense to split val in half so that we have enough samples to train on and can still evaluate val/test accuracy. @nilsleh @Junjue-Wang do you have any opinions on this? |
Accurately, The test scores can be evaluated on the Codalab: https://codalab.lisn.upsaclay.fr/competitions/421#results |
Will the test set masks ever be made public? If we can only make predictions on test and can't evaluate them locally then the datamodule becomes pretty limited. |
I don't think so, even without test set masks you need the data module for training with torchgeo which is a massive plus. it will also make it easy to generate test predictions. |
Another option would be the following mapping:
This feels a bit redundant, although it's what inria does when test_split_pct == 0. |
Another option would be:
The datamodule wouldn't have a test_dataloader since there's no way to evaluate it anyway. |
96a7ccf
to
644bfdc
Compare
Depends on #975 to get the tests to pass. |
05269be
to
7fc74e0
Compare
* Add loveda to trainer tests * Delete direct loveda datamodule test * Ignoring deprecation warning for lightning * Remove ignore * test -> predict * Fix typo * Add comment explaining mismatch * More coverage Co-authored-by: Adam J. Stewart <[email protected]>
@adamjstewart the problem with LoveDA you likely ran into is that the test set doesn't have any masks, just imagery (which we accurately reflected when creating the test data)