Add Great African Food Company Crop Type Tanzania Dataset #511
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
This PR adds the Crop Type Tanzania dataset from Radiant MLHub.
It features time series data with polygon crop type annotations as segmentation masks. I have implemented it as a
GeoDataset
to allow for train/val/test splits based on geographical location and am looking to do that with other datasets of this type (also converting CV4A_Kenya_Crop_Type dataset to a GeoDataset). Following theTODO
incv4a_kenya_crop_type.py
this implementation is populating the rtree index by usingstac.json
files. The__getitem__
method returns a tensor with dimensionstime x num_bands x height x width
. This dataset has both rasters as input, as well as vector annotations, so sort of a hybrid betweenRasterDataset
andVectorDataset
.Dataset Features:
Dataset Format:
Issues:
RandomGeoSampler
will also sample time instances and a given bounding box query will not return all timesteps for a given geographical XY location, as it is maybe expected.Example
: