Empty point extraction job results should not be downloaded #250

kvantricht · 2025-01-17T13:15:19Z

We have quite a few empty parquet files resulting from point extraction jobs. The reason for this is that currently we have to buffer point-based extractions because occasional single-point jobs are not yet supported in openEO (Open-EO/openeo-geopyspark-driver#996)

Very small buffers sometimes do not contain any pixel center of a collection and the resulting extraction is nodata for all observations. These get dropped, finally resulting in an empty dataframe.

Things to do:

Change post-job action to detect empty dataframes and don't download these (they give downstream issues)
Make sure current point-extractions have square 5m buffer for the actual extraction, avoiding this getting dropped
waiting for final fix in openEO, after which none of the above should be essential anymore (load_stac: support Point features Open-EO/openeo-geopyspark-driver#996)

kvantricht · 2025-01-24T15:23:50Z

Should be addressed by #266

kvantricht self-assigned this Jan 17, 2025

kvantricht added this to the System V2 milestone Jan 20, 2025

kvantricht mentioned this issue Jan 24, 2025

Epic: system V2 features #251

Open

15 tasks

kvantricht closed this as completed Jan 24, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Empty point extraction job results should not be downloaded #250

Empty point extraction job results should not be downloaded #250

kvantricht commented Jan 17, 2025 •

edited

Loading

kvantricht commented Jan 24, 2025

Empty point extraction job results should not be downloaded #250

Empty point extraction job results should not be downloaded #250

Comments

kvantricht commented Jan 17, 2025 • edited Loading

kvantricht commented Jan 24, 2025

kvantricht commented Jan 17, 2025 •

edited

Loading