Add Ice-Wedge Network Centerline and Polygon Area, Bernard River Watershed, Banks Island Canada (2010-2020) #92

Open
eliasm56 opened this issue May 20, 2024 · 4 comments
Labels
layer Displaying a specific data product in the PDG portal pdg Permafrost Discovery Gateway priority: high Topography data layer category: topography

Comments

@eliasm56

I would like to add the following dataset published on Arctic Data Center to the PDG: https://arcticdata.io/catalog/view/doi%3A10.18739%2FA2GH9BB6M

It is fully documented on the ADC and contains:
(1) Geopackage with polyline layers with ice-wedge centerlines that represent the total length added to the tundra capillary system if all the mapped ice wedges were to degrade and form a connected network of trough-channels (i.e., capillary hydrological system).

(2) Polygon shapefile of IWP area and percentage of land cover within 1 kilometer (km) x 1 km rectangular grid cells covering the Bernard River Watershed.

@eliasm56 eliasm56 added pdg Permafrost Discovery Gateway layer Displaying a specific data product in the PDG portal priority: high Topography data layer category: topography labels May 20, 2024
@julietcohen

The files are archived (not public) on Datateam: /var/data/10.18739/A2FJ29F5F/

@julietcohen

Because the dataset had to be made public and then private again, it was given a new DOI: A2610VT6C. The data is now located on Datateam at: /var/data/10.18739/A2610VT6C

The paper associated with this dataset was published today! See here.

Exploration of polygon data

The polygon shapefile is small, just 2.48 MB, so it does not need to be tiled and can be uploaded to the PDG portal as a GeoJSON file. I reprojected the shapefile to EPSG:4326 and converted it to GeoJSON, both simple operations with geopandas:

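import geopandas as gpd

# Read the polygon shapefile (IWP area and cover per 1 km x 1 km grid cell)
data = gpd.read_file("/home/jcohen/capillary_iwp/IWP_area_grid_sum/IWP_sum_Banks_proj.shp")
# Reproject to EPSG:4326; to_crs returns a new GeoDataFrame by default
data_WGS84 = data.to_crs(epsg=4326)
# Write the reprojected data as GeoJSON
data_WGS84.to_file("/home/jcohen/capillary_iwp/IWP_area_grid_sum.json", driver="GeoJSON")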
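
One quick sanity check after a reprojection like the one above is to confirm that every coordinate in the output falls within valid longitude/latitude ranges. The following is a minimal sketch using only the standard library, not part of the original workflow; the function names and the one-cell example feature (with made-up coordinates) are assumptions for illustration.

```python
import json


def iter_positions(coords):
    """Yield (x, y) pairs from arbitrarily nested GeoJSON coordinate arrays."""
    if coords and isinstance(coords[0], (int, float)):
        yield coords[0], coords[1]
    else:
        for part in coords:
            yield from iter_positions(part)


def all_coords_in_lonlat_range(feature_collection):
    """Return True if every position has lon in [-180, 180] and lat in [-90, 90].

    Assumes geometries that carry a "coordinates" member (not GeometryCollection).
    """
    for feature in feature_collection["features"]:
        geometry = feature.get("geometry")
        if geometry is None:
            continue
        for lon, lat in iter_positions(geometry.get("coordinates", [])):
            if not (-180 <= lon <= 180 and -90 <= lat <= 90):
                return False
    return True


# Hypothetical single grid cell; the coordinates are made up for illustration.
example = json.loads("""
{
  "type": "FeatureCollection",
  "features": [
    {
      "type": "Feature",
      "properties": {"IWP_cover": 12.5},
      "geometry": {
        "type": "Polygon",
        "coordinates": [[[-122.0, 73.0], [-121.9, 73.0], [-121.9, 73.1],
                         [-122.0, 73.1], [-122.0, 73.0]]]
      }
    }
  ]
}
""")

looks_like_lonlat = all_coords_in_lonlat_range(example)
```

In practice the same function could be run on the written file by loading it with `json.load`. A failed check would suggest the file still holds projected coordinates, although a passing check alone does not prove the CRS is correct.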

Justin helpfully uploaded this geojson file to Metacat. I pointed the demo PDG XML to the URL for this object: https://arcticdata.io/metacat/d1/mn/v2/object/urn:uuid:dc273c5c-341b-4fb7-89cf-a22fa1d96135

Visualization

I visualized two attributes, both roughly normally distributed with a slight right skew:

  • IWP_cover: "Percentage of land cover occupied by IWPs within each cell; calculated using the total IWP area and cell area."
  • Area_sum: "Total ice-wedge polygon IWP area within 1 km x 1 km rectangular grid cells across the Bernard River Watershed"

I can also visualize the remaining attribute, Cell_area (described in the ADC dataset), if desired. I imagine we will want only one layer from this file on the portal, most likely IWP_cover, since that attribute is derived from the other two. I used just two colors for the palette: a light shade of purple for the minimum value and a dark shade of purple for the maximum value.
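
To illustrate how a two-color palette like this can map attribute values to colors, here is a minimal sketch that linearly interpolates between a light and a dark purple. The hex codes, the function names, and the assumption of linear interpolation are all illustrative; they are not taken from the portal configuration.

```python
def hex_to_rgb(color):
    """Convert a '#rrggbb' string to an (r, g, b) tuple of ints."""
    color = color.lstrip("#")
    return tuple(int(color[i:i + 2], 16) for i in (0, 2, 4))


def two_color_ramp(value, vmin, vmax, low_hex, high_hex):
    """Linearly interpolate between two hex colors for a value in [vmin, vmax].

    Values outside the range are clamped to the nearest end color.
    """
    if vmax == vmin:
        t = 0.0
    else:
        t = (value - vmin) / (vmax - vmin)
    t = min(1.0, max(0.0, t))
    low, high = hex_to_rgb(low_hex), hex_to_rgb(high_hex)
    rgb = tuple(round(lo + t * (hi - lo)) for lo, hi in zip(low, high))
    return "#{:02x}{:02x}{:02x}".format(*rgb)


# Hypothetical palette end points (not the colors used on the portal).
LIGHT_PURPLE = "#e6ccff"
DARK_PURPLE = "#4b0082"

# IWP_cover is a percentage, so 0 and 100 are used as the range here.
color_at_min = two_color_ramp(0, 0, 100, LIGHT_PURPLE, DARK_PURPLE)
color_at_max = two_color_ramp(100, 0, 100, LIGHT_PURPLE, DARK_PURPLE)
```

Using the observed minimum and maximum of the attribute instead of 0 and 100 would stretch the ramp across the values actually present, which may give more visual contrast for a distribution concentrated in a narrow range.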

Exploration of line geometries

The other file in this dataset, IWP_net_Bernard.gpkg, is 21.8 GB and contains 1,548,915 MULTILINESTRING geometries.

Description: "This geopackage contains 139 line layers, covering the extent of the Bernard River watershed, that represent the centerline of ice-wedges calculated by buffering ice-wedge polygons detected from Maxar satellite imagery using a deep learning model, then skeletonizing the buffers."

None of the attributes in this file is suitable for mapping to a color palette:

  • arcid is a unique identifier for the line segments
  • grid_code has the same value for every geometry
  • LENGTH and Length_km are described as non-essential data and have plenty of NA values
  • from_node and to_node are unique identifiers for the start and end points of the line segments.

Therefore, the palette specified in the XML could perhaps be a single color assigned to the one value of the attribute grid_code.
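
The screening described above can be roughly automated. The sketch below classifies a column from its values alone; the category names, the heuristics, and the sample values are assumptions for illustration and were not part of the original exploration. With geopandas, the values could come from something like `gdf["grid_code"].tolist()`.

```python
import math


def is_missing(value):
    """Treat None and float NaN as missing."""
    return value is None or (isinstance(value, float) and math.isnan(value))


def screen_attribute(values):
    """Classify a column as 'has_missing', 'constant', 'identifier', or 'candidate'.

    Heuristics (assumptions, not a definitive rule):
    - any missing value             -> 'has_missing'
    - at most one distinct value    -> 'constant'
    - all-unique ints or strings    -> 'identifier'
    - anything else                 -> 'candidate' for a palette
    """
    present = [v for v in values if not is_missing(v)]
    distinct = set(present)
    if len(present) < len(values):
        return "has_missing"
    if len(distinct) <= 1:
        return "constant"
    discrete = all(isinstance(v, (int, str)) for v in present)
    if discrete and len(distinct) == len(present):
        return "identifier"
    return "candidate"


# Made-up sample values that mimic the attribute behavior described above.
sample_columns = {
    "arcid": [1, 2, 3, 4],
    "grid_code": [1, 1, 1, 1],
    "Length_km": [0.12, None, 0.40, float("nan")],
}
report = {name: screen_attribute(vals) for name, vals in sample_columns.items()}
```

The result is only a prompt for a closer look: an attribute flagged as a candidate still needs a sensible distribution to make a useful palette.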

I converted this file to GeoJSON as well but, as expected given its much larger size, I was unable to upload it as a single file for a new layer.

While I was on vacation, @rushirajnenuji helpfully picked up the task of tiling the larger file with the 3d tiles workflow (which includes staging and then 3dtiling) with modifications to both packages. This work is in progress, but for today we will publish the ADC package with just the original data files as well as the smaller file's geojson version as an alternative file for the data. Then when the tiling for the multiline geometries is complete, the dataset will be updated with the new tilesets.

Thank you to @justinkadi and the data curation team for their work on this dataset 🙌🏼

@julietcohen

Rushi has uploaded the smaller file, which contains polygon geometries representing IWP coverage, as a layer to the PDG 🎉

He is working on adding functionality for non-polygon geometries in order to tile the larger file, which represents the centerlines of the "capillaries" of the Arctic.

@julietcohen

The published ADC package is here: https://arcticdata.io/catalog/view/doi%3A10.18739%2FA2610VT6C

Projects
Status: In Progress