I'm guessing this is an internet connection issue, but I'm reporting it here just in case:
quarto preview /home/robin/github/robinlovelace/tartu24/Mobile-Tartu-2024/source/notebooks/Intro-to-data.qmd --no-browser --no-watch-inputs
Starting python3 kernel...Done
Executing 'Intro-to-data.ipynb'
Cell 1/39: ''...Done
Cell 2/39: ''...Done
Cell 3/39: ''...Done
Cell 4/39: ''...Done
Cell 5/39: ''...Done
Cell 6/39: ''...Done
Cell 7/39: ''...Done
Cell 8/39: ''...Done
Cell 9/39: ''...Done
Cell 10/39: ''...Done
Cell 11/39: ''...Done
Cell 12/39: ''...Done
Cell 13/39: ''...Done
Cell 14/39: ''...Done
Cell 15/39: ''...Done
Cell 16/39: ''...Done
Cell 17/39: ''...Done
Cell 18/39: ''...Done
Cell 19/39: ''...ERROR:
An error occurred while executing the following cell:
------------------
# Fetch the bounding box of the area
bbox = admin_area.total_bounds.tolist()
# Fetch the data from overturemaps
table = overturemaps.record_batch_reader("place", bbox).read_all()
# Merge all chunks
table = table.combine_chunks()
# Convert Arrow Table into a DataFrame
df = table.to_pandas()
# Convert geometries from WKB format to normal geometries
df["geometry"] = shapely.from_wkb(df["geometry"])
# Create a GeoDataFrame out of the results
gdf = gpd.GeoDataFrame(df, crs="epsg:4326")
gdf.head()
------------------
---------------------------------------------------------------------------
OSError Traceback (most recent call last)
Cell In[19], line 5
2 bbox = admin_area.total_bounds.tolist()
4 # Fetch the data from overturemaps
----> 5 table = overturemaps.record_batch_reader("place", bbox).read_all()
7 # Merge all chunks
8 table = table.combine_chunks()
File ~/micromamba/envs/geo/lib/python3.11/site-packages/pyarrow/ipc.pxi:757, in pyarrow.lib.RecordBatchReader.read_all()
File ~/micromamba/envs/geo/lib/python3.11/site-packages/pyarrow/error.pxi:88, in pyarrow.lib.check_status()
File ~/micromamba/envs/geo/lib/python3.11/site-packages/overturemaps/core.py:37, in <genexpr>(.0)
29 batches = dataset.to_batches(filter=filter)
31 # to_batches() can yield many batches with no rows. I've seen
32 # this cause downstream crashes or other negative effects. For
33 # example, the ParquetWriter will emit an empty row group for
34 # each one bloating the size of a parquet file. Just omit
35 # them so the RecordBatchReader only has non-empty ones. Use
36 # the generator syntax so the batches are streamed out
---> 37 non_empty_batches = (b for b in batches if b.num_rows > 0)
39 geoarrow_schema = geoarrow_schema_adapter(dataset.schema)
40 reader = pa.RecordBatchReader.from_batches(geoarrow_schema, non_empty_batches)
File ~/micromamba/envs/geo/lib/python3.11/site-packages/pyarrow/_dataset.pyx:3687, in _iterator()
File ~/micromamba/envs/geo/lib/python3.11/site-packages/pyarrow/_dataset.pyx:3305, in pyarrow._dataset.TaggedRecordBatchIterator.__next__()
File ~/micromamba/envs/geo/lib/python3.11/site-packages/pyarrow/error.pxi:154, in pyarrow.lib.pyarrow_internal_check_status()
File ~/micromamba/envs/geo/lib/python3.11/site-packages/pyarrow/error.pxi:91, in pyarrow.lib.check_status()
OSError: AWS Error NETWORK_CONNECTION during GetObject operation: curlCode: 28, Timeout was reached
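If the timeout is transient, as the `curlCode: 28` message suggests, a possible workaround is to retry the fetch a few times before giving up. The following is only a minimal sketch under that assumption, not something taken from the `overturemaps` package: `retry_on_oserror` and its parameters (`attempts`, `base_delay`, `sleep`) are hypothetical names introduced here for illustration.

```python
import time


def retry_on_oserror(fetch, attempts=3, base_delay=1.0, sleep=time.sleep):
    """Call fetch() and retry with exponential backoff if it raises OSError.

    Hypothetical helper: `fetch` is any zero-argument callable. The last
    OSError is re-raised once `attempts` calls have failed.
    """
    for attempt in range(1, attempts + 1):
        try:
            return fetch()
        except OSError:
            if attempt == attempts:
                raise
            # Wait base_delay, then 2x, 4x, ... before the next attempt
            sleep(base_delay * 2 ** (attempt - 1))


# Possible use in the failing cell (assumes `overturemaps` and `bbox` are
# already defined as in the notebook). The reader is created inside the
# lambda so that every attempt starts from a fresh reader:
#
# table = retry_on_oserror(
#     lambda: overturemaps.record_batch_reader("place", bbox).read_all()
# )
```

Retrying only helps if the connection recovers between attempts. If the error persists across retries, the cause is more likely the network itself or the remote storage than the notebook code.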