You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
It takes a long time to read a large dataset from a source for the first time.
I created a fresh Docker environment for my dataframe example and found it very time consuming to pull a large dataset of nycflights13.
If you use red-dataset-arrow, the cache is stored in the arrow file, but the first time you load it, it takes a long time to load and parse because it uses Ruby's CSV.
Is it possible to make the environment extended with red-dataset-arrow use arrow to load and parse?
The text was updated successfully, but these errors were encountered:
It takes a long time to read a large dataset from a source for the first time.
I created a fresh Docker environment for my dataframe example and found it very time consuming to pull a large dataset of
nycflights13
.If you use
red-dataset-arrow
, the cache is stored in the arrow file, but the first time you load it, it takes a long time to load and parse because it uses Ruby's CSV.Is it possible to make the environment extended with red-dataset-arrow use arrow to load and parse?
The text was updated successfully, but these errors were encountered: