Try using polars
for fasting data loading or even use Hugging Face Datasets
#54
Labels
training
Model training
Right now the
data_loader.py
script works with a pandas DataFrame.This is fine with ~25,000 images but perhaps not ideal for larger datasets.
Alternatives are:
polars
, a DataFrame-like library built in Rust (very fast) - https://pola-rs.github.io/polars/py-polars/html/index.htmlCan also test the speed of data loading using:
The text was updated successfully, but these errors were encountered: