feat(model): Add optional memoization to datasets during model training. #209

ErikBavenstrand · 2024-04-12T08:26:59Z

By specifying the optional parameter memoized_dataset_cache_size > 0, the corresponding number of datasets will be kept in memory to avoid repeated conversion from vaex to pandas in settings where we perform repeated fitting using the same datasets e.g. hyperparamter tuning. Use with caution and always call clear_load_dataset_cache once completed to clear the cache.

By specifying the optional parameter `memoized_dataset_cache_size > 0`, the corresponding number of datasets will be kept in memory to avoid repeated conversion from `vaex` to `pandas` in settings where we perform repeated fitting using the same datasets e.g. hyperparamter tuning. Use with caution and always call `clear_load_dataset_cache` once completed to clear the cache.

Erik Båvenstrand added 9 commits April 12, 2024 10:23

revert: Attempt fix GH Actions

378ef56

revert: Attempt fix GH Actions

6da366f

revert: Attempt fix GH Actions

c049864

revert: test

2aa24f7

revert: testing

ba70053

revert: test

476655e

revert: Fix GitHub Actions

4662ac7

revert: Fix build error

0c8ceab

ErikBavenstrand merged commit 2ca4465 into main Apr 12, 2024
15 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(model): Add optional memoization to datasets during model training. #209

feat(model): Add optional memoization to datasets during model training. #209

ErikBavenstrand commented Apr 12, 2024 •

edited

Loading

feat(model): Add optional memoization to datasets during model training. #209

feat(model): Add optional memoization to datasets during model training. #209

Conversation

ErikBavenstrand commented Apr 12, 2024 • edited Loading

ErikBavenstrand commented Apr 12, 2024 •

edited

Loading