Skip to content

v1.20 - Pining for the Fjords

Compare
Choose a tag to compare
@pzelasko pzelasko released this 31 Jan 20:51
455b20e

What's Changed

New features

  • Extended the subset of lhotse that works without installing torchaudio by @pzelasko in #1253 #1255
  • Ensure drop_last=False always returns an equal number of mini-batches by re-distributing and/or duplicating some data by @pzelasko in #1277
  • Improved CPU memory usage and shuffling + bucketing in DynamicBucketingSampler by @pzelasko in #1276
  • Enable seed randomization in dynamic samplers by @pzelasko in #1278

Recipes

Other improvements

  • Update docs with env vars used by Lhotse by @pzelasko in #1252
  • support whisper large v3; deepspeed launcher rank world_size setting by @yuekaizhang in #1260
  • Fix non-deterministic tests by @pzelasko in #1261
  • Fix duplication issues in CutSet.mix() by @pzelasko in #1268
  • Support controllable CutSet.mux weights in multiprocess dataloading by @pzelasko in #1266
  • Fix distributed sampler initialization and exceeded sampler warning false positives by @pzelasko in #1270
  • Install kaldi-native-io explicitly in the kaldi doc example. by @csukuangfj in #1275
  • Allow duplicate cut IDs in a CutSet (CutSet is list-like instead of dict-like) by @pzelasko in #1279

New Contributors

Full Changelog: v1.19...v1.20