v1.22 - Sherpa's Paradise

@pzelasko pzelasko released this 07 Mar 19:38
· 54 commits to master since this release
d26d476

What's Changed

New features

  • Extending Lhotse dataloading to text/multimodal data by @pzelasko in #1295

As an experimental feature, we are extending the API of Lhotse samplers so that key sampling features also work for non-audio data such as text. With some lightweight wrappers, text (and other) data can be dynamically multiplexed and bucketed in the same way as audio data. See the new documentation: https://lhotse.readthedocs.io/en/latest/datasets.html#customizing-sampling-constraints
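To make "multiplexed and bucketed" concrete for text, here is a minimal plain-Python sketch of the idea. It does not use Lhotse and is not Lhotse's API; `multiplex`, `bucket_batches`, `bucket_edges`, and `max_tokens` are hypothetical names chosen for illustration, and whitespace-separated words stand in for tokens.

```python
import bisect
import random
from typing import Iterable, Iterator, List, Sequence


def multiplex(
    sources: Sequence[Iterable[str]], weights: Sequence[float], seed: int = 0
) -> Iterator[str]:
    """Randomly interleave several text streams until all are exhausted."""
    rng = random.Random(seed)
    iters = [iter(s) for s in sources]
    active = list(range(len(iters)))
    while active:
        # Pick one of the still-active sources according to its weight.
        idx = rng.choices(active, weights=[weights[i] for i in active])[0]
        try:
            yield next(iters[idx])
        except StopIteration:
            active.remove(idx)


def bucket_batches(
    examples: Iterable[str], bucket_edges: Sequence[int], max_tokens: int
) -> Iterator[List[str]]:
    """Group examples of similar length and cap each batch by total tokens.

    An example longer than max_tokens still forms a batch on its own.
    """
    buckets: List[List[str]] = [[] for _ in range(len(bucket_edges) + 1)]
    for text in examples:
        n = len(text.split())  # assumption: one token per whitespace-separated word
        bucket = buckets[bisect.bisect_right(bucket_edges, n)]
        if bucket and sum(len(t.split()) for t in bucket) + n > max_tokens:
            yield list(bucket)  # flush before the constraint would be exceeded
            bucket.clear()
        bucket.append(text)
    for bucket in buckets:  # flush whatever is left at the end of the stream
        if bucket:
            yield list(bucket)


english = ["a b c", "d e", "f g h i j k"]
polish = ["x", "y z", "p q r s t u v"]
mixed = list(multiplex([english, polish], weights=[0.5, 0.5], seed=0))
batches = list(bucket_batches(mixed, bucket_edges=[4], max_tokens=6))
```

In Lhotse itself the equivalent behavior comes from the samplers and the sampling constraints described in the linked documentation; the sketch only shows why a token-based constraint lets text reuse the same batching logic as duration-based audio sampling.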

  • Multi-channel support improvements
    • Fix loading multi-channel custom recording fields in multi cuts by @pzelasko in #1298
    • Channel selection for multi-channel custom recording fields by @pzelasko in #1299

Lhotse MultiCuts:

  • are now exportable into Lhotse Shar format
  • gained a new method cut = cut.with_channels([0, 1, ...]) to modify the channels they refer to
  • can have multi-channel custom Recordings whose channels are selectable via a special custom key. For example, if cut.target_recording is defined, audio can be read via cut.load_target_recording(), and the channels are auto-selected by looking up cut.target_recording_channel_selector.
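The naming convention in the last bullet (a custom field plus a matching `<field>_channel_selector` key) can be illustrated with a toy stand-in. This is a hedged sketch in plain Python, not Lhotse's implementation: `ToyRecording`, `ToyMultiCut`, and `load_custom_recording` are hypothetical names, and audio is modeled as one list of samples per channel.

```python
from dataclasses import dataclass, field, replace
from typing import Dict, List, Sequence


@dataclass(frozen=True)
class ToyRecording:
    # Hypothetical stand-in for a recording: one list of samples per channel.
    samples: List[List[float]]


@dataclass(frozen=True)
class ToyMultiCut:
    # Channels of the main recording that this cut refers to.
    channel: List[int]
    # Custom fields, e.g. "target_recording" and "target_recording_channel_selector".
    custom: Dict[str, object] = field(default_factory=dict)

    def with_channels(self, channels: Sequence[int]) -> "ToyMultiCut":
        """Return a copy that refers to a different set of channels."""
        return replace(self, channel=list(channels))

    def load_custom_recording(self, name: str) -> List[List[float]]:
        """Load a custom recording, honoring '<name>_channel_selector' if present."""
        recording = self.custom[name]
        selector = self.custom.get(f"{name}_channel_selector")
        if selector is None:
            return recording.samples  # no selector: return every channel
        return [recording.samples[c] for c in selector]


target = ToyRecording(samples=[[0.1, 0.2], [0.3, 0.4], [0.5, 0.6]])
cut = ToyMultiCut(
    channel=[0, 1, 2],
    custom={"target_recording": target, "target_recording_channel_selector": [0, 2]},
)
stereo_cut = cut.with_channels([0, 1])
selected = cut.load_custom_recording("target_recording")
```

With real Lhotse MultiCuts the corresponding calls are the ones named above, `cut.with_channels([...])` and `cut.load_target_recording()`; the toy class only shows how a selector key can drive which channels are returned.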

Recipes

Other improvements

New Contributors

Full Changelog: v1.21...v1.22