Skip to content

v1.8 Sudden Avalanche

Compare
Choose a tag to compare
@pzelasko pzelasko released this 30 Sep 13:18

Breaking changes

  • Python 3.6 is no longer supported as of Lhotse v1.8. If you need to use Python 3.6, please revert to Lhotse 1.7 and earlier.

Highlights

  • New experimental module of lhotse: workflows, now integrates optional third party packages that assist corpus creators in automated data curation. With release 1.8, we support OpenAI Whisper for automatic transcription and segmentation, and torchaudio Wav2Vec2/Hubert ASR bundles for forced alignment.

ctxG6RI

What's Changed

  • Fix read and write in piped CLI by @desh2608 in #807
  • Default behavior of CutSet.mix by @ZuoyunZheng in #809
  • Adding more info about resampling options by @RuABraun in #815
  • Add pad_silence option to extend_by by @desh2608 in #816
  • Message when calling len() on LazyFilter by @desh2608 in #817
  • Refactor cut and retain git blame history by @desh2608 in #820
  • Audio backend refactoring and a workaround for FLAC reading from/writing to in-memory buffers by @pzelasko in #814
  • Experimental Lhotse feature: corpus creation tools (workflows), starting with OpenAI Whisper support by @pzelasko in #824
  • Drop support for Python 3.6 by @pzelasko in #829
  • [workflow] Word-level forced alignment with pretrained models from Torchaudio by @pzelasko in #827

New Contributors

Full Changelog: v1.7...v1.8