v1.8 Sudden Avalanche

pzelasko released this 30 Sep 13:18

Breaking changes

Python 3.6 is no longer supported as of Lhotse v1.8. If you need to use Python 3.6, please revert to Lhotse 1.7 and earlier.

Highlights

New experimental module of lhotse: workflows, now integrates optional third party packages that assist corpus creators in automated data curation. With release 1.8, we support OpenAI Whisper for automatic transcription and segmentation, and torchaudio Wav2Vec2/Hubert ASR bundles for forced alignment.

What's Changed

Fix read and write in piped CLI by @desh2608 in #807
Default behavior of CutSet.mix by @ZuoyunZheng in #809
Adding more info about resampling options by @RuABraun in #815
Add pad_silence option to extend_by by @desh2608 in #816
Message when calling len() on LazyFilter by @desh2608 in #817
Refactor cut and retain git blame history by @desh2608 in #820
Audio backend refactoring and a workaround for FLAC reading from/writing to in-memory buffers by @pzelasko in #814
Experimental Lhotse feature: corpus creation tools (workflows), starting with OpenAI Whisper support by @pzelasko in #824
Drop support for Python 3.6 by @pzelasko in #829
[workflow] Word-level forced alignment with pretrained models from Torchaudio by @pzelasko in #827

New Contributors

@ZuoyunZheng made their first contribution in #809

Full Changelog: v1.7...v1.8

Contributors

desh2608, RuABraun, and 2 other contributors

Assets 2