v1.9 Neighboring Peaks

@pzelasko released this 20 Oct 18:32

Major features

  • MultiCut data type: simplifies working with multi-channel data (contribution from @desh2608)
  • CSJ recipe (contribution from @teowenshen)
  • lots of bug fixes

What's Changed

  • create proper wav_id in the segments file for multichannel recording by @jtrmal in #831
  • kaldi: add a switch/option to read the durations from kaldi utt2dur … by @jtrmal in #832
  • Update test packages by @pzelasko in #837
  • MultiCut to store multi-channel recordings with shared supervision by @desh2608 in #822
  • Use CutSet for whisper annotation workflow by @desh2608 in #834
  • use spawn() as the strategy to prevent heisenbug by @jtrmal in #841
  • Compatibility for reading alignments saved before Lhotse v1.8 by @pzelasko in #842
  • make regexp string raw by @jtrmal in #836
  • Use absolute recording paths in yesno recipe by @pzelasko in #845
  • Fix CutSet.compute_and_store_features support for lazy CutSets by @pzelasko in #844
  • Fixing some QA functions for lazy manifests by @desh2608 in #848
  • Fix timestamps in Whisper annotation workflow by @pzelasko in #847
  • Update supervisions channels in multi-channel recipes by @desh2608 in #838
  • Allow retaining or trimming channels in trim_to_supervisions by @desh2608 in #852
  • Match cut_id to utt_id if there is exactly one supervision per cut by @wgb14 in #853
  • forced alignment: use num2words to get word timestamps for numbers by @eschmidbauer in #849
  • Prepare CSJ by @teowenshen in #851
  • Small changes in trim_to_supervisions() by @desh2608 in #855
  • Fix checkpoints of samplers that were iterated over more than once within the same epoch by @pzelasko in #854
  • Update fisher_english.py by @maxlvov in #858
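
As background for the "make regexp string raw" entry (#836), here is a minimal sketch of why a regular expression is usually written as a raw string in Python. The pattern and the `matches` helper are hypothetical illustrations, not the code touched by that PR.

```python
import re

# Hypothetical pattern for illustration only; not the one changed in #836.
NON_RAW = "\bcut\b"   # Python turns "\b" into a backspace character (0x08)
RAW = r"\bcut\b"      # the regex engine receives "\b" as a word boundary


def matches(pattern: str, text: str) -> bool:
    """Return True if the pattern is found anywhere in the text."""
    return re.search(pattern, text) is not None


text = "one cut per utterance"
print(matches(NON_RAW, text))  # the pattern looks for literal backspaces
print(matches(RAW, text))      # the pattern looks for the whole word "cut"
```

Without the `r` prefix, Python processes the backslash escapes before the `re` module sees the pattern, so the two strings above are different patterns even though they look alike in source.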

Full Changelog: v1.8...v1.9