Releases: davidlougheed/strkit

Version 0.18.0 alpha 5

02 Oct 17:08

Version 0.18.0 alpha 4

02 Oct 01:35

Version 0.18.0 alpha 3

30 Sep 19:53

Version 0.18.0 alpha 2

25 Sep 19:50
Pre-release

Full Changelog: v0.18.0-alpha.1...v0.18.0-alpha.2

Version 0.18.0 alpha 1

25 Sep 18:08
Pre-release

What's Changed

Features and changes

  • Caller:
    • VCFs include variant type (snv or str)
    • VCF alts now properly set the anchor (5') base if different from the reference, or are * if completely deleted
    • VCF no longer includes an unused header line
  • MI:
    • A STRkit VCF calculator is now included
    • A length-based calculator for TRGT is now included
    • Excluded regions/calls are now handled via overlap instead of exact match
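The switch from exact-match to overlap-based exclusion can be illustrated with a minimal sketch. This is not STRkit's code: the `Region` type, the half-open coordinate convention, and the function names are all assumptions made for illustration.

```python
from typing import NamedTuple


class Region(NamedTuple):
    """Hypothetical region type: 0-based, half-open [start, end)."""
    contig: str
    start: int
    end: int


def overlaps(a: Region, b: Region) -> bool:
    # Two half-open intervals on the same contig overlap if each one
    # starts before the other ends.
    return a.contig == b.contig and a.start < b.end and b.start < a.end


def is_excluded(call: Region, excluded: list[Region]) -> bool:
    # Overlap-based exclusion: any shared base with an excluded region counts.
    return any(overlaps(call, ex) for ex in excluded)


excluded_regions = [Region("chr1", 1000, 1050)]
exact = is_excluded(Region("chr1", 1000, 1050), excluded_regions)
partial = is_excluded(Region("chr1", 1040, 1100), excluded_regions)
adjacent = is_excluded(Region("chr1", 1050, 1100), excluded_regions)
other_contig = is_excluded(Region("chr2", 1000, 1050), excluded_regions)
```

Under exact matching only the first query would be excluded. With overlap, the partially overlapping call is excluded too, while the adjacent call and the call on another contig are kept.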

Bug fixes

Performance

  • Caller:
    • More code has been moved to Rust
    • libdeflate is now used for reading BGZF files, which should be significantly quicker
    • Decoding CIGARs and finding alignment pairs are marginally quicker via better NumPy use
    • A few other minor performance changes
  • MI:

Full Changelog: v0.17.1...v0.18.0-alpha.1

Version 0.17.1

08 Jul 19:19

Version 0.17.0

Version 0.16.0

Version 0.15.0

15 Jun 18:16

What's Changed

Breaking changes

  • The Rust component of STRkit is now required. Pre-built wheels of strkit_rust_ext for some platforms are provided, but otherwise you'll need the Rust toolchain to install STRkit 0.15+.
  • Fractional TR genotype calling has been removed.
  • Support for specifying more than one alignment file at a time has been removed.

Features and changes

  • Caller:
    • Optional incorporation of haplotype-tagged reads from phased alignments
    • Phased blocks of SNVs for tandem repeat phasing
    • Progressive output for JSON, VCFs, and TSVs instead of storing all results in memory
    • Tweaked SNV incorporation logic
    • Added a minimum quality threshold for SNV incorporation
    • Better consensus sequence logic
    • More complete VCF output
    • Call loci with more aligned reads than the max reads limit (truncating to that many reads)
    • Log current processing rate: # loci / second
  • Visualization
  • Log version on startup
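As a rough illustration of the progressive output and processing-rate logging items above, the sketch below writes each result as soon as it is produced rather than collecting everything in memory first. It is not STRkit's implementation: `call_loci`, `write_progressively`, the TSV columns, and the log format are hypothetical.

```python
import io
import time
from typing import Iterable, Iterator, TextIO


def call_loci(loci: Iterable[tuple[str, int, int]]) -> Iterator[dict]:
    """Hypothetical stand-in for a caller: lazily yields one result per locus."""
    for contig, start, end in loci:
        yield {"contig": contig, "start": start, "end": end, "call": "."}


def write_progressively(results: Iterable[dict], out: TextIO, log_every: int = 2) -> int:
    """Write each result as a TSV row when it arrives; log the rate periodically."""
    started = time.perf_counter()
    n = 0
    for res in results:
        out.write(f"{res['contig']}\t{res['start']}\t{res['end']}\t{res['call']}\n")
        n += 1
        if n % log_every == 0:
            elapsed = time.perf_counter() - started
            rate = n / elapsed if elapsed > 0 else float("inf")
            print(f"processed {n} loci ({rate:.1f} loci/second)")
    return n


buffer = io.StringIO()
n_written = write_progressively(
    call_loci([("chr1", 100, 130), ("chr1", 500, 560), ("chr2", 40, 70)]),
    buffer,
)
```

Because `call_loci` is a generator, only one result is held at a time, so memory use stays flat regardless of how many loci are processed.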

Bug fixes

Performance

  • Caller:
    • Reduce reference sequence re-fetching / reference FASTA accesses
    • CIGAR decoding optimization
    • Repeat-counting optimizations
    • Maximum # of iterations for repeat-counting procedures
    • Approximate repeat counting for very large regions
    • Skip regions which take too long to repeat-count
    • Some memory usage improvements
  • Misc. micro optimizations (lambdas, partial functions, etc.)
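For readers unfamiliar with what CIGAR decoding involves, here is a minimal vectorized sketch that turns a CIGAR into aligned (query, reference) coordinate pairs with NumPy. It is only an illustration, not STRkit's actual code; the function name and the `(operation, length)` tuple input are assumptions, while the operation codes follow the SAM specification.

```python
import numpy as np

# CIGAR operation codes as numbered in the SAM/BAM specification.
M, I, D, N, S, H, P, EQ, X = range(9)
CONSUMES_QUERY = np.array([1, 1, 0, 0, 1, 0, 0, 1, 1], dtype=bool)
CONSUMES_REF = np.array([1, 0, 1, 1, 0, 0, 0, 1, 1], dtype=bool)
IS_ALIGNED = CONSUMES_QUERY & CONSUMES_REF  # M, = and X


def aligned_pairs(cigar: list[tuple[int, int]], ref_start: int) -> tuple[np.ndarray, np.ndarray]:
    """Return (query_coords, ref_coords) for every aligned base."""
    if not cigar:
        empty = np.array([], dtype=np.int64)
        return empty, empty.copy()

    ops = np.array([op for op, _ in cigar], dtype=np.int64)
    lens = np.array([length for _, length in cigar], dtype=np.int64)

    # Coordinate at which each operation starts, on the query and the reference.
    q_starts = np.concatenate(([0], np.cumsum(lens * CONSUMES_QUERY[ops])[:-1]))
    r_starts = ref_start + np.concatenate(([0], np.cumsum(lens * CONSUMES_REF[ops])[:-1]))

    keep = IS_ALIGNED[ops]
    block_lens = lens[keep]
    # Offset of each output element within its own aligned block (0..len-1).
    block_first = np.cumsum(block_lens) - block_lens
    within = np.arange(block_lens.sum()) - np.repeat(block_first, block_lens)

    q = np.repeat(q_starts[keep], block_lens) + within
    r = np.repeat(r_starts[keep], block_lens) + within
    return q, r


# 2S 3M 1I 2M 2D 1M, with the alignment starting at reference position 100
q_coords, r_coords = aligned_pairs([(S, 2), (M, 3), (I, 1), (M, 2), (D, 2), (M, 1)], 100)
```

The sketch avoids a Python-level loop over individual bases by using `np.cumsum` for block starts and `np.repeat` to expand each aligned block.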

Documentation

Full Changelog: v0.14.0...v0.15.0

Version 0.15.0 alpha 23

13 Jun 01:38