viralComplete: BLAST-based viral completeness verification

viralComplete verifies the completeness of novel viral contigs. It relies heavily on the following assumptions:

  1. Virus genome size is consistent within a viral family.
  2. If a newly constructed viral contig is complete and belongs to a known family of viruses, then its gene content should be similar to the gene content of a known virus.

We thus compute the “similarity” of a given contig (based on a Naive Bayesian Classifier) to each known virus from the RefSeq database, and check whether the most similar known virus has a length similar to the contig length.
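The scoring idea above can be illustrated with a toy Bernoulli naive Bayes model over gene content. This is only a sketch under stated assumptions, not viralComplete's actual model: the function names, the `profiles` structure (per-virus probabilities of carrying each gene family), and the `floor` smoothing value are all hypothetical.

```python
import math

def log_likelihood(contig_genes, gene_probs, all_genes, floor=1e-3):
    """Bernoulli naive Bayes log-likelihood of a gene set under one reference profile.

    gene_probs maps gene family -> assumed probability that the reference carries it.
    Probabilities are clamped to [floor, 1 - floor] to avoid log(0).
    """
    total = 0.0
    for gene in all_genes:
        p = min(max(gene_probs.get(gene, 0.0), floor), 1.0 - floor)
        total += math.log(p) if gene in contig_genes else math.log(1.0 - p)
    return total

def most_similar(contig_genes, profiles):
    """Return (name, log-likelihood) of the best-scoring reference profile."""
    contig_genes = set(contig_genes)
    all_genes = sorted({g for probs in profiles.values() for g in probs} | contig_genes)
    scores = {
        name: log_likelihood(contig_genes, probs, all_genes)
        for name, probs in profiles.items()
    }
    best = max(scores, key=scores.get)
    return best, scores[best]

# Made-up profiles, for illustration only.
profiles = {
    "virusA": {"capsid": 0.9, "polymerase": 0.9},
    "virusB": {"integrase": 0.9, "tail": 0.9},
}
best_name, best_score = most_similar({"capsid", "polymerase"}, profiles)
```

The best-scoring reference would then supply the length against which the contig length is compared.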

Requirements

viralComplete is a Python script, so no installation is required. However, it has external dependencies. In particular, viralComplete requires Prodigal to be available in your PATH environment variable.
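Before a run, it may help to confirm that Prodigal is actually discoverable. The sketch below uses the standard-library `shutil.which`; the helper name `tool_on_path` is hypothetical, and the executable name `prodigal` is an assumption about how Prodigal is installed on your system.

```python
import shutil

def tool_on_path(name="prodigal"):
    """Return True if an executable called `name` can be found via PATH."""
    return shutil.which(name) is not None

if not tool_on_path("prodigal"):
    print("Prodigal was not found in PATH; viralComplete may not work properly.")
```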

Usage

bin/viralcomplete
        -f Input FASTA file
        -o Output directory

        Optional arguments:
        -h, --help  Show the help message and exit
        -t          Number of threads
        -thr        Completeness threshold (0.0-1.0), default = 0.9
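When viralComplete is called from a pipeline, the options listed above can be assembled programmatically. The following is a minimal sketch: `build_command` is a hypothetical helper, the script path `bin/viralcomplete` is taken from the usage text and assumes you run from the repository root, and only the flags documented above are used.

```python
def build_command(fasta, outdir, threads=None, threshold=None,
                  script="bin/viralcomplete"):
    """Assemble an argument list from the documented viralComplete options."""
    if threshold is not None and not 0.0 <= threshold <= 1.0:
        raise ValueError("threshold must be between 0.0 and 1.0")
    cmd = [script, "-f", str(fasta), "-o", str(outdir)]
    if threads is not None:
        cmd += ["-t", str(threads)]
    if threshold is not None:
        cmd += ["-thr", str(threshold)]
    return cmd

# Example file names are placeholders.
cmd = build_command("contigs.fasta", "vc_out", threads=4, threshold=0.8)
```

The resulting list can be passed to `subprocess.run(cmd, check=True)` to launch the tool.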

Output file: comma-separated table <input_file>_result_table.csv

Output format: contig name, prediction result, log probability, most probable virus name

FASTA files with prediction results can be found in the Prediction_results_fasta folder.
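The result table can be loaded with the standard `csv` module. This sketch assumes the four columns appear in the order given above; whether the file contains a header row is not stated here, so rows whose third field is not a number are skipped. The function name, the dictionary keys, and the sample values are hypothetical.

```python
import csv
import io

def read_results(handle):
    """Parse rows of: contig name, prediction result, log probability, virus name."""
    rows = []
    for record in csv.reader(handle):
        if len(record) < 4:
            continue  # blank or incomplete line
        try:
            log_prob = float(record[2])
        except ValueError:
            continue  # likely a header line or a malformed row
        rows.append({
            "contig": record[0].strip(),
            "prediction": record[1].strip(),
            "log_probability": log_prob,
            # Rejoin in case an unquoted virus name itself contains commas.
            "closest_virus": ",".join(record[3:]).strip(),
        })
    return rows

# Made-up sample content, for illustration only.
sample = io.StringIO("contig_1,Full,-12.5,Some virus\nname,result,logp,virus\n")
rows = read_results(sample)
```

For a real run, open `<input_file>_result_table.csv` with `open(path, newline="")` and pass the file object to `read_results`.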

viralComplete considers a virus complete if its length exceeds a specified threshold, expressed as a fraction of the length of the closest known reference (90% by default). When working with novel viruses, you may lower the detection threshold (provided as an optional argument) to reduce the number of potential false negatives.
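The length check described above amounts to a simple ratio test. The sketch below is an illustration only: `is_complete` is a hypothetical function, and treating a contig that sits exactly at the threshold as complete is an assumption, since the boundary behaviour is not specified here.

```python
def is_complete(contig_length, reference_length, threshold=0.9):
    """Return True if the contig covers at least `threshold` of the reference length."""
    if reference_length <= 0:
        raise ValueError("reference_length must be positive")
    if not 0.0 <= threshold <= 1.0:
        raise ValueError("threshold must be between 0.0 and 1.0")
    return contig_length / reference_length >= threshold

# A 40 kb contig against a 50 kb reference covers 80% of the reference length.
default_call = is_complete(40_000, 50_000)                  # fails the default 90% threshold
relaxed_call = is_complete(40_000, 50_000, threshold=0.75)  # passes a lowered threshold
```

Lowering the threshold, as in the second call, is what turns a borderline contig from a negative into a positive call.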
