Skip to content

Searches nucleotide fasta sequences for an exact match to a PCR amplicon and outputs full-length un-matching sequences to fasta.

Notifications You must be signed in to change notification settings

DiDigsDNA/find_amplicon_in_sequence

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

19 Commits
 
 
 
 

Repository files navigation

This script is the first in a set of custom scripts to facilitate in-silico screening of sequences for exact PCR primers and probe hybridization sites to screen for strains of target organism that may not be detected by lab-developed PCR assays. This script searches a nucleotide fasta file for an exact match to the PCR amplicon, using information contained in an assay-specific oligonucleotide (csv) definition file. The amplicon is created from primers and probe in the correct order, with intervening regions of 0-100 N's. Each query fasta sequence is read as a Biopython SeqRecord and searched for an exact match to the amplicon (or its reverse complement) - allowing degenerate bases - and sequences lacking an exact match are output to fasta for downstream analysis. A summary containing record IDs of non-matching sequences is printed to a text file.

About

Searches nucleotide fasta sequences for an exact match to a PCR amplicon and outputs full-length un-matching sequences to fasta.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages