-
Notifications
You must be signed in to change notification settings - Fork 2
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Merge pull request #2 from fmalmeida/dev
merge dev to master
- Loading branch information
Showing
17 changed files
with
384 additions
and
92 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,2 +1,3 @@ | ||
tests | ||
test | ||
docs/_site |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Empty file.
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Empty file.
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,6 +1,6 @@ | ||
gc_skew() | ||
{ | ||
# exec GCcalc.py | ||
$CONDA_PREFIX/bin/python3 $CONDA_PREFIX/bin/GCcalc.py -f ${RESULTS}/all_vs_all_blast/concatenated_genomes.fasta -w $GCWINDOW -s $GCSTEP | \ | ||
$CONDA_PREFIX/bin/python3 $CONDA_PREFIX/bin/GCcalc.py -f ${RESULTS}/concatenated_genomes.fasta -w $GCWINDOW -s $GCSTEP | \ | ||
cut -f 1,2,3,5 | awk '{ if ($4 > 0) print $0 "\t" "color=dblue"; else print $0 "\t" "color=red"}' > ${RESULTS}/conf/GC_skew.txt | ||
} |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -25,5 +25,5 @@ gff2labels() | |
} | ||
} ; | ||
} | ||
}' $GFF | ||
}' <( sed 's/ /__/g' $GFF ) | ||
} |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -5,80 +5,86 @@ cat << EOF | |
Simple script to create a circos plot between FASTA files. | ||
Copyright, Felipe Almeida <[email protected]>, 2021 | ||
Syntax: | ||
Syntax: | ||
# To draw circos | ||
plot_circos.sh [-h] [--fofn <file> --outdir <outdir> --minlen <int> --minid <int> | ||
--linklen <int> --show_intrachr --gc_window <int> --gc_step <int> | ||
--labels <file> --tiles <files> ] | ||
# To draw circos | ||
plot_circos.sh [-h] [ options ] | ||
# To use helpful scripts | ||
plot_circos [ --gff2labels <FEATURES> <PATTERN> <ATTRIBUTE> <COLOR> <GFF> ] | ||
plot_circos [ --gff2tiles <FEATURES> <PATTERN> <COLOR> <GFF> ] | ||
# To use helpful scripts | ||
plot_circos [ --gff2labels <FEATURES> <PATTERN> <ATTRIBUTE> <COLOR> <GFF> ] | ||
plot_circos [ --gff2tiles <FEATURES> <PATTERN> <COLOR> <GFF> ] | ||
Options: | ||
Options: | ||
# Help | ||
-h/--help Print this help | ||
# Help | ||
-h/--help Print this help | ||
# Threads for blastn | ||
--threads Number of threads to use [Default: 1] | ||
# Threads for blastn | ||
--threads Number of threads to use [Default: 1] | ||
# Output | ||
--outdir Path to output directory [Default: ./results] | ||
# Output | ||
--outdir Path to output directory [Default: ./results] | ||
# Input file of file names | ||
# CSV: fasta path,prefix,color | ||
--fofn File of file names containing list of fastas to | ||
draw circos plot. | ||
# Input file of file names | ||
# CSV: fasta path,prefix,color | ||
--fofn File of file names containing list of fastas to draw circos plot. | ||
# Input min. length | ||
--minlen Min size of contigs to consider for plot [Default: 10000] | ||
# Input min. length | ||
--minlen Min size of contigs to consider for plot [Default: 10000] | ||
# Links (blastn) min. percentage id | ||
--minid Min. percentage id to filter the results of blastn to draw links [Default: 85] | ||
--linklen Min. link (blastn hit) length to display in plot [Default: 5000] | ||
--show_intrachr Tells the program to create a conf file showing intra chr links [Default: false] | ||
Mandatory if using only one FASTA, otherwise, links will not be shown. | ||
# Links (blastn) min. percentage id | ||
--skip_links Do not compute blast and do not draw links. | ||
Useful for when only desiring the configs. [Default: false] | ||
# GC skew config | ||
--gc_window GC skew window size [Default: 5000] | ||
--gc_step GC skew step size [Default: 5000] | ||
--minid Min. percentage id to filter the results of blastn to draw links [Default: 85] | ||
# Labels config | ||
--labels TSV file containing the label definitions for plotting. The file must contain | ||
4 or 5 columns as shown at http://circos.ca/documentation/tutorials/2d_tracks/text_1/lesson, | ||
"DATA FORMAT" section. The first column must be the name (ID) of the contig. | ||
Checkout the "--gff2labels" script (below). | ||
--linklen Min. link (blastn hit) length to display in plot [Default: 5000] | ||
--show_intrachr Tells the program to create a conf file showing intra chr links [Default: false] | ||
Mandatory if using only one FASTA, otherwise, links will not be shown. | ||
# Tiles config | ||
--tiles TSV file containing the tile definitions for plotting. The file must contain | ||
3 or 4 columns as shown at http://circos.ca/documentation/tutorials/configuration/data_files. | ||
The first column must be the name (ID) of the contig. | ||
Checkout the "--gff2tiles" script (below). | ||
# GC skew config | ||
--gc_window GC skew window size [Default: 5000] | ||
--gc_step GC skew step size [Default: 5000] | ||
# Labels config | ||
--labels TSV file containing the label definitions for plotting. The file must contain | ||
4 or 5 columns as shown at http://circos.ca/documentation/tutorials/2d_tracks/text_1/lesson, | ||
"DATA FORMAT" section. The first column must be the name (ID) of the contig. | ||
Checkout the "--gff2labels" script (below). | ||
# Helpful scripts! | ||
# See the examples in our quickstart: https://easy-circos.readthedocs.io/en/latest/quickstart.html | ||
--gff2labels A useful script that allows you to filter a GFF file and create a "circos label file" | ||
with desired inputs. Eg. "plot_circos --gff2labels CDS arcA ID red ecoli_k12.gff". This | ||
command will get each line where the feature (3rd column) is a CDS and that has the | ||
"acrA" (in the complete line) pattern to write the "circos labels file" using the ID | ||
attributes column as label (string as found in the gff), giving these features | ||
a "red" color option. | ||
# Tiles config | ||
--tiles TSV file containing the tile definitions for plotting. The file must contain | ||
3 or 4 columns as shown at http://circos.ca/documentation/tutorials/configuration/data_files. | ||
The first column must be the name (ID) of the contig. | ||
Checkout the "--gff2tiles" script (below). | ||
For <FEATURES> and <PATTERN> users can use "" to match anything, and "|" to match | ||
more than one string. E.g. plot_circos --gff2labels "" "acrA|mdt" ID red ecoli_k12.gff | ||
--gff2tiles A useful script that allows you to filter a GFF file and create a "circos tiles file" | ||
with desired inputs. Eg. "plot_circos --gff2tiles CDS arcA red ecoli_k12.gff". This | ||
command will get each line where the feature (3rd column) is a CDS and that has the | ||
"acrA" (in the complete line) pattern to write the "circos tiles file" giving these | ||
features a "red" color option. | ||
# Helpful scripts! | ||
# See the examples in our quickstart: https://easy-circos.readthedocs.io/en/latest/quickstart.html | ||
--gff2labels A useful script that allows you to filter a GFF file and create a "circos label file" | ||
with desired inputs. Eg. "plot_circos --gff2labels CDS arcA ID red ecoli_k12.gff". This | ||
command will get each line where the feature (3rd column) is a CDS and that has the | ||
"acrA" (in the complete line) pattern to write the "circos labels file" using the ID | ||
attributes column as label (string as found in the gff), giving these features | ||
a "red" color option. | ||
For <FEATURES> and <PATTERN> users can use "" to match anything, and "|" to match | ||
more than one string. E.g. plot_circos --gff2tiles "" "acrA|mdt" red ecoli_k12.gff. | ||
For <FEATURES> and <PATTERN> users can use "" to match anything, and "|" to match | ||
more than one string. E.g. plot_circos --gff2labels "" "acrA|mdt" ID red ecoli_k12.gff | ||
--gff2tiles A useful script that allows you to filter a GFF file and create a "circos tiles file" | ||
with desired inputs. Eg. "plot_circos --gff2tiles CDS arcA red ecoli_k12.gff". This | ||
command will get each line where the feature (3rd column) is a CDS and that has the | ||
"acrA" (in the complete line) pattern to write the "circos tiles file" giving these | ||
features a "red" color option. | ||
For <FEATURES> and <PATTERN> users can use "" to match anything, and "|" to match | ||
more than one string. E.g. plot_circos --gff2tiles "" "acrA|mdt" red ecoli_k12.gff. | ||
--bacannot It also writes the customized configs for bacannot annotations. | ||
Only useful inside bacannot pipelines. | ||
EOF | ||
|
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.