Snakemake workflow: EMU-smk

A Snakemake workflow for running EMU.

Usage

First, prepare a metadata csv file. You can use this as an example. At least, it should have one column that will be used as the identifier of every sample. It must match with the name of the file without the extension.

Then, edit the configuration file so it fits your case. At least, you should indicate (a) the file extension (by default .fastq), (b) the directory where those files are located, (c) the filepath of the metadata CSV and (d) the filepath of the directory with the EMU database. This repository contains toy examples to ensure the pipeline works, but you want to replace them.

snakemake --use-conda -n
snakemake --use-conda -c100

Output

The pipeline produces two TSV files containing the abundances and taxonomy and one RDS file with a phyloseq object with the counts, taxa and metadata (from any additional column you add in the metadata CSV) that you can read into Rstudio to start analyzing your data. These files will be produces in the "results" directory.

Optionally, you can keep additional files per sample (such as the alignments and a fasta of every unclassified sequences using the command "--notemp" that, unintuitively, means keep all temporary files). Those files will be located in the "steps" directory.

snakemake --use-conda -n --notemp
snakemake --use-conda -c100 --notemp

Name		Name	Last commit message	Last commit date
Latest commit History 28 Commits
.github/workflows		.github/workflows
config		config
database		database
reads		reads
workflow		workflow
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
counts.tsv		counts.tsv
metadata.csv		metadata.csv
taxonomy.tsv		taxonomy.tsv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Snakemake workflow: EMU-smk

Usage

Output

About

Languages

License

AU-ENVS-Bioinformatics/emu-smk

Folders and files

Latest commit

History

Repository files navigation

Snakemake workflow: EMU-smk

Usage

Output

About

Resources

License

Stars

Watchers

Forks

Languages