Skip to content

Latest commit

 

History

History
36 lines (24 loc) · 1.96 KB

README.md

File metadata and controls

36 lines (24 loc) · 1.96 KB

MAGstats

DOI

MAGstats: A Jupyter notebook to visualize the completeness and redundancy of MAGs and draft genomes

Run it in Binder: Binder

Beginners Guide

To visualize the completeness and reduduancy of your MAG, you need to get two files ready (example_data):

  1. A newick tree file (MAG_tree.nwk)
  2. A metadata file (MAG_metadata.tsv) The metadata file must contain these columns in order: MAG_ID Length Completion Redundancy GC_Content, and is tab-delimited.

You can launch this jupyter notebook using binder by clicking Binder, and upload your files to the example_data folder via the upload button in your project directory (where theindex.ipynb locates). You probably need to modify these two lines to adapt to your file names in the first code block before running through all the codes:

nwk_file <- "MAG_tree.nwk"
bin_metadata_file <- "MAG_metadata.tsv"

How to generate input files

  1. For the tree file, you can use GToTree to extract single-copy marker genes and to get the concatenated multiple sequence alignements. Then use RAxML-NG to build the maximum-likelihood phylogenomic tree. GToTree also generate a tree file by running fasttree.

  2. For the metadata file, you can get it via checkM or anvio, and format it using Excel and export it as tsv.

If everything works well, you will get something like this: MAGstats