-
Notifications
You must be signed in to change notification settings - Fork 5
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Merge pull request #199 from mskcc/release/0.9.0
Release 0.9.0
- Loading branch information
Showing
18 changed files
with
539 additions
and
642 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,56 @@ | ||
FROM ubuntu:16.04 | ||
|
||
LABEL authors="Nikhil Kumar ([email protected]), Evan Biederstedt ([email protected]), C. Allan Bolipata ([email protected])" \ | ||
version.image="1.0.0" \ | ||
version.conpair="0.3.3" \ | ||
version.gatk="3.8-1" \ | ||
source.conpair="https://github.com/mskcc/Conpair/releases/tag/0.3.3" \ | ||
source.r="https://pkgs.alpinelinux.org/package/edge/community/x86/R" | ||
|
||
ENV CONPAIR_VERSION 0.3.3 | ||
|
||
RUN apt-get update && \ | ||
apt-get upgrade -y && \ | ||
apt-get install -y python && \ | ||
apt-get install -y wget \ | ||
curl \ | ||
bc \ | ||
unzip \ | ||
less \ | ||
bedtools \ | ||
samtools \ | ||
openjdk-8-jdk \ | ||
tabix \ | ||
bzip2 \ | ||
software-properties-common && \ | ||
apt-get -y clean && \ | ||
apt-get -y autoclean && \ | ||
apt-get -y autoremove | ||
|
||
# Download GATK | ||
RUN cd /tmp \ | ||
&& wget "https://software.broadinstitute.org/gatk/download/auth?package=GATK-archive&version=3.8-1-0-gf15c1c3ef" -O gatk-3.8-1.tar.bz2 \ | ||
&& tar xvjf gatk-3.8-1.tar.bz2 \ | ||
&& mv /tmp/GenomeAnalysisTK-3.8-1-0-gf15c1c3ef/GenomeAnalysisTK.jar /usr/bin/GenomeAnalysisTK.jar \ | ||
&& rm -rf /tmp/* | ||
|
||
## install R | ||
RUN apt-get install -y r-base-core r-base-dev \ | ||
## install R dependencies | ||
&& R -e "install.packages('ggplot2', dependencies=TRUE, repos='http://cran.rstudio.com/')" \ | ||
&& R -e "install.packages('reshape2', dependencies=TRUE, repos='http://cran.rstudio.com/')" | ||
|
||
## Python dependencies and Java | ||
## install numpy and scip | ||
RUN pip install numpy==1.15.4 \ | ||
&& pip install scipy==1.1.0 | ||
|
||
## download Conpair | ||
RUN cd /tmp && wget https://github.com/mskcc/Conpair/archive/${CONPAIR_VERSION}.tar.gz \ | ||
&& tar xvzf ${CONPAIR_VERSION}.tar.gz \ | ||
## install conpair | ||
&& mv /tmp/Conpair-${CONPAIR_VERSION} /usr/bin/conpair \ | ||
## clean up | ||
&& rm -rf /tmp/* | ||
|
||
ENV PYTHONNOUSERSITE set |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,28 @@ | ||
FROM continuumio/anaconda3:5.3.0 | ||
|
||
LABEL authors="Evan Biederstedt ([email protected])" \ | ||
version.image="1.0.0" \ | ||
version.lohhla="1.0.0" | ||
|
||
RUN conda create -n env python=3.6 | ||
RUN echo "source activate env" > ~/.bashrc | ||
|
||
ENV PATH /opt/conda/envs/env/bin:$PATH | ||
|
||
# Updating Anaconda packages | ||
RUN conda update conda | ||
RUN conda update anaconda | ||
RUN conda update --all | ||
|
||
## Set up channels | ||
## https://bioconda.github.io/index.html | ||
RUN conda config --add channels defaults | ||
RUN conda config --add channels bioconda | ||
RUN conda config --add channels conda-forge | ||
|
||
## install lohhla | ||
## https://bioconda.github.io/recipes/lohhla/README.html | ||
RUN conda install lohhla | ||
RUN conda update lohhla | ||
|
||
|
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,32 @@ | ||
# Variant annotation | ||
|
||
## GRCh37 | ||
|
||
### SNVs and indels | ||
Basic annotation of merged `vcf` files from the individual variants callers is carried out in two steps. First, the combined `vcf` is annotated with information from [RepeatMasker](http://www.repeatmasker.org/) and the [ENCODE consortium](http://rohsdb.cmb.usc.edu/GBshape/ENCODE/index.html). These files are retrieved from the [UCSC genome browser](https://genome.ucsc.edu) and parsed as such: | ||
|
||
``` shell | ||
wget http://hgdownload.cse.ucsc.edu/goldenPath/hg19/database/rmsk.txt.gz | ||
gunzip rmsk.txt.gz | ||
cut -f6-8,12 rmsk.txt | \ | ||
grep -e "Low_complexity" -e "Simple_repeat" | \ | ||
sed 's/^chr//g'> rmsk_mod.bed | ||
bgzip rmsk_mod.bed | ||
tabix --preset bed rmsk_mod.bed.gz | ||
|
||
wget http://hgdownload.cse.ucsc.edu/goldenPath/hg19/encodeDCC/wgEncodeMapability/wgEncodeDacMapabilityConsensusExcludable.bed.gz | ||
gunzip wgEncodeDacMapabilityConsensusExcludable.bed.gz | ||
sed -i 's/^chr//g' wgEncodeDacMapabilityConsensusExcludable.bed | ||
bgzip wgEncodeDacMapabilityConsensusExcludable.bed | ||
tabix --preset bed wgEncodeDacMapabilityConsensusExcludable.bed.gz | ||
``` | ||
|
||
Subsequently, [vcf2maf](https://github.com/mskcc/vcf2maf) is used to annotate functional effects of mutations as well as other metadata using [VEP](https://www.ensembl.org/vep). The `--custom-enst` argument to vcf2maf takes a list of preferred gene transcript isoforms which to map mutations onto. We supply a consensus list of [`isoform_overrides_at_mskcc` and `isoform_overrides_uniprot`](https://github.com/mskcc/vcf2maf/tree/master/data), generated as such: | ||
``` r | ||
t1 = readr::read_tsv('isoform_overrides_at_mskcc') | ||
t2 = readr::read_tsv('isoform_overrides_uniprot') | ||
t2 %>% | ||
dplyr::filter(gene_name %nin% t1$gene_name) %>% | ||
dplyr::bind_rows(., t1) %>% | ||
readr::write_tsv('isoforms') | ||
``` |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,3 +1,5 @@ | ||
# Genomic intervals | ||
|
||
## GRCH37 | ||
|
||
### Genome | ||
|
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,6 +1,6 @@ | ||
# Contents | ||
|
||
1. [References](https://github.mskcc/vaporware/blob/master/docs/REFERENCES.md) | ||
1. [Intervals](https://github.mskcc/vaporware/blob/master/docs/INTERVALS.md) | ||
2. ... | ||
3. ... | ||
1. [References](https://github.com/mskcc/vaporware/blob/master/docs/REFERENCES.md) | ||
2. [Intervals](https://github.com/mskcc/vaporware/blob/master/docs/INTERVALS.md) | ||
3. [Variant Annotation](https://github.com/mskcc/vaporware/blob/readme_updates/docs/ANNOTATION.md) | ||
4. ... |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.