Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Clinical dataset validation #8

Open
1 task done
bhillmann opened this issue Nov 18, 2016 · 8 comments
Open
1 task done

Clinical dataset validation #8

bhillmann opened this issue Nov 18, 2016 · 8 comments
Assignees

Comments

@bhillmann
Copy link
Collaborator

bhillmann commented Nov 18, 2016

  • Run deep shotgun data with disease association (need to download/curate a couple of datasets) to show same findings as deep shotgun

@RRShieldsCutler Has some of these datasets

@bhillmann
Copy link
Collaborator Author

bhillmann commented Nov 28, 2016

@RRShieldsCutler I added the folder below with the README for this task:

clinical_datasets/

How close is the final dataset to running?

I would like to use strandex to subsample these FASTQs after quality control and run them all with SHOGUN RefSeq and IMG.

@RRShieldsCutler
Copy link

RRShieldsCutler commented Dec 5, 2016

Sorry just saw this (git only emailed me the text "clinical_datasets" and none of your questions).

I ran shizen without flash on the dataset, trim_l to 50. I downsampled the R1's to 500k reads. Also converted both the deep and shallow to fasta. All those files are in this directory:
/project/flatiron2/data/public_shotgun/karlsson2013/shizen_20161130

Yesterday I ran shogun on both the deep and shallow set using utree_abfvh. Results are located:
Original depth: /project/flatiron2/robin/results/shogun_analysis_karlsson/deep_shotgun
Downsample: /project/flatiron2/robin/results/shogun_analysis_karlsson/shallow/

From this dataset
http://www.nature.com/nature/journal/v498/n7452/full/nature12198.html

@bhillmann
Copy link
Collaborator Author

Can you rerun the commands with the newest UTree versions?

@RRShieldsCutler
Copy link

Yup. Running currently, they should be done by the evening sometime. They'll be in the same directories as pasted above.

@RRShieldsCutler
Copy link

Both are finished, FYI. I confirmed that the confidence intervals are updated in the tsv files.
Original depth: /project/flatiron2/robin/results/shogun_analysis_karlsson/deep_shotgun
Downsample: /project/flatiron2/robin/results/shogun_analysis_karlsson/shallow/

@bhillmann
Copy link
Collaborator Author

Awesome, great work.

@bhillmann
Copy link
Collaborator Author

/project/flatiron2/data/public_shotgun/karlsson2013/map.txt

@RRShieldsCutler
Copy link

Currently re-running with the new complete-species utree. The deep (full reads) should finish tonight sometime here:

/project/flatiron2/robin/results/shogun_analysis_karlsson/161208_analysis/deep/shogun_utree_lca_out

The shallow (downsampled) reanalysis is complete and located here:

/project/flatiron2/robin/results/shogun_analysis_karlsson/161208_analysis/shallow/shogun_utree_lca_out

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

3 participants