Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Generating reads with even 10X coverage for WGS is taking too long #80

Open
tnnandi opened this issue Dec 24, 2022 · 1 comment
Open

Comments

@tnnandi
Copy link

tnnandi commented Dec 24, 2022

Hi,

I'm trying to generate reads with 10X coverage for a genome with a size of 3 billion bp and a read length of 2x150 bp. This leads to a requirement of 60 million reads and it is turning out to be almost impossible to generate all the reads using a single CPU. Is there a way to accelerate this reads generation process?

Thank you very much.

@davetang
Copy link

davetang commented Jun 6, 2023

Split your genome up into chromosomes (or contigs), run dwgsim on each separately, and then concatenate the FASTQ (and mutation) files. If you didn't already know, you can directly cat the gz files.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants