Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Pipeline crashed at transabyss #30

Open
kokyriakidis opened this issue Apr 27, 2019 · 2 comments
Open

Pipeline crashed at transabyss #30

kokyriakidis opened this issue Apr 27, 2019 · 2 comments

Comments

@kokyriakidis
Copy link

kokyriakidis commented Apr 27, 2019

Here is the CMD output:

CHECKPOINT: Unitig assembly completed.
CMD: bash -euo pipefail -c 'abyss-pe graph=adj --directory=/home/orp/Oyster_River_Protocol/DATA/assemblies/SRR5330501.transabyss k=32 name=SRR5330501.transabyss.fasta j=20 in="/home/orp/Oyster_River_Protocol/DATA/rcorr/SRR5330501.TRIM_1P.cor.fq /home/orp/Oyster_River_Protocol/DATA/rcorr/SRR5330501.TRIM_2P.cor.fq" l=32 s=32 n=2 SIMPLEGRAPH_OPTIONS="--no-scaffold" OVERLAP_OPTIONS="--no-scaffold" MERGEPATH_OPTIONS="--greedy" SRR5330501.transabyss.fasta-6.fa'
The minimum coverage of single-end contigs is 2.
The minimum coverage of merged contigs is 2.
warning: the seed-length should be at least twice k: k=32, s=32
Building the suffix array...
Building the Burrows-Wheeler transform...
Building the character occurrence table...
Mateless   52894022  100%
Unaligned         0
Singleton         0
FR                0
RF                0
FF                0
Different         0
Total      52894022
abyss-fixmate: error: All reads are mateless. This can happen when first and second read IDs do not match.
error: `SRR5330501.transabyss.fasta-3.hist': No such file or directory

Everything until then went fine! Do you know what the problem is?

The reads ID is this:

@MG00HS05:491:C7450ACXX:4:1101:1240:2223_forward/1
and
@MG00HS05:491:C7450ACXX:4:1101:1240:2223_reverse/2

These files were produced from fastq-dump. It seems that the problem is the naming, should be identical. So I should remove the forward and reverse part. Do you have a simple way to do that?

Can i restart the pipeline from the checkpoint above? Or do I have to run it from the start?

@AdamStuckert
Copy link
Contributor

Hi @kokyriakidis,

It looks like you are right and this error is occurring because it expects the same exact header before the /1 or /2 (see: https://github.com/bcgsc/abyss/wiki/ABySS-Users-FAQ).

You can try this fix. Just replace the read1/2 and output file names for your purposes.

sed "s/_forward//g" $READ1 > new_reads.1.fq
sed "s/_reverse//g" $READ2 > new_reads.2.fq

The ORP is checkpointed so you should be able to just restart it and it will pick up at the last checkpoint it passed. Note that you will want to rename the *TRIM_*P.cor.fq reads or it will resume and give you the same error.

@macmanes
Copy link
Contributor

macmanes commented Apr 29, 2019

for the checkpointing to work, you'll have to trick the software into thinking that the other assemblies have been made with the "new" reads

touch assemblies/*fasta

before rerunning the ORP, but after you change the reads as per above.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants