Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

CAFE Report File Analysis error #85

Open
CLAIRE-cuhk opened this issue Sep 21, 2020 · 2 comments
Open

CAFE Report File Analysis error #85

CLAIRE-cuhk opened this issue Sep 21, 2020 · 2 comments

Comments

@CLAIRE-cuhk
Copy link

CLAIRE-cuhk commented Sep 21, 2020

Hello,

I encountered an error when I try to parse the cafe report. I found a similar issue posted before and saying this error below is caused by the additional internal node label. I tried to delete the internal node labels in the input tree to fix the problem but didn't work. Can anyone tell me which exact part should I delete?

Here is my original tree:
(Dro:155.769,(Mus:140.192,(((Aedi:41.5384,Cul:41.5384)1:41.5384,(Ang:41.5384,Ans:41.5384)1:41.5384)1:41.5384,(Tri:109.038,((((Ped:31.1538,Fra:31.1538)1:31.1538,(Bla:46.7307,((Zoo:15.5769,Cry:15.5769)1:15.5769,Per:31.1538)1:15.5769)1:15.5769)1:15.5769,((Cim:25.9615,Cin25.9615)83.6:25.9615,Lao:51.923)1:25.9615)1:15.5769,(Apia:74.7691,(Lin:56.0768,((Nyl:18.6923,Cam:18.6923)1:18.6923,(Mon:18.6923,Cyp:18.6923)1:18.6923)1:18.6923)1:18.6923)1:18.6923)1:15.5769)1:15.5769)1:15.5769)1:15.5769)

	CAFE Report File Analysis
	09.21.2020 | 10:24:20

Parsing format information...


Initializing output structures...


Counting changes per branch...
[*-------------------------------------------------] 0.061% complete.Traceback (most recent call last):
File "./cafetutorial_report_analysis.py", line 371, in
results_main, node_fams_main = cra(inlines_main, results_main, node_fams_main, linestart_main, ancfilename, sorted_nodes, 1);
File "./cafetutorial_report_analysis.py", line 193, in cra
anccount = int(tlinfo[curanc][4]);
ValueError: invalid literal for int() with base 10: '4_11'

@gwct
Copy link
Contributor

gwct commented Sep 21, 2020 via email

@CLAIRE-cuhk
Copy link
Author

Hi, I think this is happening because the report_analysis script unfortunately can't handle node labels on the tree. All of your internal nodes seem to have the label '1' (except for one that is '83.6'). These labels get carried over to each gene family tree in the CAFE output. To fix, you can either re-run CAFE without the internal labels on the original tree, or remove the labels from all trees in the CAFE report file with some regex. Let us know if you need any more help! -Gregg Thomas

Many thanks for your help!
I was able to finish the run after removing all internal node labels.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants