cafetutorial_clade_and_size_filter.py error #64

Open
NINGCHINA opened this issue Jul 17, 2019 · 4 comments

Comments

@NINGCHINA

Hi! While checking cafetutorial_clade_and_size_filter.py at https://iu.app.box.com/v/cafetutorial-files/folder/22161186238?page=1 , I found that the script writes out gene families that have only one gene copy across all species, which are not suitable for gene family analysis. The problem appears to be at lines 104 and 105, though it may just be that I do not fully understand the logic.
The code looks like this:
    elif line_n not in lines_to_separate_set and len(lines_to_keep_set) == 0:
        output_file.write(line)
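
To make the concern concrete, here is a minimal sketch that isolates only the quoted condition. The function name `write_unmarked_lines`, the sample input, and the use of a plain `if` (the original is an `elif` inside a longer chain that is not shown in this thread) are all assumptions for illustration, not the script's actual code. It suggests that when `lines_to_keep_set` is empty, every line not listed in `lines_to_separate_set` is written, regardless of its copy counts.

```python
import io


def write_unmarked_lines(lines, lines_to_separate_set, lines_to_keep_set, output_file):
    """Hypothetical helper that mimics only the quoted branch of the script."""
    for line_n, line in enumerate(lines):
        # Same test as the quoted elif; the preceding branches are not reproduced.
        if line_n not in lines_to_separate_set and len(lines_to_keep_set) == 0:
            output_file.write(line)


# Made-up input: a header plus two families; fam2 has a single copy in total.
lines = [
    "Desc\tFamily ID\tspA\tspB\tspC\n",
    "(null)\tfam1\t3\t2\t4\n",
    "(null)\tfam2\t1\t0\t0\n",
]

out = io.StringIO()
write_unmarked_lines(lines, set(), set(), out)
written = out.getvalue()
print(written)  # fam2 is included because nothing in the condition inspects the counts
```

If `lines_to_keep_set` is non-empty, this branch writes nothing, so whether single-copy families reach the output depends on how the two sets are filled earlier in the script.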

@gwct
Contributor

gwct commented Jul 17, 2019 via email

@NINGCHINA
Author

Thanks, gwct! As I understand it, gene families can be divided into 3 groups. 1: large_families, where more than 100 gene copies are found in one or more species. 2: filtered_families, where more than 1 gene copy is found in more than 2 species in any clade or in all species. 3: gene families with less than 1 gene copy in all species. However, lines 104 and 105 still look odd to me. When I checked the tutorial results in "large_filtered_cafe_input.txt", "filtered_cafe_input.txt" and "unfiltered_cafe_input.txt", it seemed that the code may be wrong.
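
As a rough illustration of the first group only, the size criterion described above could be sketched as follows. The helper name `has_large_count`, the default threshold, and the sample counts are hypothetical; the threshold follows the wording "more than 100" in this comment, and the tutorial script may use a different cutoff or comparison.

```python
def has_large_count(counts, threshold=100):
    """Return True if any species has more than `threshold` gene copies."""
    return any(count > threshold for count in counts)


# Made-up per-species copy counts for two families.
families = {
    "fam1": [3, 2, 4],
    "fam2": [120, 5, 7],
}

large_families = [name for name, counts in families.items() if has_large_count(counts)]
print(large_families)
```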

@gwct
Contributor

gwct commented Jul 17, 2019 via email

@NINGCHINA
Author

@gwct Thank you very much!
