Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

How to interpret variation graphs in SequenceTubeMap #450

Open
gforg34 opened this issue Jun 18, 2024 · 0 comments
Open

How to interpret variation graphs in SequenceTubeMap #450

gforg34 opened this issue Jun 18, 2024 · 0 comments

Comments

@gforg34
Copy link

gforg34 commented Jun 18, 2024

Hi vgteam,

This is my first time using SequenceTubeMap and vg and I want if possible to explain to me the following figures. I did generate a vg (variant graph) from FASTA sequences from 4 different species, and I don't know how to interpret the following figures.
To begin with, the first figure is an output of using the vg index vg.xg and gbwt.xg files. Apart from the reference sequence, the rest do not contain any actual name, but instead I get thread 0, thread 1 and thread 2 as a sequence-path name. Did the FASTA headers get renamed during the creation of the variant graph? Do you know how to identify which sequences correspond to the threads? Or am I looking at something else? As far as I understanf that a "thread" refers to a specific sequence or path within the variation graph.

graph(2)

In the next figure, I used the vg file instead of the vg.xg and gbwt.xg file, resulting in a different output. This time, I see multiple paths/haplotypes in addition to the regular four. What are these additional paths, are they alternative haplotypes that are being generated by vg? What is the proper way to visualize the variation graph?
graph(1)

Any help will be valuable. Thank you for your time.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant