add a csv file with alignment information #47

roblanf · 2023-08-30T04:30:29Z

At the moment all the useful information is in nexus format, which can be annoying to work with.

E.g. we have this:

begin SETS;

	[partitions]
	CHARSET	COI_1stpos = 1-1592\3;
	CHARSET	COI_2ndpos = 2-1592\3;
	CHARSET	COI_3rdpos = 3-1592\3;
	CHARSET	16S = 1593-3037;

	[loci]
	CHARPARTITION COI = 1:COI_1stpos, 2:COI_2ndpos, 3:COI_3rdpos;
	CHARPARTITION 16S = 1:16S;

	CHARPARTITION loci = 1:COI, 2:16S;

	[genomes]
	CHARPARTITION	mitochondrial_genome = 1:COI, 2:16S;

	CHARPARTITION genomes = 1:mitochondrial_genome;

But this could be represented as a csv file with the following columns:

alignment_name (e.g. "Anderson_2012")
partition (e.g. "COI_1stpos")
partition_sites (e.g. "1-1592\3")
locus (e.g. "COI")
genome (e.g. "mitochondrial")

We could then use the csv file when entering the data, and build the nexus block directly from the csv file.

The text was updated successfully, but these errors were encountered:

roblanf · 2023-08-30T04:42:50Z

also include a column for 'datatype' e.g. DNA, AA, etc. This comes from the top of the nexus alignment file.

DS4B-ANU · 2023-11-21T00:11:50Z

include a column for codon position too (NA if it's not a codon position), so now the columns are:

alignment_name (e.g. "Anderson_2012")
partition_name (e.g. "COI_1stpos")
partition_start (e.g. 1)
partition_end (e.g. 100)
partition_skip (e.g. 3; so if start is 1, end is 100, and skip is 3, the nexus format would be 1-100\3)
locus_ name (e.g. "COI")
genome (e.g. "mitochondrial")
data_type (e.g. "DNA", "AA", "RNA")
codon_position (e.g. 1, 2, 3, or NA when it's not protein-coding)

Write correct csv according to #47

HuaiyanRen added a commit that referenced this issue Dec 4, 2023

Update generate_charset_csv.py

95c44fd

Write correct csv according to #47

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

add a csv file with alignment information #47

add a csv file with alignment information #47

roblanf commented Aug 30, 2023

roblanf commented Aug 30, 2023

DS4B-ANU commented Nov 21, 2023

add a csv file with alignment information #47

add a csv file with alignment information #47

Comments

roblanf commented Aug 30, 2023

roblanf commented Aug 30, 2023

DS4B-ANU commented Nov 21, 2023