You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
This page should list all available assemblies for one ore multiple selected species.
Column Name
json key
Always Visibly
Visible by default
Could be empty
Type
Description
Y
Y
checkbox
N
option used to select one or more assemblies/genomes
Y
Y
Action Buttons
N
buttons making it possible to perform different tasks, configurable
Assembly ID
accession
Y
Y
N
String
Genus and Species name
Name
name
Y
Y
N
String
Genus and Species name
Taxid
taxon_id
Y
Y
N
Number
Taxonomic identifier
Type
assembly_type
Y
Y
N
String
describe type of assembly
Release Data
Y
Y
Y
Date
date for when the assembly was published
N50
n50
N
Y
Y
Number
n50 (scaffold) value
Scaffolds
scaffolds
N
Y
Y
Number
number of scaffolds
Suggestion of possible action buttons:
Analyze --> forward to a page where one workflow can be selected to be run on galaxy
UCSC = Link to the UCSC Genome Browser
NCBI = Link to NCBI Datasets
EBI = Link to EBI
Some thoughts:
For the first column, it could also be an option to instead have it possible to select multiple rows in the table by clicking on a row.
The action buttons should be configurable. We could maybe have entries on the data structure for each assembly, like {'name': 'NCBI, 'url': 'address to open'}
When selecting multiple assemblies/genomes the user should have the option to enter a comparative page. The question is how we provide this option. A few possibilities are:
The user click on one of the selected assemblies/genomes Analyze button
The Analyze buttons of the selected assemblies/genome have there name transformed to Comparative analyze to make it easier to see that a comparative page will be entered.
A Comparative Analyze button pops up somewhere on the page
Assembly information can be retrieved from /genome/accession/{accessions}/dataset_report
have divided the implementation into three phases:
Initial Setup – The goal of this phase is to get a basic version up and running. This will make it possible to implement the analysis and comparative analysis page in the next step.
Enhancements using NCBI API – Once we have a simple page working, we will expand it by incorporating additional information from the NCBI REST API.
Additional Data Integration – In this final phase, we will add assemblies found on VGP S3 bucket.
Initial Setup (Minimum feature)
The following columns should be included:
Assembly ID
Name
Type (would be nice but could be moved to next phase)
And make it possible to select one or multiple assemblies/genomes and then move to the analyze pages.
Enhancements using NCBI API
Add the remaining columns to the table including action buttons.
In these step we need to extend the data extraction script to fetch more information from the NCBI API
Additional Data Integration
We want to make it possible to also select assemblies/genomes stored on the VGP S3 bucket, preferably it should be have like this:
show only NCBI available assemblies/genomes if they exist.
if no NCBI assemblies/genomes exist show the VGP (S3 bucket) assemblies
If both NCBI and VGP (S3 bucket) assemblies/genomes exists:
show the NCBI be default
make it possible to also show VGP (S3 bucket) by for example click on a button.
The script for generating a data blob for VGP data could be a new one that outputs a JSON/YAML file. I don't have a strong preference on whether we create a separate script to merge VGP data with NCBI or modify the existing NCBI data script to take the new file as input.
The text was updated successfully, but these errors were encountered:
This page should list all available assemblies for one ore multiple selected species.
Suggestion of possible action buttons:
Some thoughts:
{'name': 'NCBI, 'url': 'address to open'}
Assembly information can be retrieved from
/genome/accession/{accessions}/dataset_report
have divided the implementation into three phases:
Initial Setup (Minimum feature)
The following columns should be included:
And make it possible to select one or multiple assemblies/genomes and then move to the analyze pages.
Enhancements using NCBI API
Add the remaining columns to the table including action buttons.
In these step we need to extend the data extraction script to fetch more information from the NCBI API
Additional Data Integration
We want to make it possible to also select assemblies/genomes stored on the VGP S3 bucket, preferably it should be have like this:
The script for generating a data blob for VGP data could be a new one that outputs a JSON/YAML file. I don't have a strong preference on whether we create a separate script to merge VGP data with NCBI or modify the existing NCBI data script to take the new file as input.
The text was updated successfully, but these errors were encountered: