A major update to the OpenProblems framework, switching from a Python-based framework to a Viash + Nextflow-based framework. This update features the same concepts as the previous version, but with a new implementation that is more flexible, scalable, and maintainable.
Most relevant parts of the overall structure:
-
src/tasks
: Benchmarking tasks:batch_integration
: Batch integrationdenoising
: Denoisingdimensionality_reduction
: Dimensionality reductionmatch_modalities
: Match modalitiespredict_modality
: Predict modalityspatial_decomposition
: Spatial decompositionspatially_variable_genes
: Spatially variable genes
-
src/datasets
: Components for creating common datasets. Loaders:cellxgene_census
: Query cells from a CellxGene Censusopenproblems_neurips2021_bmmc
: Fetch a dataset from the OpenProblems NeurIPS2021 competitionopenproblems_neurips2022_pbmc
: Fetch a dataset from the OpenProblems NeurIPS2022 competitionopenproblems_v1
: Fetch a legacy OpenProblems v1 datasetopenproblems_v1_multimodal
: Fetch a legacy OpenProblems v1 multimodal datasettenx_vision
: Fetch a and convert 10x Visium datasetzenodo_spatial
: Fetch and process an Anndata file containing DBiT seq, MERFISH, seqFISH, Slide-seq v2, STARmap, and Stereo-seq data from Zenodo.zenodo_spatial_slidetags
: Download a compressed file containing gene expression matrix and spatial locations from zenodo.
-
src/common
: Common components used by all tasks.check_dataset_schema
: Check whether an h5ad dataset adheres to a dataset schemacheck_yaml_schema
: Check whether a YAML adheres to a JSON schemacomp_tests
: Reusable component unit testscreate_component
: Create a component Viash component.create_task_readme
: Create a README for an OpenProblems task.extract_metadata
: Extract the.uns
metadata from an h5ad file.helper_functions
: Commonly used helper functions in Python or in R,process_task_results
: Process the raw tasks results (containing raw logs, unprocessed component configs, and various metrics) into nicely formatted task results.schemas
: JSON schemas for YAML files in the repositorysync_test_resources
: Synchronise the test resources from s3 to resources_test
For more information related to the structure of this repository, see the documentation.