Software for efficient analysis of large biological datasets. The analysis is performed with Apache Spark MLlib and Hadoop.

fcproj/BIGBIOCL

License: GPL v3

BIGBIOCL

Software for efficient analysis of large biological datasets. As an example, BIGBIOCL can be used with DNA methylation data to identify drivers of tumors.

The scientific publication about BIGBIOCL is available as: F. Celli, F. Cumbo, E. Weitschek: "Classification of Large DNA Methylation Datasets for Identifying Cancer Drivers". Big Data Research, 2018, doi:10.1016/j.bdr.2018.02.005.

The description of the project, of our experiments, and of the software is available in the wiki section.

Directories contain:

  • Experiments: results of our experiments
  • Software: the Java code to run standalone applications or submit Spark jobs
  • Support Files: mapping files and other files needed to replicate our experiments, or to run new ones

License

GNU General Public License version 3 (GPL-3.0)
