autoNeuro

Pipeline to run experiments with tabular data for classification tasks (Major Depressive Disorder vs Healthy Control, Schizophrenia vs Healthy Control).

The pipeline consists of several steps:

  1. GridSearchBase - searches for the best ML model (XGBClassifier, SVC, RandomForestClassifier, LogisticRegression) combined with the best feature selection method (SelectKBest, RandomForestClassifier feature importances, or LogisticRegression coefficients for selecting the top features). It is located in the core/gridcv.py module. You can configure the grid search parameters for each model in GRID_CONFIG_MODELS in the core/constants.py module; see the sketch after this list.
  2. ExperimentsInfo - calculates the most important features for the best methods, builds ROC curves and other metrics, and saves the important features as a DataFrame to EXPERIMENTS_PATH (experiments.py module). It is located in the core/metrics.py module.
  3. FeaturesStats - calculates the distributions and post-hoc t-tests for the most important features found in the previous step. It is located in core/stats.py; a minimal statistics sketch also follows the list.
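
To picture what step 1 does conceptually, here is a minimal sketch of a grid search over a scikit-learn Pipeline that combines a feature selection step with a classifier. The step names, parameter grids and toy data below are assumptions for illustration only; the repository's actual configuration lives in GRID_CONFIG_MODELS in core/constants.py and may differ.

```python
# Illustrative sketch of the grid-search idea behind GridSearchBase:
# a Pipeline with a feature-selection step and a classifier, tuned with GridSearchCV.
# All names, grids, and data below are assumptions, not the repository's actual values.
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.feature_selection import SelectKBest, f_classif
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import GridSearchCV
from sklearn.pipeline import Pipeline

X, y = make_classification(n_samples=200, n_features=50, random_state=0)  # toy data

pipeline = Pipeline([
    ("selector", SelectKBest(score_func=f_classif)),
    ("model", LogisticRegression(max_iter=1000)),
])

# One grid per model family, in the spirit of GRID_CONFIG_MODELS in core/constants.py.
param_grid = [
    {
        "selector__k": [10, 20, "all"],
        "model": [LogisticRegression(max_iter=1000)],
        "model__C": [0.01, 0.1, 1.0],
    },
    {
        "selector__k": [10, 20, "all"],
        "model": [RandomForestClassifier(n_estimators=200, random_state=0)],
        "model__max_depth": [None, 5],
    },
]

search = GridSearchCV(pipeline, param_grid, scoring="f1", cv=5)
search.fit(X, y)
print(search.best_params_, search.best_score_)
```

Step 3's per-feature statistics can likewise be pictured as a group comparison per selected feature. The function below is only a sketch; the actual implementation is in core/stats.py.

```python
# Minimal sketch of a per-feature group comparison in the spirit of FeaturesStats.
# The actual implementation lives in core/stats.py; this is an illustration only.
import pandas as pd
from scipy import stats

def feature_group_test(X: pd.DataFrame, y: pd.Series, feature: str) -> dict:
    """Welch's t-test between the two diagnostic groups for one selected feature."""
    group0 = X.loc[y == 0, feature]
    group1 = X.loc[y == 1, feature]
    t_stat, p_value = stats.ttest_ind(group0, group1, equal_var=False)
    return {"feature": feature, "t": t_stat, "p": p_value}
```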

How to run

You can run the code using experiments.py or see examples in the Jupyter notebooks in the experiments package.

  X - pd.DataFrame with numeric data
  y - pd.Series with target values
  experiment_name - name of the experiment (results are saved in a directory with this name)

  best_result, best_f1 = run(X, y, experiment_name=file.stem, repeats=10, topN=1, scaling=False)
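
A fuller end-to-end usage sketch is shown below. The CSV path, column names, label encoding, and the import path for run are assumptions for illustration; run and its arguments are as documented above.

```python
# Hypothetical end-to-end usage of run(). The file path, column names,
# label encoding, and import path are illustrative assumptions.
import pandas as pd

from experiments import run  # assumed import path for run()

df = pd.read_csv("data/mdd_vs_hc.csv")             # hypothetical dataset
y = (df["diagnosis"] == "MDD").astype(int)         # hypothetical target column
X = df.drop(columns=["diagnosis"]).select_dtypes("number")

best_result, best_f1 = run(
    X, y,
    experiment_name="mdd_vs_hc",  # results are saved in a directory with this name
    repeats=10,                   # as in the example above
    topN=1,                       # as in the example above
    scaling=False,                # disable feature scaling, as in the example above
)
print(best_result, best_f1)
```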
