Skip to content

Commit

Permalink
Update README.md
Browse files Browse the repository at this point in the history
  • Loading branch information
rvanguri authored Aug 7, 2017
1 parent bf3563d commit 9830eff
Showing 1 changed file with 19 additions and 0 deletions.
19 changes: 19 additions & 0 deletions condor/README.md
Original file line number Diff line number Diff line change
@@ -1,3 +1,22 @@
# Dataset preparation (if not using AEOLUS dataset provided)

## Import CSV file with reports
To run the nsides back-end framework on a custom dataset, first make a CSV file with the following fields:
`drug_concept_id, outcome_concept_id, report_id`
Drugs and outcomes do not need to use any specific vocabulary.

Next, run `python import_custom_data.py --data yourdata.csv`

This creates the following files: `drugmap.npy`, `alldrugstrings.npy`, `allreports.npy`, `outcomemap.npy`, `alloutcomestrings.npy`

## Build feature matrix
To build the drug and outcome feature matrices, run:
`python build_drugs.py`
`python build_outcomes.py`
`python combine_matrices.py`

The result should be 2 files: `CUSTOM_all_reports_outcomes.npy` and `CUSTOM_all_reports.npy` which will be used for submitting jobs.

# Submission of condor jobs to Open Science Grid

## How to submit jobs
Expand Down

0 comments on commit 9830eff

Please sign in to comment.