-
Notifications
You must be signed in to change notification settings - Fork 7
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Create Conventional statistics vs machine learning
- Loading branch information
Showing
1 changed file
with
19 additions
and
0 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,19 @@ | ||
The similar concepts between conventional statistics and machine learning | ||
|
||
|
||
1. To determine the quality of data: | ||
|
||
a) effect size or power analysis (Cox regression model) is done using conventional statistics. | ||
The method used is residual analysis, if necessary standardized residual error will be performed using linear regression. | ||
|
||
b) In machine learning, model evaluation is performed using different algorithms | ||
The R codes for model evaluation is in the file (model evaluation) | ||
|
||
2. To determine the significance of factors in the dataset relating to the target variable (dependent variable): | ||
|
||
a) Chi sqauare test/ logistic regression is done in conventional statistics to select the significant variable | ||
|
||
b) Variable importance is done in machine learning to determine the rank of variables based on importance score | ||
(refer to variable importance R code) | ||
|
||
3. Survival analysis R codes are under the survival analysis file. |