brandenkmurray/h2oDallas

1. Clone this repo.
2. Register on Kaggle (https://www.kaggle.com/account/register), then download the datasets (https://www.kaggle.com/c/bnp-paribas-cardif-claims-management/data).
3. Move train.csv and test.csv into your local repo directory.
4. Make sure all packages in packages.R are installed  
5. Run data_trans_31.R  
   1. To save time, the script by default samples 5000 rows from the train and test sets. If you change this default, you will need to uncomment the PAIRWISE CORRELATIONS section before running.
   2. If you have less than 16 GB of RAM, either sample fewer rows or lower the ‘threads’ argument (to 1 or 2) to reduce memory use.
6. Run layer1_xgb46.R  
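
Steps 4 and 5.1 can be checked by hand before running the scripts. The following is a minimal sketch in base R, not code from this repo: the helper names `missing_packages` and `sample_rows`, the fixed seed, and the example package names are all assumptions made for illustration. Substitute the packages actually listed in packages.R.

```r
# Hypothetical helper (not part of this repo): return the names of the
# packages in `pkgs` that cannot be loaded on this machine.
missing_packages <- function(pkgs) {
  ok <- vapply(pkgs, requireNamespace, logical(1), quietly = TRUE)
  pkgs[!ok]
}

# Hypothetical helper (not part of this repo): draw up to `n` random rows
# from a data frame. The seed is an assumption, used only for repeatability.
sample_rows <- function(df, n = 5000, seed = 1) {
  if (nrow(df) <= n) {
    return(df)
  }
  set.seed(seed)
  df[sample(nrow(df), n), , drop = FALSE]
}

# Example package names only; replace with the ones listed in packages.R.
print(missing_packages(c("stats", "utils")))

# Only runs once train.csv has been moved into the repo directory (step 3).
if (file.exists("train.csv")) {
  train_small <- sample_rows(read.csv("train.csv"))
  print(dim(train_small))
}
```

Any names printed by `missing_packages()` would need to be installed with `install.packages()` before running data_trans_31.R. Lowering `n` in `sample_rows()` mirrors the advice in step 5.2 for machines with less memory.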
