-
Notifications
You must be signed in to change notification settings - Fork 2
/
Copy pathREADME.Rmd
131 lines (99 loc) · 3.88 KB
/
README.Rmd
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
---
output: github_document
---
<!-- README.md is generated from README.Rmd. Please edit that file -->
```{r, include = FALSE}
knitr::opts_chunk$set(
collapse = TRUE,
comment = "#>",
fig.path = "man/figures/README-",
out.width = "100%"
)
```
# agua <a href="https://agua.tidymodels.org/"><img src="man/figures/logo.svg" align="right" height="139" /></a>
<!-- badges: start -->
[![Codecov test coverage](https://codecov.io/gh/tidymodels/agua/branch/main/graph/badge.svg)](https://app.codecov.io/gh/tidymodels/agua?branch=main)
[![R-CMD-check](https://github.com/tidymodels/agua/actions/workflows/R-CMD-check.yaml/badge.svg)](https://github.com/tidymodels/agua/actions/workflows/R-CMD-check.yaml)
<!-- badges: end -->
agua enables users to fit, optimize, and evaluate models via [H2O](https://h2o.ai/) using tidymodels syntax. Most users will not have to use aqua directly; the features can be accessed via the new parsnip computational engine `'h2o'`.
There are two main components in agua:
* New parsnip engine `'h2o'` for many models, see [Get started](https://agua.tidymodels.org/articles/agua.html) for a complete list.
* Infrastructure for the tune package.
When fitting a parsnip model, the data are passed to the h2o server directly. For tuning, the data are passed once and instructions are given to `h2o.grid()` to process them.
This work is based on @stevenpawley's [h2oparsnip](https://github.com/stevenpawley/h2oparsnip) package. Additional work was done by Qiushi Yan for his 2022 summer internship at RStudio.
## Installation
The CRAN version of the package can be installed via
```r
install.packages("agua")
```
You can also install the development version of agua using:
``` r
require(pak)
pak::pak("tidymodels/agua")
```
## Examples
The following code demonstrates how to create a single model on the h2o server and how to make predictions.
```r
library(tidymodels)
library(agua)
library(h2o)
tidymodels_prefer()
```
```r
# Start the h2o server before running models
h2o_start()
# Demonstrate fitting parsnip models:
# Specify the type of model and the h2o engine
spec <-
rand_forest(mtry = 3, trees = 1000) %>%
set_engine("h2o") %>%
set_mode("regression")
# Fit the model on the h2o server
set.seed(1)
mod <- fit(spec, mpg ~ ., data = mtcars)
mod
#> parsnip model object
#>
#> Model Details:
#> ==============
#>
#> H2ORegressionModel: drf
#> Model ID: DRF_model_R_1656520956148_1
#> Model Summary:
#> number_of_trees number_of_internal_trees model_size_in_bytes min_depth
#> 1 1000 1000 285914 4
#> max_depth mean_depth min_leaves max_leaves mean_leaves
#> 1 10 6.70600 10 27 18.04100
#>
#>
#> H2ORegressionMetrics: drf
#> ** Reported on training data. **
#> ** Metrics reported on Out-Of-Bag training samples **
#>
#> MSE: 4.354249
#> RMSE: 2.086684
#> MAE: 1.657823
#> RMSLE: 0.09848976
#> Mean Residual Deviance : 4.354249
# Predictions
predict(mod, head(mtcars))
#> # A tibble: 6 × 1
#> .pred
#> <dbl>
#> 1 20.9
#> 2 20.8
#> 3 23.3
#> 4 20.4
#> 5 17.9
#> 6 18.7
# When done
h2o_end()
```
Before using the `'h2o'` engine, users need to run `agua::h2o_start()` or `h2o::h2o.init()` to start the h2o server, which will be storing data, models, and other values passed from the R session.
There are several package vignettes including:
- [Introduction to agua](https://agua.tidymodels.org/articles/agua.html)
- [Model tuning](https://agua.tidymodels.org/articles/tune.html)
- [Automatic machine learning](https://agua.tidymodels.org/articles/auto_ml.html)
- [Parallel processing with agua and h2o](https://agua.tidymodels.org/articles/parallel.html)
## Code of Conduct
Please note that the agua project is released with a [Contributor Code of Conduct](https://contributor-covenant.org/version/2/0/CODE_OF_CONDUCT.html). By contributing to this project, you agree to abide by its terms.