-
Notifications
You must be signed in to change notification settings - Fork 13
/
Copy pathREADME.Rmd
189 lines (132 loc) · 8.52 KB
/
README.Rmd
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
---
output: github_document
always_allow_html: true
---
<!-- README.md is generated from README.Rmd. Please edit that file -->
```{r, include = FALSE}
knitr::opts_chunk$set(
collapse = TRUE,
comment = "#>",
fig.path = "man/figures/README-"
)
```
# edibble <img src="man/figures/logo.png" align="right" alt="" width="120" />
<!-- badges: start -->
[![Lifecycle: experimental](https://img.shields.io/badge/lifecycle-experimental-orange.svg)](https://lifecycle.r-lib.org/articles/stages.html)
[![CRAN status](https://www.r-pkg.org/badges/version/edibble)](https://CRAN.R-project.org/package=edibble)
<!-- badges: end -->
## Installation
You can install the package from CRAN:
```r
install.packages("edibble")
```
You can install the development version with:
```{r, eval = FALSE}
# install.packages("remotes")
remotes::install_github("emitanaka/edibble")
```
## Overview
The goal of `edibble` R-package is to aid in the plan, design and simulation of experiments by mapping fundamental components of experiments to an object oriented system. The `edibble` system is built on the principle that the system must make it easy to recover experimental context by encouraging the user to be explicit about experimental details in fundamental terms.
## Examples
Consider an experiment where you want to know what is an effective way of teaching (flipped or traditional style) for teaching a particular subject and how different forms of exams (take-home, open-book or closed-book) affect student's marks.
There are four classes for this subject with each class holding 30 students. The teaching style can only be applied to the whole class but exam can be different for individual students.
```{r}
library(edibble)
set.seed(2020)
des <- design(name = "Effective teaching") %>%
set_units(class = 4,
student = nested_in(class, 30)) %>%
set_trts(style = c("flipped", "traditional"),
exam = c("take-home", "open-book", "closed-book")) %>%
allot_trts(style ~ class,
exam ~ student) %>%
assign_trts("random")
serve_table(des)
```
Before constructing the experiment, you might want to think about what you are recording for which level of unit and what values these variables can be recorded as.
```{r}
out <- des %>%
set_rcrds_of(student = c("exam_mark",
"quiz1_mark",
"quiz2_mark",
"gender"),
class = c("room",
"teacher")) %>%
expect_rcrds(exam_mark <= 100,
exam_mark >= 0,
quiz1_mark <= 15L,
quiz1_mark >= 0L,
quiz2_mark <= 30L,
quiz2_mark >= 0L,
factor(gender, levels = c("female", "male", "non-binary", "unknown"))) %>%
serve_table()
out
```
When you export the above edibble design using the `export_design` function, the variables you are recording are constraint to the values you expect, e.g. for factors, the cells have a drop-down menu to select from possible values.
```{r, eval = FALSE}
export_design(out, file = "/PATH/TO/FILE.xlsx")
```
```{r, echo = FALSE}
knitr::include_graphics("man/figures/README-excel_factor_output.png")
```
In addition, there is a spreadsheet for every observational level. E.g. here `room` and `teacher` is the same for all students in one class so rather than entering duplicate information, these are exported to another sheet for data entry.
```{r, echo = FALSE, out.width="400px"}
knitr::include_graphics("man/figures/README-excel_sheet_output.png")
```
There is also support for more complex nesting structures. You can always make the structure using edibble and take the resulting data frame to use in other experimental design software. It's also possible to bring existing data frame into edibble if you want to take advantage of the exporting feature in edibble.
```{r}
design("nesting structure") %>%
# there are 3 sites labelled A, B, C
set_units(site = c("A", "B", "C"),
# each site has 2 blocks except B with 3 sites
block = nested_in(site,
"B" ~ 3,
. ~ 2),
# levels can be specified by their number instead
# so for below "block1" has 30 plots,
# "block2" and "block3" has 40 plots,
# the rest of blocks have 20 plots.
plot = nested_in(block,
1 ~ 30,
c(2, 3) ~ 40,
. ~ 20)) %>%
serve_table()
```
## Experimental data
tidyverse is well suited for the data science project workflow as illustrated below in (B) (from [Grolemund and Wickham 2017](https://r4ds.had.co.nz/introduction.html)). For experimental data, the statistical aspect begins before obtaining data as depicted below in (A). The focus of `edibble` is to facilitate work in (A).
<img src="man/figures/design-analysis-flow.png">
The edibble R-package differ considerably to other packages for constructing experimental design with a focus on the whole process and less on the randomisation process (which the other software generally focus and do well on). Some features include:
* declaratively create experimental designs based on experimental components (e.g. units and treatments),
* explicitly specify variables that are to be recorded (e.g. response), and
* set expected values for variables to be recorded which restrict the data entry when the design is exported as an xlsx file,
* simulate values for record variables,
* make classical named designs see [Cookbook chapter](https://emitanaka.org/edibble-book/cookbook.html).
## Documents
- Work-in-progress book on this package can be found [here](https://emitanaka.org/edibble-book/).
- The preprint about the package can be found [here](https://emitanaka.org/paper-edibble/).
## Limitations
Currently, edibble:
- expects you to know the number of units available from the start. Unknown numbers will be supported in future versions.
- does not do enough testing so design should be diagnosed after construction (which should be done regardless of how much testing edibble implements).
## Related Work
The way that edibble specifies experimental design is largely novel (if I say so myself) and there are no work that resembles it. I'm concurrently working on two extension packages:
* `deggust` - to visualise the designs constructed from edibble as ggplot2 objects (WIP).
* `sizzled` - for experiments that require sample size calculation (WIP).
Below are some other related work. You can also have a look at the [CRAN Task View for Design of Experiment and Analysis of Experimental Data](https://CRAN.R-project.org/view=ExperimentalDesign) for a whole collection.
* `DeclareDesign` for survey or sampling designs
* `designr` for balanced factorial designs with crossed and nested random and fixed effect to data frame
* `dae` for functions useful in the design and ANOVA of experiments (this is in fact powering the randomisation in edibble)
* `plotdesignr` for designing agronomic field experiments
## Acknowledgement
edibble is hugely inspired by the work of [Tidyverse Team](https://joss.theoj.org/papers/10.21105/joss.01686). I'm grateful for the dedication and work by the Tidyverse Team, as well as [R Development Core Team](https://www.r-project.org/contributors.html) that supports the core R ecosystem, that made developing this package possible.
## Tidyverse familiarity
The implementation in edibble adopt a similar nomenclature and design philosophy as tidyverse (and where it does not, it's likely my shortcoming) so that tidyverse users can leverage their familiarity of the tidyverse language when using edibble. Specifically, edibble follows the philosophy:
- main functions do one thing and have a consistent form of `<verb>_<noun>` (e.g. `set_units` and `set_rcrds`) where the nouns are generally plural. Exceptions are when the subject matter is clearly singular (e.g. `design` and `set_context`);
- pipable functions;
- all dots arguments are [dynamic dots](https://rlang.r-lib.org/reference/dyn-dots.html);
- duplicate names repaired with same option as `tibble` for additions to edibble graph;
- ability for developers to extend certain components.
- the specification of complex nested structure drawing similarity to `dplyr::case_when` (LHS is character or integer for edibble however).
## Code of Conduct
Please note that the edibble project is released with a [Contributor Code of Conduct](https://contributor-covenant.org/version/2/0/CODE_OF_CONDUCT.html).
By contributing to this project, you agree to abide by its terms.