-
Notifications
You must be signed in to change notification settings - Fork 0
/
Data_Wrangling_1.Rmd
54 lines (31 loc) · 2.22 KB
/
Data_Wrangling_1.Rmd
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
## Data Wrangling Part 1
![](images/wrangler.png){width=2in}
![](images/tidyR.png){width=1in}
![](images/dplyr.png){width=1.5in}
The video below gives a brief overview of the content covered in this section of the tutorial. All the [Data Wrangling slides](http://htmlpreview.github.io/?https://github.com/CWWhitney/teaching_R/blob/master/Data_Wrangling_Rpres/Data_Wrangling.html#/) are also available as a web page.
<!-- ![](https://youtu.be/j6XRof3q9aM) -->
<iframe width="560" height="315" src="https://www.youtube.com/embed/j6XRof3q9aM" frameborder="0" allowfullscreen></iframe>
### Notes on R
- Tidy code style using tidyR
- Clean and intuitive functions using dplyr
- Concise code using magrittr 'Ceci n'est pas une pipe'
![](images/magrittr.png){width=1in}
### Notes on R: About process
- "[...] writing R code is a hedonistically artistic, left-brained, paint-in-your-hair sort of experience [...]
- learn how to code the same way we learned how to catch salamanders as children – trial and error, flipping over rocks till we get a reward [...]
- once the ecstasy of creation has swept over us, we awake late the next morning to find our canvas covered with 2100 lines of R code [...]
- Heads throbbing with a statistical absinthe hangover, we trudge through it slowly over days, trying to figure out what we did."
*Andrew MacDonald*
### Notes on R: Focus
![](images/Focus.png){width=5in}
### Notes on R: Keeping track of work
![](images/tidyR.png){width=1in}
Keep it tidy
When writing .R files use the `#` symbol to annotate and not run lines. This is a great way to make notes for others and for future you.
We will talk later about using other file types like `Rmarkdown` for organizing R script and other associated languages. There it will be possible to add a lot more information and text, citations etc. An .R file is intended for code but we can still keep it organized in sections by ending headers with `----` or `####` annotation.
`# Section 1 ----`
`# Section 2 ####`
`# Section 3 ####`
Look for the `Table of contents` in the upper right console of the RStudio scripting pane (next to the `Run` button).
![](images/toc_rstudio.png){width=2in}
Read more tips in the [Tidyverse Style guide](http://style.tidyverse.org/).