├── .gitignore ├── comments.md ├── tibbles_solutions.rmd ├── visualize_soutions.Rmd └── transform_solutions.Rmd /.gitignore: -------------------------------------------------------------------------------- 1 | .Rproj.user 2 | .Rhistory 3 | .RData 4 | .Ruserdata 5 | -------------------------------------------------------------------------------- /comments.md: -------------------------------------------------------------------------------- 1 | # Comments 2 | - The pie chart question seems like a bad idea. We could not find a clear, elegant answer, and it ends exposing the workings of geom_bar in a way that might be unwise. 3 | 4 | - cumulative logical operators probably should not be covered so early. They are a bit difficult to explain, relative to how often they are used. In addition, it seems unwise to introduce row order depended operators before arrange. 5 | 6 | - Modular arithmetic should be introduced later than it is. It is an important topic in general programming, but it is not a common function in data science. `ceiling`, `floor`, `trunc`, `round` and `signif` can achieve most or all of the same effects, and can be applied to floating point numbers 7 | 8 | - Converting timestamps to times is probably a bit involved for this point in the book, unless you want to give a feel for messy data 9 | 10 | - Not clear what delay means in chapter 4 11 | - modular arithmetic stuff can probably go 12 | -------------------------------------------------------------------------------- /tibbles_solutions.rmd: -------------------------------------------------------------------------------- 1 | # `Tibble` Exercises 2 | 3 | ```{r} 4 | library(tibble) 5 | ``` 6 | 7 | ## Exercise 1 8 | 9 | ```{r} 10 | # load the data 11 | data(mtcars) 12 | 13 | is(mtcars) # the type is data.frame 14 | 15 | mtcars %>% as_tibble() 16 | ``` 17 | 18 | ## Exercise 2 19 | 20 | ## Exercise 2.1 21 | 22 | ```{r} 23 | library(ggplot2) 24 | 25 | annoying <- tibble( 26 | `1` = 1:10, 27 | `2` = `1` * 2 + rnorm(length(`1`)) 28 | ) 29 | 30 | ggplot(data = annoying) + 31 | geom_point(aes(x = `1`, y = `2`)) 32 | ``` 33 | 34 | ## Exercise 2.2 35 | 36 | ```{r} 37 | annoying$`3` <- annoying$`2` / annoying$`1` 38 | annoying 39 | ``` 40 | 41 | ## Exercise 2.4 42 | 43 | Has to be done before the renaming. 44 | 45 | ```{r} 46 | annoying$`1` 47 | annoying 48 | ``` 49 | 50 | ## Exercise 2.3 51 | 52 | ```{r} 53 | colnames(annoying) <- c("one", "two", "three") 54 | annoying 55 | ``` 56 | 57 | ## Exercise 3 58 | 59 | It is like a named vector that is converted into a data_frame/tibble. We might use it before we use `mutate`. 60 | 61 | ```{r} 62 | v <- 1:10 63 | names(v) <- letters[v] 64 | 65 | d <- enframe(v) 66 | d 67 | ``` 68 | 69 | ## Exercise 4 70 | 71 | In the example below `tibble.max_extra_cols` is set to `100` and defines how many additional column names are printed 72 | 73 | ```{r} 74 | options(tibble.max_extra_cols = 100) 75 | ``` 76 | -------------------------------------------------------------------------------- /visualize_soutions.Rmd: -------------------------------------------------------------------------------- 1 | --- 2 | title: "visualize_soutions" 3 | output: html_document 4 | --- 5 | ```{r setup, include=FALSE} 6 | knitr::opts_chunk$set(echo = TRUE) 7 | library(ggplot2) 8 | ``` 9 | 10 | 11 | # 3.2.1 12 | ### Run `ggplot(data = mpg)` what do you see? 13 | ```{r} 14 | ggplot(data = mpg) 15 | ``` 16 | 17 | A ggplot with no aesthetics just shows a grey square, since it produces a background with no graph on it. 18 | 19 | ### What does the `drv` variable describe? Read the help for `?mpg` to find out. 20 | ```{r} 21 | ?mpg 22 | ``` 23 | The variable `drv` says which wheels [drive](https://en.wikipedia.org/wiki/Drive_wheel) the vehicle. 24 | Typing `?