Welcome to Data Manipulation with dplyr
Data manipulation is a crucial part of data visualization and analysis. This workshop introduces common functions in the dplyr package to manipulate rows and columns.
See the UBC library Research Commons event calendar for related workshops on various topics.
Prerequisites
This workshop assumes a basic understanding of R language, but all learners are welcome to participate.
Participants are expected to have installed the latest versions of R and RStudio on their computers before the workshop:
Install R from https://cran.rstudio.com/
Install RStudio from https://rstudio.com/products/rstudio/download/#download
Please note that while we go through many examples during the workshop, we do not cover all of the operations available in dplyr.
Learning objectives
At the end of this workshop, you will be able to:
- Understand what the dplyr package is used for
- Use pipe operator to combine multiple manipulations together
- Manipulate rows to select, arrange, and add cases
- Manipulate columns to select, arrange, add, and rename variables
Schedule
0:00 - 0:05 | Welcome and using Zoom |
0:05 - 0:15 | Intro to dplyr |
0:15 - 0:35 | Manipulating rows |
0:35 - 0:50 | Manipulating columns |
0:55 - 1:00 | Recap |
1:00 - 1:30 | Optional Q&A |
Resources
- dplyr Cheat Sheet
- vignette(“dplyr”)
- Chapter 4 Data transformation in the book R for Data Science (2e) by Hadley Wickham and Garrett Grolemund
- Chapter 6 Introduction to dplyr in the textbook for STAT 545 by Jenny Bryan and The STAT 545 TAs