e-learning
dplyr & tidyverse for data processing
Abstract
dplyr is a powerful R-package to transform and summarize tabular data with rows and columns. It is part of a group of packages (including ggplot2
) called the tidyverse
, a collection of packages for data processing and visualisation. For further exploration please see the dplyr package vignette: Introduction to dplyr
About This Material
This is a Hands-on Tutorial from the GTN which is usable either for individual self-study, or as a teaching material in a classroom.
Questions this will address
- How can I load tabular data into R?
- How can I slice and dice the data to ask questions?
Learning Objectives
- Read data with the built-in
read.csv
- Read data with dplyr's
read_csv
- Use dplyr and tidyverse functions to cleanup data.
Licence: Creative Commons Attribution 4.0 International
Keywords: Foundations of Data Science, R, jupyter-notebook, rmarkdown-notebook
Target audience: Students
Resource type: e-learning
Version: 6
Status: Active
Prerequisites:
- Advanced R in Galaxy
- R basics in Galaxy
Learning objectives:
- Read data with the built-in
read.csv
- Read data with dplyr's
read_csv
- Use dplyr and tidyverse functions to cleanup data.
Date modified: 2024-10-15
Date published: 2021-10-20
Contributors: Anthony Bretaudeau, Helena Rasche, Saskia Hiltemann
Scientific topics: Software engineering
Activity log