dplyr & tidyverse for data processing

Abstract

dplyr is a powerful R package for transforming and summarizing tabular data organized in rows and columns. It belongs to the tidyverse, a collection of packages (including ggplot2) for data processing and visualisation. For further exploration, see the dplyr package vignette "Introduction to dplyr".
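As a rough illustration of what "transform and summarize" can look like in practice, the sketch below filters, groups, and summarises the `mtcars` data frame that ships with R. The dataset and the `hp > 100` threshold are assumptions chosen for illustration only and are not taken from the tutorial itself.

```r
library(dplyr)

# mtcars is a built-in R dataset; it stands in here for any tabular data.
summary_by_cyl <- mtcars %>%
  filter(hp > 100) %>%               # keep rows (cars) with more than 100 horsepower
  group_by(cyl) %>%                  # one group per cylinder count
  summarise(
    n_cars   = n(),                  # number of cars in each group
    mean_mpg = mean(mpg)             # average fuel economy in each group
  )

print(summary_by_cyl)
```

Each verb takes a data frame and returns a data frame, which is why the steps can be chained with the pipe.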

About This Material

This is a hands-on tutorial from the GTN (Galaxy Training Network), suitable for individual self-study or as teaching material in a classroom.

Questions this will address

  • How can I load tabular data into R?
  • How can I slice and dice the data to ask questions?

Learning Objectives

  • Read data with the built-in read.csv
  • Read data with the tidyverse's read_csv (provided by the readr package)
  • Use dplyr and tidyverse functions to clean up data.
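The two reading functions named in the objectives can be compared side by side with a minimal sketch. The inline CSV text and its column names (`sample`, `count`) are hypothetical; a real analysis would normally pass a file path or URL instead. Note that `read_csv` lives in the readr package, which is loaded as part of the tidyverse, and that the `I()` wrapper for literal data assumes readr 2.0 or later.

```r
library(readr)

# Hypothetical inline data, used so the example needs no external file.
csv_text <- "sample,count\nA,10\nB,25\n"

# Base R: read.csv returns a plain data.frame; text= supplies literal CSV content.
base_df <- read.csv(text = csv_text)

# readr: read_csv returns a tibble; I() marks the string as literal data
# rather than a file path (assumes readr >= 2.0).
tidy_df <- read_csv(I(csv_text), show_col_types = FALSE)

str(base_df)
print(tidy_df)
```

Both calls produce the same two rows; the main visible difference is that the readr result is a tibble, which prints more compactly and reports column types.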

Licence: Creative Commons Attribution 4.0 International

Keywords: Foundations of Data Science, R, jupyter-notebook, rmarkdown-notebook

Target audience: Students

Resource type: e-learning

Version: 6

Status: Active

Prerequisites:

  • Advanced R in Galaxy
  • R basics in Galaxy

Date modified: 2024-10-15

Date published: 2021-10-20

Authors: Avans Hogeschool, Helena Rasche

Contributors: Anthony Bretaudeau, Helena Rasche, Saskia Hiltemann

Scientific topics: Software engineering

