Course
Data Manipulation with dplyr
Included withPremium or Teams
Create Your Free Account
or
By continuing, you accept our Terms of Use, our Privacy Policy and that your data is stored in the USA.Loved by learners at thousands of companies
Training 2 or more people?
Try DataCamp for BusinessCourse Description
First steps: Transforming data with dplyr
This course is designed to teach users how to efficiently manipulate and transform data using the dplyr package in R.First, explore fundamental data transformation techniques, including the use of key dplyr verbs like select, filter, arrange, and mutate. These functions will teach you how to modify datasets by selecting specific columns, filtering rows based on conditions, sorting data, and creating new calculated columns.
Aggregating data with dplyr
Next, the course covers data aggregation, teaching users how to summarize and condense data for better interpretation.You’ll understand how to make your data more interpretable and manageable. Functions such as count, group_by, and summarize are introduced to perform operations that aggregate many observations into meaningful summaries, essential for data analysis and reporting.
Selecting and transforming data
Finally, you will learn advanced data selection and transformation techniques, such as using select helpers and the rename verb. You will also get to apply your skills to a real-world case study and practice grouped mutates, window functions, and data visualization with ggplot2.By the end of the course, you will have developed robust data manipulation skills using dplyr, enabling more efficient and effective data analysis—a vital capability for any data analyst or scientist.
Feels like what you want to learn?
Start Course for FreeWhat you'll learn
- Assess grouped mutate operations and window functions to compute intra-group metrics such as year-over-year changes
- Differentiate between count(), group_by + summarize(), and slice_min/slice_max when aggregating or extracting extreme observations
- Evaluate multi-step dplyr pipelines that integrate several verbs to generate analytical insights from the counties and babynames datasets.
- Identify the appropriate dplyr verb to perform specific data transformations involving selection, filtering, arrangement, and mutation
- Recognize how select helpers, rename(), and relocate() alter column selection, naming, and ordering within a tibble
Prerequisites
Introduction to the TidyverseTransforming Data with dplyr
Aggregating Data
Selecting and Transforming Data
Case Study: The babynames Dataset
Complete
Earn Statement of Accomplishment
Add this credential to your LinkedIn profile, resume, or CVShare it on social media and in your performance review
Included withPremium or Teams
Enroll NowFAQs
Who should take this course?
This course is for aspiring data analysts and data scientists interested in learning the R programming language to analyze data as a data analyst or data scientist.
What is dplyr?
dplyr is an R package. It is one of the core packages of the popular tidyverse set of packages in the R programming language. The dplyr package in R is used to quickly and easily manipulate data for common tasks. It's built to work directly with data frames and offers a consistent syntax that makes data manipulation more straightforward.
Do I need any prior knowledge of R?
Introduction to the Tidyverse is a prerequisite for this course. Foundational skills in R will help you succeed in this course.
Do I need to have R on my computer?
You do not need to have R on your computer. All exercises are done in-browser within this course.
Join over 19 million learners and start Data Manipulation with dplyr today!
Create Your Free Account
or
By continuing, you accept our Terms of Use, our Privacy Policy and that your data is stored in the USA.