Course
Feature Engineering with PySpark
AdvancedSkill Level
Updated 01/2026Start Course for Free
Included withPremium or Teams
SparkData Manipulation4 hr16 videos60 Exercises5,000 XP17,303Statement of Accomplishment
Create Your Free Account
or
By continuing, you accept our Terms of Use, our Privacy Policy and that your data is stored in the USA.Loved by learners at thousands of companies
Training 2 or more people?
Try DataCamp for BusinessCourse Description
Prerequisites
Supervised Learning with scikit-learnIntroduction to PySpark1
Exploratory Data Analysis
Get to know a bit about your problem before you dive in! Then learn how to statistically and visually inspect your dataset!
2
Wrangling with Spark Functions
Real data is rarely clean and ready for analysis. In this chapter learn to remove unneeded information, handle missing values and add additional data to your analysis.
3
Feature Engineering
In this chapter learn how to create new features for your machine learning model to learn from. We'll look at generating them by combining fields, extracting values from messy columns or encoding them for better results.
4
Building a Model
In this chapter we'll learn how to choose which type of model we want. Then we will learn how to apply our data to the model and evaluate it. Lastly, we'll learn how to interpret the results and save the model for later!
Feature Engineering with PySpark
Course Complete
Earn Statement of Accomplishment
Add this credential to your LinkedIn profile, resume, or CVShare it on social media and in your performance review
Included withPremium or Teams
Enroll NowJoin over 19 million learners and start Feature Engineering with PySpark today!
Create Your Free Account
or
By continuing, you accept our Terms of Use, our Privacy Policy and that your data is stored in the USA.