Course
Building Recommendation Engines with PySpark
Create Your Free Account
or
By continuing, you accept our Terms of Use, our Privacy Policy and that your data is stored in the USA.Loved by learners at thousands of companies
Training 2 or more people?
Try DataCamp for BusinessCourse Description
Prerequisites
Supervised Learning with scikit-learnIntroduction to PySparkRecommendations Are Everywhere
How does ALS work?
Recommending Movies
What if you don't have customer ratings?
Complete
Earn Statement of Accomplishment
Add this credential to your LinkedIn profile, resume, or CVShare it on social media and in your performance reviewEnroll Now
FAQs
What recommendation algorithm does this PySpark course focus on?
The course focuses on the Alternating Least Squares (ALS) algorithm for collaborative filtering, covering its mathematical foundation, hyperparameters, and implementation in PySpark.
What datasets are used for building recommendation engines?
You will work with the MovieLens dataset to build and evaluate a cross-validated ALS model, and the Million Songs dataset to practice with implicit feedback data.
Does the course cover recommendations when explicit ratings are not available?
Yes. The final chapter teaches you how to infer ratings from customer behavior data and build ALS recommendation engines using implicit feedback.
What PySpark and Python prerequisites should I have?
You need experience with pandas, Intermediate Python, Introduction to PySpark, basic SQL, and supervised learning with scikit-learn. This is an advanced-level course.
What is matrix factorization and why does it matter for recommendations?
Matrix factorization decomposes a large user-item matrix into smaller matrices to uncover latent features. It is the mathematical core of ALS and helps predict missing ratings.
Join over 19 million learners and start Building Recommendation Engines with PySpark today!
Create Your Free Account
or
By continuing, you accept our Terms of Use, our Privacy Policy and that your data is stored in the USA.Grow your data skills with DataCamp for Mobile
Make progress on the go with our mobile courses and daily 5-minute coding challenges.