
Introduction to Databricks

4.1 (18 reviews)
Beginner

Learn about the power of the Databricks Lakehouse and scale up your data engineering and machine learning skills.

4 Hours, 19 Videos, 60 Exercises
6,888 Learners, Statement of Accomplishment



Course Description


Learn the power of the Lakehouse

In today's data-filled world, we need tools that allow us to be as data-driven as possible. This course guides you from start to finish through how the Databricks Lakehouse Platform provides a single, scalable, and performant platform for your data processes. Working through a real-world dataset will teach you how to accomplish various tasks within the Databricks platform. You'll start the course by learning how to administer the Databricks platform and ensure your environment is set up securely.


Practice scalable data engineering

After setting up your workspace, you will learn how to create powerful data pipelines using Databricks. You will apply different transformations to the dataset, moving it from Bronze to Silver and then Gold in a Medallion architecture. You will learn how Databricks clusters provide readily available compute power and scalability. You will set up an end-to-end Databricks Workflow to automate your entire data pipeline.
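As a rough illustration of the Bronze, Silver, and Gold idea, here is a minimal sketch in plain Python. It does not use Databricks, Spark, or Delta and is not taken from the course material; the sample records, the field names, and the helper functions to_silver and to_gold are all hypothetical. It only shows the general pattern of raw data being cleaned and then aggregated.

```python
from collections import defaultdict

# Bronze: raw records as ingested, including a duplicate and a malformed row.
# All field names and values are hypothetical illustration data.
bronze = [
    {"order_id": "1", "region": "EU", "amount": "10.50"},
    {"order_id": "1", "region": "EU", "amount": "10.50"},  # duplicate row
    {"order_id": "2", "region": "US", "amount": "n/a"},    # unparseable amount
    {"order_id": "3", "region": "US", "amount": "4.25"},
    {"order_id": "4", "region": "EU", "amount": "2.00"},
]


def to_silver(rows):
    """Silver: drop unparseable rows, deduplicate by order_id, and cast types."""
    seen = set()
    cleaned = []
    for row in rows:
        try:
            amount = float(row["amount"])
        except ValueError:
            continue  # skip rows whose amount cannot be parsed
        if row["order_id"] in seen:
            continue  # keep only the first row per order_id
        seen.add(row["order_id"])
        cleaned.append(
            {"order_id": int(row["order_id"]), "region": row["region"], "amount": amount}
        )
    return cleaned


def to_gold(rows):
    """Gold: aggregate the cleaned rows into total amount per region."""
    totals = defaultdict(float)
    for row in rows:
        totals[row["region"]] += row["amount"]
    return dict(totals)


silver = to_silver(bronze)
gold = to_gold(silver)
print(gold)
```

On Databricks the same layering would typically be expressed as tables and Spark transformations rather than Python lists; the sketch is only meant to make the three stages concrete.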


Use the Lakehouse as your data warehouse

A key part of the Lakehouse architecture is that you can query your data storage like a traditional data warehouse. In this section, you will learn how Databricks SQL gives you the data warehousing performance you want on top of your data lake. You will learn how to create queries using standard ANSI SQL, and use those results to create ad-hoc dashboards against your entire dataset.


Implement governed data science and machine learning

Finally, you will learn how Databricks provides a complete set of tools for data science and machine learning use cases. You will learn to track and evaluate your models using the fully integrated MLflow framework for MLOps. You will learn how the Feature Store and Model Registry simplify the process of creating production-quality machine learning models. You will also learn how to deploy and monitor your models using built-in model serving capabilities.
  1. Introduction to Databricks

    Free

    Learn about the new lakehouse paradigm for your cloud data strategy and how the Databricks Lakehouse platform can modernize your data architecture. Understand the foundational components of the Databricks platform and how they all fit together.

    Introduction to the Databricks Lakehouse Platform
    50 xp
    Why pick a Lakehouse?
    50 xp
    Benefits of the Databricks Lakehouse
    50 xp
    Architectural Decisions
    100 xp
    Core features of the Databricks Lakehouse Platform
    50 xp
    Why Delta?
    50 xp
    Databricks for different personas
    50 xp
    Capabilities for each data persona
    100 xp
    Administering a Databricks workspace
    50 xp
    Managing and adding users
    50 xp
    Setting up a Databricks workspace example
    50 xp
    Control Plane vs. Data Plane
    50 xp
    Configure your Databricks workspace
    100 xp

Collaborators

Arne Warnke
Iason Prassides
Carl Rosseel

Prerequisites

Intermediate SQL
Understanding Data Engineering
Understanding Machine Learning

Kevin Barlow

Data Professional

Kevin has over a decade of experience working with data in various applications. He is passionate about helping people and companies find insights from their data and hopes to teach you some of the strategies and techniques he has learned throughout his career.

Don’t just take our word for it

4.1 from 18 reviews
Rating breakdown: 44%, 33%, 17%, 6%, 0%
  • Pushpendra R.
    6 months

    Best course to get started with Databricks!

  • Raminta N.
    about 1 month

    The course was clear and comprehensive.

  • Irfan G.
    3 months

    It was really good. The course provided an in-depth introduction to the platform. I now feel comfortable using it at work and even started proposing solutions.

  • Sufi M.
    6 months

    Nice. Add practical utility in industry.

  • Hendrik M.
    5 months

    Lacking depth and true hands-on experience


