Skip to main content
HomePython

Course

Introduction to Natural Language Processing in Python

IntermediateSkill Level
4.7+
957 reviews
Updated 02/2026
Learn fundamental natural language processing techniques using Python and how to apply them to extract insights from real-world text data.
Start Course for Free
PythonMachine Learning4 hr15 videos51 Exercises3,750 XP140K+Statement of Accomplishment

Create Your Free Account

or

By continuing, you accept our Terms of Use, our Privacy Policy and that your data is stored in the USA.

Loved by learners at thousands of companies

Group

Training 2 or more people?

Try DataCamp for Business

Course Description

In this course, you'll learn natural language processing (NLP) basics, such as how to identify and separate words, how to extract topics in a text, and how to build your own fake news classifier. You'll also learn how to use basic libraries such as NLTK, alongside libraries which utilize deep learning to solve common NLP problems. This course will give you the foundation to process and parse text as you move forward in your Python learning.

Prerequisites

Python Toolbox
1

Regular expressions & word tokenization

This chapter will introduce some basic NLP concepts, such as word tokenization and regular expressions to help parse text. You'll also learn how to handle non-English text and more difficult tokenization you might find.
Start Chapter
2

Simple topic identification

This chapter will introduce you to topic identification, which you can apply to any text you encounter in the wild. Using basic NLP models, you will identify topics from texts based on term frequencies. You'll experiment and compare two simple methods: bag-of-words and Tf-idf using NLTK, and a new library Gensim.
Start Chapter
3

Named-entity recognition

This chapter will introduce a slightly more advanced topic: named-entity recognition. You'll learn how to identify the who, what, and where of your texts using pre-trained models on English and non-English text. You'll also learn how to use some new libraries, polyglot and spaCy, to add to your NLP toolbox.
Start Chapter
4

Building a "fake news" classifier

Introduction to Natural Language Processing in Python
Course
Complete

Earn Statement of Accomplishment

Add this credential to your LinkedIn profile, resume, or CV
Share it on social media and in your performance review
Enroll Now

Don’t just take our word for it

*4.7
from 957 reviews
76%
21%
3%
0%
0%
  • 202221066
    4 hours ago

  • k225062
    yesterday

  • Augusto
    4 days ago

  • Abdul
    6 days ago

    I really enjoyed this course with Katharine. It was easy to follow and overall an interesting topic.

  • Warren
    last week

  • İlayda
    last week

202221066

k225062

Augusto

FAQs

Which NLP libraries will I learn to use in this course?

You will work with NLTK, Gensim, spaCy, and polyglot to perform tokenization, topic identification, named-entity recognition, and text classification tasks.

What prior Python knowledge is required?

You should have completed Introduction to Python, Intermediate Python, Introduction to Functions, and Python Toolbox. Comfort with functions, loops, and basic data structures is needed.

What is the fake news classifier project in the final chapter?

You will apply supervised machine learning to build a classifier that distinguishes fake news articles from real ones, selecting features and evaluating model performance on text data.

Does this course handle non-English text?

Yes. The first chapter covers tokenization challenges with non-English text, and the named-entity recognition chapter uses polyglot to process text in multiple languages.

What topic identification methods are compared?

You will experiment with bag-of-words and TF-IDF approaches using NLTK and Gensim to identify topics based on term frequencies, then compare how each method performs.

Join over 19 million learners and start Introduction to Natural Language Processing in Python today!

Create Your Free Account

or

By continuing, you accept our Terms of Use, our Privacy Policy and that your data is stored in the USA.

Grow your data skills with DataCamp for Mobile

Make progress on the go with our mobile courses and daily 5-minute coding challenges.