Skip to main content
Category
Topics

Data Science Tutorials

Advance your data career with our data science tutorials. We walk you through challenging data science functions and models step-by-step.
Other topics:
GroupTraining 2 or more people?Try DataCamp for Business

Kernel Density Estimation: From Theory to Practice

Kernel density estimation is a nonparametric method for estimating the shape of a data distribution without assuming a fixed model. Learn the formula, bandwidth selection, and hands-on implementation in Python and R.
Dario Radečić's photo

Dario Radečić

June 16, 2026

Logistic Regression Assumptions: What You Need to Check Before Modeling

A practical walkthrough of the assumptions behind logistic regression, the diagnostics that catch violations in Python and R, and the alternatives to reach for when the assumptions don't hold.
Dario Radečić's photo

Dario Radečić

June 15, 2026

Spline Regression: A Practical Guide with Python & R

A practical guide to spline regression, covering how piecewise polynomials and knots model nonlinear relationships, the main spline types, and how to fit them in Python and R.
Dario Radečić's photo

Dario Radečić

June 14, 2026

Generalized Linear Model (GLM): A Beginner's Guide to Theory and Code

A practical guide to GLMs - what they are, how their three components work together, and how to fit and interpret them in Python and R.
Dario Radečić's photo

Dario Radečić

June 12, 2026

Overfitting vs. Underfitting: A Practical Guide to Model Diagnostics

A detailed walkthrough of overfitting and underfitting in machine learning, including how to identify each failure mode, why it happens, and how to fix it through the bias-variance tradeoff.
Dario Radečić's photo

Dario Radečić

June 12, 2026

Markov Chain Monte Carlo (MCMC): Sample Complex Probability Distributions

A guide to Markov Chain Monte Carlo - covering how it works, why it’s used, the most common algorithms, and how to apply it in Python for Bayesian inference.
Dario Radečić's photo

Dario Radečić

June 10, 2026

Gradient Clipping: How to Prevent Exploding Gradients

Gradient clipping is a one-line training fix that prevents exploding gradients from ruining deep neural network training. This guide covers how it works, the two main clipping methods, threshold selection, and implementation in PyTorch and TensorFlow.
Dario Radečić's photo

Dario Radečić

June 10, 2026

Support Vector Regression (SVR): How It Works and When to Use It

Support Vector Regression is a margin-based regression method that ignores small errors intentionally, handles nonlinear relationships through kernels, and holds up on noisy real-world data where standard regression comes up short.
Dario Radečić's photo

Dario Radečić

June 4, 2026

Singular Value Decomposition (SVD): What You Need to Know

Singular Value Decomposition (SVD) is a matrix factorization method that breaks any matrix into three simpler components, revealing its underlying structure.
Dario Radečić's photo

Dario Radečić

May 18, 2026

Kernel Trick Explained: How SVMs Learn Nonlinear Patterns

A conceptual guide to the kernel trick - what it is, how it enables SVMs and other kernel-based models, and when to use it over other approaches to nonlinear modeling.
Dario Radečić's photo

Dario Radečić

May 4, 2026

Kruskal-Wallis Test: Comparing Multiple Groups Without Normality

A practical guide to the Kruskal-Wallis test - what it is, how it works, when to use it over ANOVA, and how to run and interpret it in Python and R.
Dario Radečić's photo

Dario Radečić

May 4, 2026