Certified Data Science with Python

5 February, 2024 · Virtual

3 days
Data Science

Do you know how to perform basic data analysis in Python but want to learn more about machine learning? During this three-day data science deep dive, you will learn how to unlock Python’s potential for data analysis and machine learning. The Data Science with Python Foundation course covers training models with scikit-learn and best practices for transforming your data with pandas, with a perfect combination of theory and practice. 

09:00 – 17:00

What will you learn?

After the training, you will be able to:

Perform exploratory data analysis on your datasets with pandas. 

Train and evaluate machine learning models with scikit-learn. 

Identify a suitable machine learning algorithm and metric for your data problem. 

Prepare complex data for machine learning with scaling, encoding, and imputing techniques. 

Apply best practices for data wrangling and model building.

Key takeaways

Data Wrangling with Pandas 

  1. Fetch descriptive summary statistics of your data with simple operations.
  2. Effectively select and filter parts of your data with loc.
  3. Retrieve advanced statistics with groupby aggregations.
  4. Extend your dataset by creating new columns with assign.
  5. Structure your code neatly by chaining methods.
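
To make the pandas takeaways above concrete, here is a minimal sketch that combines them in one method chain. The `sales` DataFrame and its column names are invented for illustration and are not part of the course materials.

```python
import pandas as pd

# Hypothetical toy data, invented for illustration only.
sales = pd.DataFrame(
    {
        "region": ["north", "north", "south", "south", "south"],
        "units": [10, 4, 7, 3, 12],
        "unit_price": [2.5, 4.0, 3.0, 5.0, 1.5],
    }
)

# Descriptive summary statistics (count, mean, std, quartiles) in one call.
summary = sales.describe()

# One method chain: add a column with assign, filter rows with loc,
# then compute per-group statistics with a groupby aggregation.
revenue_by_region = (
    sales
    .assign(revenue=lambda df: df["units"] * df["unit_price"])
    .loc[lambda df: df["units"] >= 4]
    .groupby("region")
    .agg(total_revenue=("revenue", "sum"), mean_units=("units", "mean"))
)

print(summary)
print(revenue_by_region)
```

Passing a lambda to assign and loc lets each step refer to the intermediate result, so the chain needs no temporary variables.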

Machine Learning with Scikit-Learn 

  1. Use scikit-learn to train classification and regression models.  
  2. Evaluate trained models with train/test set split and scikit-learn metrics. 
  3. Use scikit-learn transformers for categorical variable encoding, scaling, and missing-value imputation.
  4. Pre-process complex data in scikit-learn with ColumnTransformer and Pipeline.
  5. Tune pre-processing and model hyperparameters with grid search.
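
As a rough illustration of how the scikit-learn pieces above fit together, the sketch below builds a pre-processing and model pipeline and tunes it with GridSearchCV. The synthetic data, the column names `age` and `city`, and the choice of LogisticRegression are assumptions made for this example, not the course's own exercise.

```python
import numpy as np
import pandas as pd
from sklearn.compose import ColumnTransformer
from sklearn.impute import SimpleImputer
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import GridSearchCV, train_test_split
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import OneHotEncoder, StandardScaler

# Synthetic data, invented for illustration: one numeric and one categorical feature.
rng = np.random.default_rng(0)
n = 200
age = rng.normal(40, 10, n)
city = rng.choice(["a", "b", "c"], n)
y = (age + rng.normal(0, 5, n) > 40).astype(int)
age[::20] = np.nan  # introduce some missing values to impute
X = pd.DataFrame({"age": age, "city": city})

# Numeric columns: impute missing values, then scale.
numeric = Pipeline(
    [("impute", SimpleImputer(strategy="median")), ("scale", StandardScaler())]
)

# Route each column type to its own transformer.
preprocess = ColumnTransformer(
    [
        ("num", numeric, ["age"]),
        ("cat", OneHotEncoder(handle_unknown="ignore"), ["city"]),
    ]
)

model = Pipeline(
    [("preprocess", preprocess), ("clf", LogisticRegression(max_iter=1000))]
)

# Hold out a test set that the grid search never sees.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=0, stratify=y
)

# Tune a pre-processing setting and a model hyperparameter together.
param_grid = {
    "preprocess__num__impute__strategy": ["mean", "median"],
    "clf__C": [0.1, 1.0, 10.0],
}
search = GridSearchCV(model, param_grid, cv=5, scoring="accuracy")
search.fit(X_train, y_train)

test_accuracy = search.score(X_test, y_test)
print(search.best_params_, round(test_accuracy, 3))
```

Because the transformers live inside the pipeline, they are refit on each cross-validation training fold, which keeps information from the validation folds out of the pre-processing.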

Machine Learning Theory 

  1. Identify the type of machine learning task (classification or regression, supervised or unsupervised, and others).
  2. Differentiate between several machine learning algorithms (such as linear regression, decision tree, and support vector machine). 
  3. Create models that generalize (underfitting and overfitting, train-test split, k-fold cross-validation). 
  4. Understand how to evaluate your model’s effectiveness with various metrics (such as precision and recall, F1, root mean squared error, and R²).
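
The metrics and the generalization ideas listed above can be sketched in a few lines. The labels and predictions below are made up by hand so the numbers are easy to verify, and the overfitting demonstration uses a synthetic dataset from make_classification. Treat it as an illustrative example under those assumptions.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.metrics import (
    f1_score,
    mean_squared_error,
    precision_score,
    r2_score,
    recall_score,
)
from sklearn.model_selection import cross_val_score
from sklearn.tree import DecisionTreeClassifier

# Hand-made classification example: 3 true positives, 1 false positive, 1 false negative.
y_true = [1, 1, 1, 0, 0, 0, 0, 1]
y_pred = [1, 0, 1, 0, 1, 0, 0, 1]
precision = precision_score(y_true, y_pred)  # 3 / (3 + 1) = 0.75
recall = recall_score(y_true, y_pred)        # 3 / (3 + 1) = 0.75
f1 = f1_score(y_true, y_pred)                # harmonic mean of the two = 0.75

# Hand-made regression example: squared errors are 1, 0, 1, 0.
y_true_reg = [3.0, 5.0, 7.0, 9.0]
y_pred_reg = [2.0, 5.0, 8.0, 9.0]
rmse = np.sqrt(mean_squared_error(y_true_reg, y_pred_reg))  # sqrt(0.5)
r2 = r2_score(y_true_reg, y_pred_reg)                       # 1 - 2/20 = 0.9

# Overfitting check: an unpruned decision tree memorizes noisy training data,
# while 5-fold cross-validation gives a more honest estimate.
X, y = make_classification(
    n_samples=300, n_features=10, flip_y=0.2, random_state=0
)
tree = DecisionTreeClassifier(random_state=0)
train_accuracy = tree.fit(X, y).score(X, y)
cv_accuracy = cross_val_score(tree, X, y, cv=5).mean()

print(precision, recall, f1, rmse, r2)
print(train_accuracy, cv_accuracy)
```

A large gap between the training score and the cross-validated score is a typical sign of overfitting.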


  • Master exploratory data analysis with Pandas  
  • Introduction to machine learning (theory) 
  • Build your first machine learning model on a real dataset 

Who is it for?

This training is perfect for data scientists who want to enhance their skills and gain practical knowledge using the most popular and essential data science tools. Whether you’re a beginner or have some prior experience with Python, this Foundation training will provide you with the necessary expertise to kickstart your venture into data science.    

If you are already an experienced data scientist and want to develop your skills even further, check out our Advanced Data Science with Python training.


Basic knowledge of Python (or another programming language) is necessary to fully engage with the training.

If you are unsure about your Python proficiency, we recommend checking out the Python for Data Analysts training or contacting our sales representative. 

Why should I follow this training?

Dive into the world of Data Science and Machine Learning using Python.

The course combines theory and practice perfectly, making it a great introduction to the field.  

Get best practices for data wrangling and model building.

What else should I know?

After registering for this training, you will receive a confirmation email with practical information. A week before the training, we will ask you about any dietary requirements and share literature if you need to prepare.

See you soon!

Course information

For this training, you need a laptop on which you can install programs.  

All literature and course materials are included in the price. 

Meet the trainers

James Hayward

Meet James Hayward, a data science trainer at Xebia Academy.

Lucy Sheppard 

Meet Lucy Sheppard, a data science trainer at Xebia Academy. Lucy teaches many Python courses as well as dbt Learn.

Lysanne van Beek
