Data Science with Spark

29 October 2025
Amsterdam, The Netherlands

3 days
In Person
Apache Spark
Data Science

Apache Spark is a powerful, open-source processing engine built around speed, ease of use, and advanced analytics. This course teaches you to unlock its full potential and master this challenging tool.

Looking to upskill your team(s) or organization? 

Rozaliia will gladly help you with custom training solutions.

Rozaliia Khafizova
Data and AI Training Advisor

+31 6 11 58 19 37
Rozaliia.Khafizova@xebia.com
linkedin.com/in/rozaliya-k

Duration: 3 days
Time: 09:00 – 17:00 (GMT +1:00)
Language: English
Lunch: Included
Certification: No
Level: Advanced

What will you learn?

After the training, you will be able to:

Process large-scale data using PySpark.

Understand the fundamentals of Apache Spark.

Scale your machine learning workflows using PySpark.

Program

  • Spark execution and Spark sessions. 
  • DataFrame methods, properties, and actions.
  • APIs: (Py)Spark DataFrame vs Spark SQL. 
  • Reading and writing data in Spark. 

This training is for you if:

You have worked with Python before and want to know how to scale to large datasets.

You have started, or are about to start, working with large data.

You know the concepts of machine learning and want to know how to apply them at scale.

This training is not for you if:

You won’t be working with Spark but want to learn Python (check out our Python for Data Analysis training instead).

You would like an introduction to machine learning (check out our Certified Data Science with Python course instead).

Why should I follow this training?

Learn the fundamentals of Apache Spark

Learn from the Spark experts

Learn to process large-scale data using PySpark and perform machine learning

What else should I know?

After registering for this training, you will receive a confirmation email with practical information. A week before the training, we will ask you about any dietary requirements and share literature if you need to prepare.

See you soon!

Course information

All literature and course materials are included in the price. 

Also interesting for you

Data as a Product

Become equipped to implement data products in your organisation. You will learn how to apply a use-case driven approach to data products, starting from end-user needs.

Data Science
Data and AI
Product Management

MLOps on GCP

Discover what MLOps is and how you can apply it in GCP (Google Cloud Platform) with our MLOps on GCP training course.

Data Analytics
Data Engineering
Data Science
Data and AI
Google Cloud Platform (GCP)
2 days
In Person

Next:

14 – 16 May 2025

From:

€1520

Data Science Bootcamp

Transform into a certified Data Scientist in just 12 weeks. This boot camp will kick-start your Data Science career with Python.

Data Science
Python
11 days
Virtual

Next:

11 Sep 2025

From:

€2975

Time Series Analysis & Forecasting

Learn how to extract insights, interpret seasonality and build forecasting models from time series data.

Data Science
Data and AI
4 days
In Person

Next:

6 Oct 2025

From:

€1425

Python for Data Analysis

Learn how to code in Python and perform data analysis with our Python for Data Analysis training.

Steven van Duin

Data Science
Data and AI
Python
2 days
Virtual

Next:

12 – 13 Jun 2025

From:

€1315
