Data Processing at Scale
Data is knowledge, and knowledge is power. But the efficient processing of data can be challenging when scaled up. This training dives deep into one of the most popular and scalable tools for large-data transformation: Apache Spark. In this course, you will learn everything you need to know about how Apache Spark works. Through a combination of theory and hands-on exercises, you will also gain the skills to write efficient ETL Spark jobs to process large data sets.
Looking to upskill your team(s) or organization?
Nico will gladly help you with custom training solutions for your organization.
Get in touch
What will you learn?
After the training, you will be able to:
- Use Apache Spark and its advanced features
- Write efficient ETL jobs
- Use the API to transform data at a basic and advanced level
- Think in terms of distributed systems when writing Spark jobs
Key takeaways
- Inner workings of Apache Spark
- Loading data from various formats
- Basic and advanced data frame operations
- Window and user-defined functions
- Unit testing
- Hands-on exercise to analyze large-scale logs to find trending topics
Program
- Inner workings of Apache Spark
- Loading data from various formats
- Basic and advanced data frame operations
- Window and user-defined functions
Who is it for?
This training is perfect for you if you are a data engineer or machine learning engineer who transforms large volumes of data.
Requirements
This training requires basic experience with Python. Don't have that experience yet? Then check out Python for Data Engineers instead.
Why should I follow this training?
Optimal Spark Use
Use Apache Spark and its advanced features and write efficient ETL jobs.
Go Advanced
Learn about the inner workings of Apache Spark, loading data from various formats, and basic and advanced data frame operations.
Processing data sets
Gain the skills necessary to process large data sets.
What else should I know?
After registering for this training, you will receive a confirmation email with practical information. A week before the training, we will ask you about any dietary requirements and share literature if you need to prepare.
See you soon!
Course information
All literature and course materials are included in the price.