Apache Spark 3 – Databricks Certified Associate Developer
Learn Apache Spark 3 With Scala & Earn the Databricks Associate Certification to prove your skills as data professional
What you’ll learn
How to prepare for the Databricks Certified Associate Developer For Apache Spark 3 Certification Exam
The Architecture of an Apache Spark Application
Learn how Apache Spark runs on a cluster of computer
Learn the Execution Hierarchy of Apache Spark
Create DataFrame from files and Scala Collections
Spark DataFrame API and SQL functions
Learn the different techniques to select the columns of a DataFrame
How to define the schema of a DataFrame and set the data types of the columns
Apply various methods to manipulate the columns of a DataFrame
How to filter your DataFrame based on specifics rules
Learn how to sort data in a specific order
Learn how to sort rows of a DataFrame in a specific order
How to arrange the rows of DataFrame as groups
How to handle NULL Values in a DataFrame
How to use JOIN or UNION to combine two data sets
How you can save the result of complex data transformations to an external storage system
The different deployment modes of an Apache Spark Application
working with UDFs and Spark SQL functions
How to use Databricks Community Edition to write Apache Spark Code
Basic Scala Knowledge
Basic data skills
NO Previous Spark Knowledge
Do you want to learn how to handle massive amounts of data at scale?
Learn Apache Spark 3 and pass the Databricks Certified Associate Developer for Apache Spark 3.0
Hi, My name is Wadson, and I’m a Databricks Certified Associate Developer for Apache Spark 3.0
In today’s data-driven world, Apache Spark has become the standard big-data cluster processing framework.
Apache Spark is used for Data Engineering, Data Science, and Machine Learning.
I will teach you everything you need to know about getting started with Apache Spark.
You will learn the Architecture of Apache Spark and use it’s Core APIs to manipulate complex data.
You will write queries to perform transformations such as Join, Union, GroupBy, and more.
This course is for beginners.
You do not need previous knowledge of Apache Spark.
There are Notebooks available to download so that you can follow along with me in the videos.
The Notebooks contains all the source code I use in the course.
There are also Quizzes to help you assess your understanding of the topics.
Who this course is for:
- Any Developer who wants to start using Apache Spark in their career
- Beginner Spark Developer seeking Big Data Certification
- Developer curious about Data Engineering and Data Science
Created by Wadson Guimatsa
Last updated 3/2023
Size: 1.87 GB
Google Drive Links