Azure Databricks and Spark SQL (Python)

Master Azure Databricks with PySpark: Your Hands-On Guide to Advanced Data Engineering and Analysis (DP203)

Azure Databricks and Spark SQL (Python)
Azure Databricks and Spark SQL (Python)

Azure Databricks and Spark SQL (Python) udemy course

Master Azure Databricks with PySpark: Your Hands-On Guide to Advanced Data Engineering and Analysis (DP203)

What you'll learn:

Azure Databricks & Spark For Data Engineers (PySpark / SQL)

  • You will learn how to build a real-world data project using Azure Databricks and Spark Core. This course has been taught using real-world data from Formula1 motor racing
  • You will acquire professional-level data engineering skills in Azure Databricks, Delta Lake, Spark Core, Azure Data Lake Gen2, and Azure Data Factory (ADF)
  • You will learn how to create notebooks, dashboards, clusters, cluster pools, and jobs in Azure Databricks
  • You will learn how to ingest and transform data using PySpark in Azure Databricks
  • You will learn how to transform and analyze data using Spark SQL in Azure Databricks
  • You will learn about Data Lake architecture and Lakehouse architecture. Also, you will learn how to implement a solution for Lakehouse architecture using Delta Lake.
  • You will learn how to create Azure Data Factory pipelines to execute Databricks notebooks.
  • You will learn to create Azure Data Factory triggers to schedule and monitor pipelines.
  • You will gain the skills required around Azure Databricks and Data Factory to pass the Azure Data Engineer Associate certification exam DP203. Still, the course’s primary objective is not to teach you to pass the exams.
  • You will learn how to connect to Azure Databricks from PowerBI to create reports

Requirements:

  • All the code and step-by-step instructions are provided, but the skills below will greatly benefit your journey.
  • Basic Python programming experience will be required
  • Basic SQL knowledge will be required
  • Knowledge of cloud fundamentals will be beneficial but not necessary
  • An Azure subscription will be required; if you don’t have one, we will create a free account in the course

Description:

Databricks is one of the most in demand big data tools around. It is a fast, easy, and collaborative Spark based big data analytics service designed for data science, ML and data engineering workflows.

The course is packed with lectures, code-along videos and dedicated challenge sections. This should be more than enough to keep you engaged and learning! As an added bonus you will also have lifetime access to all the lectures… and I have provided detailed notebooks as a downloadable asset, the notebooks will contain step by step documentation with additional resources and links.

I have ensured that the delivery of the course is engaging and concise, the curriculum is extensive yet delivered in an efficient way. The course will provide you with hands-on training utilising a variety of different data sets.

The course is aimed at teaching you PySpark, Spark SQL in Python and the Databricks Lakehouse Architecture.

You will primarily be using Databricks on Microsoft Azure in addition to other services such as Azure Data Lake Storage Gen 2.

The course will cover a variety of areas including:

  • Set Up and Overview

  • Azure Databricks Notebooks

  • Spark SQL

  • Reading and Writing Data

  • Data Analysis and Transformation with Spark SQL in Python

  • Charts and Dashboards in Databricks Notebooks

  • Databricks Medallion Architecture

  • Accessing Data in Cloud Object Storage

  • Hive Metastore

  • Databases, Tables and Views in Databricks

  • Delta Lake / Databricks Lakehouse Architecture

Who this course is for:

Course Details:

  • 9 hours on-demand video
  • 8 articles
  • 7 downloadable resources
  • Access on mobile and TV
  • Certificate of completion

Azure Databricks and Spark SQL (Python) udemy free download

Master Azure Databricks with PySpark: Your Hands-On Guide to Advanced Data Engineering and Analysis (DP203)

Demo Link: https://www.udemy.com/course/azure-databricks-and-spark-sql-python/