DP-3011
This course explores how to use Databricks and Apache Spark on Azure to take data projects from exploration to production.
- Learn how to ingest, transform, and analyze large-scale datasets with Spark DataFrames, Spark SQL, and PySpark
- Build confidence in managing distributed data processing
- Get hands-on with the Databricks workspace—navigating clusters and creating and optimizing Delta tables
- Dive into data engineering practices, including designing ETL pipelines, handling schema evolution, and enforcing data quality.
- Automate and manage workloads with Lakeflow Jobs and pipelines
- Explore governance and security capabilities such as Unity Catalog and Purview integration

1 Day
HK$3500
