Ebook details

Data Engineering with Azure Databricks. Design, build, and optimize scalable data pipelines and analytics solutions with Azure Databricks

Data Engineering with Azure Databricks. Design, build, and optimize scalable data pipelines and analytics solutions with Azure Databricks

Dmitry Foshin, Dmitry Anoshin, Tonya Chernyshova, Sergii Volodarskyi

Loading...
EBOOK
Loading...
Data Engineering with Azure Databricks is your essential guide to building scalable, secure, and high-performing data pipelines using the powerful Databricks platform on Azure. Designed for data engineers, architects, and developers, this book demystifies the complexities of Spark-based workloads, Delta Lake, Unity Catalog, and real-time data processing.
Beginning with the foundational role of Azure Databricks in modern data engineering, you’ll explore how to set up robust environments, manage data ingestion with Auto Loader, optimize Spark performance, and orchestrate complex workflows using tools like Azure Data Factory and Airflow.
The book offers deep dives into structured streaming, Delta Live Tables, and Delta Lake’s ACID features for data reliability and schema evolution. You’ll also learn how to manage security, compliance, and access controls using Unity Catalog, and gain insights into managing CI/CD pipelines with Azure DevOps and Terraform.
With a special focus on machine learning and generative AI, the final chapters guide you in automating model workflows, leveraging MLflow, and fine-tuning large language models on Databricks. Whether you're building a modern data lakehouse or operationalizing analytics at scale, this book provides the tools and insights you need.
  • 1. The Role of Azure Databricks in Modern Data Engineering
  • 2. Setting up an End-To-End Azure Databricks Environment
  • 3. Data Ingestion Strategies for Azure Databricks
  • 4. Data Engineering with Apache Spark
  • 5. Building Real-Time Data Pipelines
  • 6. Working with Delta Lake: ACID Transactions and Schema Evolution
  • 7. Automating Data Systems with Lakeflow Spark Declarative Pipelines
  • 8. Orchestrating Data Workflows: From Notebooks to Production
  • 9. CI/CD and DevOps for Azure Databricks
  • 10. Optimizing Query Performance and Cost Management
  • 11. Security, Compliance, and Data Governance
  • 12. Machine Learning and AI on Databricks
  • Title:Data Engineering with Azure Databricks. Design, build, and optimize scalable data pipelines and analytics solutions with Azure Databricks
  • Author:Dmitry Foshin, Dmitry Anoshin, Tonya Chernyshova, Sergii Volodarskyi
  • Original title:Data Engineering with Azure Databricks. Design, build, and optimize scalable data pipelines and analytics solutions with Azure Databricks
  • ISBN:9781806106363, 9781806106363
  • Date of issue:2026-04-30
  • Format:Ebook
  • Item ID: e_4isb
  • Publisher: Packt Publishing
Loading...
Loading...