Деталі електронної книги

Engineering Lakehouses with Open Table Formats. Build scalable and efficient lakehouses with Apache Iceberg, Apache Hudi, and Delta Lake

Engineering Lakehouses with Open Table Formats. Build scalable and efficient lakehouses with Apache Iceberg, Apache Hudi, and Delta Lake

Dipankar Mazumdar, Vinoth Govindarajan, Chao Sun

Завантаження...
EЛЕКТРОННА КНИГА
Engineering Lakehouses with Open Table Formats provides detailed insights into lakehouse concepts, and dives deep into the practical implementation of open table formats such as Apache Iceberg, Apache Hudi, and Delta Lake.
You’ll explore the internals of a table format and learn in detail about the transactional capabilities of lakehouses. You’ll also get hands on with each table format with exercises using popular computing engines, such as Apache Spark, Flink, Trino, and Python-based tools. The book addresses advanced topics, including performance optimization techniques and interoperability among different formats, equipping you to build production-ready lakehouses. With step-by-step explanations, you’ll get to grips with the key components of lakehouse architecture and learn how to build, maintain, and optimize them.
By the end of this book, you’ll be proficient in evaluating and implementing open table formats, optimizing lakehouse performance, and applying these concepts to real-world scenarios, ensuring you make informed decisions in selecting the right architecture for your organization’s data needs.
  • 1. Open Data Lakehouse: A New Architectural Paradigm
  • 2. Transactional Capabilities of the Lakehouse
  • 3. Apache Iceberg Deep Dive
  • 4. Apache Hudi Deep Dive
  • 5. Delta Lake Deep Dive
  • 6. Catalog and Metadata Management
  • 7. Interoperability in Lakehouses
  • 8. Performance Optimization and Tuning in a Lakehouse
  • 9. Data Governance and Security in Lakehouses
  • 10. Evaluating and Selecting Open Table Formats
  • 11. Real-World Applications and Learnings
  • Назва:Engineering Lakehouses with Open Table Formats. Build scalable and efficient lakehouses with Apache Iceberg, Apache Hudi, and Delta Lake
  • Автор:Dipankar Mazumdar, Vinoth Govindarajan, Chao Sun
  • Оригінальна назва:Engineering Lakehouses with Open Table Formats. Build scalable and efficient lakehouses with Apache Iceberg, Apache Hudi, and Delta Lake
  • ISBN:9781836207221, 9781836207221
  • Дата видання:2025-12-26
  • Формат:Eлектронна книга
  • Ідентифікатор видання: e_44fv
  • Видавець: Packt Publishing
Завантаження...
Завантаження...