Big data

1033
Ładowanie...
EBOOK

The Definitive Guide to Microsoft Fabric. From discovery to building a unified, secure, and scalable data platform

Christopher Maneu, Emilie Beau, Jean-Pierre Riehl, Romain...

Microsoft Fabric is reshaping how organizations manage, analyze, and act on data by unifying ingestion, storage, transformation, analytics, AI, and visualization in a single platform.The Definitive Guide to Microsoft Fabric takes you from your very first workspace to building a secure, scalable, and future-proof analytics environment. You’ll learn how to unify data in OneLake, design data meshes, transform and model data, implement real-time analytics, and integrate AI capabilities. The book also covers advanced topics, such as governance, security, cost optimization, and team collaboration using DevOps and DataOps principles. Drawing on the real-world expertise of five seasoned professionals who have built and advised on platforms for startups, SMEs, and Europe’s largest enterprises, this book blends strategic insight with practical guidance.By the end of this book, you’ll have gained the knowledge and skills to design, deploy, and operate a Microsoft Fabric platform that delivers sustainable business value.

1034
Ładowanie...
EBOOK

The Definitive Guide to Power Query (M). Mastering complex data transformation with Power Query

Greg Deckler, Rick de Groot, Melissa de...

Data transformation is a critical step in building data models and business intelligence reports. Power Query is an invaluable tool for anyone who wants to master data transformation, and this book will equip you with the knowledge and skills to make the most of it.The Definitive Guide to Power Query (M) will help you build a solid foundation in the Power Query M language. As you progress through the chapters, you'll learn how to use that knowledge to implement advanced concepts and data transformations. This will set the stage for an uncompromisingly thorough exploration of the Power Query M Language.You'll also get to grips with optimizing performance, handling errors, and implementing efficient data processing techniques. As this is a hands-on guide, the practical examples in the chapters will help you gain the skills to apply Power Query to real-world problems and improve your data analysis capabilities.By the end of this book, you will be able to leverage all of Power Query's remarkable capabilities for data transformation.*Email sign-up and proof of purchase required

1035
Ładowanie...
EBOOK

The Economics of Data, Analytics, and Digital Transformation. The theorems, laws, and empowerments to guide your organization's digital transformation

Bill Schmarzo

In today’s digital era, every organization has data, but just possessing enormous amounts of data is not a sufficient market discriminator. The Economics of Data, Analytics, and Digital Transformation aims to provide actionable insights into the real market discriminators, including an organization’s data-fueled analytics products that inspire innovation, deliver insights, help make practical decisions, generate value, and produce mission success for the enterprise.The book begins by first building your mindset to be value-driven and introducing the Big Data Business Model Maturity Index, its maturity index phases, and how to navigate the index. You will explore value engineering, where you will learn how to identify key business initiatives, stakeholders, advanced analytics, data sources, and instrumentation strategies that are essential to data science success. The book will help you accelerate and optimize your company’s operations through AI and machine learning.By the end of the book, you will have the tools and techniques to drive your organization’s digital transformation.Here are a few words from Dr. Kirk Borne, Data Scientist and Executive Advisor at Booz Allen Hamilton, about the book:Data analytics should first and foremost be about action and value. Consequently, the great value of this book is that it seeks to be actionable. It offers a dynamic progression of purpose-driven ignition points that you can act upon.

1036
Ładowanie...
EBOOK

The Kaggle Book. Data analysis and machine learning for competitive data science

Konrad Banachewicz, Anthony Goldbloom, Luca Massaron

Millions of data enthusiasts from around the world compete on Kaggle, the most famous data science competition platform of them all. Participating in Kaggle competitions is a surefire way to improve your data analysis skills, network with an amazing community of data scientists, and gain valuable experience to help grow your career.The first book of its kind, The Kaggle Book assembles in one place the techniques and skills you’ll need for success in competitions, data science projects, and beyond. Two Kaggle Grandmasters walk you through modeling strategies you won’t easily find elsewhere, and the knowledge they’ve accumulated along the way. As well as Kaggle-specific tips, you’ll learn more general techniques for approaching tasks based on image, tabular, textual data, and reinforcement learning. You’ll design better validation schemes and work more comfortably with different evaluation metrics.Whether you want to climb the ranks of Kaggle, build some more data science skills, or improve the accuracy of your existing models, this book is for you.Plus, join our Discord Community to learn along with more than 1,000 members and meet like-minded people!

1037
Ładowanie...
EBOOK

The Kaggle Book. Master data science competitions with machine learning, GenAI, and LLMs - Second Edition

Luca Massaron, Bojan Tunguz, Konrad Banachewicz, Anthony...

Kaggle has become the proving ground for millions of data enthusiasts worldwide, offering what no classroom tutorial can match: battle-tested skills built through real-world challenges and the hands-on experience that employers seek. Every competition sharpens your data analysis skills, expands your network within the data scientist community, and gives compelling proof of expertise to unlock career opportunities.The first book of its kind, The Kaggle Book brings together everything you need to excel in competitions, data science projects, and beyond. This new edition includes fresh content and new chapters on Kaggle Models, time series, and Generative AI competitions, with three Kaggle Grandmasters guiding you through modeling strategies and sharing hard-earned insights accumulated over years of competition.The book extends far past competition tactics, revealing techniques for tackling image, tabular, and textual data as well as reinforcement learning tasks. You’ll also discover tips for designing better validation schemes and working confidently with both standard and unconventional evaluation metrics.Whether you want to climb the Kaggle leaderboard, accelerate your data science career, or improve the accuracy of your models, this book is for you.Join our Discord community of over 1,000 members to learn, share, and grow together!

1038
Ładowanie...
EBOOK

The Kaggle Workbook. Self-learning exercises and valuable insights for Kaggle data science competitions

Konrad Banachewicz, Luca Massaron

More than 80,000 Kaggle novices currently participate in Kaggle competitions. To help them navigate the often-overwhelming world of Kaggle, two Grandmasters put their heads together to write The Kaggle Book, which made plenty of waves in the community. Now, they’ve come back with an even more practical approach based on hands-on exercises that can help you start thinking like an experienced data scientist.In this book, you’ll get up close and personal with four extensive case studies based on past Kaggle competitions. You’ll learn how bright minds predicted which drivers would likely avoid filing insurance claims in Brazil and see how expert Kagglers used gradient-boosting methods to model Walmart unit sales time-series data. Get into computer vision by discovering different solutions for identifying the type of disease present on cassava leaves. And see how the Kaggle community created predictive algorithms to solve the natural language processing problem of subjective question-answering.You can use this workbook as a supplement alongside The Kaggle Book or on its own alongside resources available on the Kaggle website and other online communities. Whatever path you choose, this workbook will help make you a formidable Kaggle competitor.

1039
Ładowanie...
EBOOK

The Machine Learning Solutions Architect Handbook. Practical strategies and best practices on the ML lifecycle, system design, MLOps, and generative AI - Second Edition

David Ping

David Ping, Head of GenAI and ML Solution Architecture for global industries at AWS, provides expert insights and practical examples to help you become a proficient ML solutions architect, linking technical architecture to business-related skills.You'll learn about ML algorithms, cloud infrastructure, system design, MLOps , and how to apply ML to solve real-world business problems. David explains the generative AI project lifecycle and examines Retrieval Augmented Generation (RAG), an effective architecture pattern for generative AI applications. You’ll also learn about open-source technologies, such as Kubernetes/Kubeflow, for building a data science environment and ML pipelines before building an enterprise ML architecture using AWS. As well as ML risk management and the different stages of AI/ML adoption, the biggest new addition to the handbook is the deep exploration of generative AI.By the end of this book , you’ll have gained a comprehensive understanding of AI/ML across all key aspects, including business use cases, data science, real-world solution architecture, risk management, and governance. You’ll possess the skills to design and construct ML solutions that effectively cater to common use cases and follow established ML architecture patterns, enabling you to excel as a true professional in the field.

1040
Ładowanie...
EBOOK

The Machine Learning Workshop. Get ready to develop your own high-performance machine learning algorithms with scikit-learn - Second Edition

Hyatt Saleh , John Wesley Doyle ,...

Machine learning algorithms are an integral part of almost all modern applications. To make the learning process faster and more accurate, you need a tool flexible and powerful enough to help you build machine learning algorithms quickly and easily. With The Machine Learning Workshop, you'll master the scikit-learn library and become proficient in developing clever machine learning algorithms.The Machine Learning Workshop begins by demonstrating how unsupervised and supervised learning algorithms work by analyzing a real-world dataset of wholesale customers. Once you've got to grips with the basics, you'll develop an artificial neural network using scikit-learn and then improve its performance by fine-tuning hyperparameters. Towards the end of the workshop, you'll study the dataset of a bank's marketing activities and build machine learning models that can list clients who are likely to subscribe to a term deposit. You'll also learn how to compare these models and select the optimal one.By the end of The Machine Learning Workshop, you'll not only have learned the difference between supervised and unsupervised models and their applications in the real world, but you'll also have developed the skills required to get started with programming your very own machine learning algorithms.