Data Analysis

482
Ebook

SQL Server 2019 Revealed: Big Data Clusters and Machine Learning

Bob Ward

Get up to speed on the breakthrough changes in SQL Server 2019. It is no longer just a database engine but a cutting-edge tool with support for machine learning, Big Data analytics, Linux, containers, Kubernetes, Java, and data virtualization in Azure. This book does not cover traditional database administration for SQL Server. It focuses on everything new in one of the most successfully modernized data platforms in the industry. It is a book for data professionals who already know the fundamentals of SQL Server and want to grow by building skills in the hottest areas of new technology. We dig into the details of the key new capabilities of SQL Server 2019 using a "learn by example" approach. We cover Intelligent Performance, security, availability, and the features that modern developers expect. We discuss the enhancements to SQL Server 2019 on Linux and the use of containers and Kubernetes clusters. We show how to virtualize data access with PolyBase for Oracle, MongoDB, Hadoop, and Azure, which reduces the need for costly ETL applications. We also learn how to build comprehensive solutions with Big Data Clusters, the flagship feature of the 2019 release, which provides access to Spark, SQL Server, and HDFS, and how to build intelligence into our own data and deploy complete machine learning applications.
Learn how to:
• Implement Big Data Clusters with SQL Server, Spark, and HDFS
• Create data hubs with connections to Oracle, Azure, Hadoop, and other sources
• Combine SQL and Spark to build a machine learning platform for AI applications
• Boost performance without changing your applications by using Intelligent Performance
• Strengthen SQL Server security with Secure Enclaves and Data Classification
• Maximize database uptime through online indexing and Accelerated Database Recovery
• Build modern applications with Graph, ML Services, and T-SQL Extensibility for Java
• Expand your options for deploying SQL Server on Linux
• Run SQL Server in containers and on Kubernetes
• Use the latest tools and methods to migrate a database to SQL Server 2019
• Apply your knowledge of SQL Server 2019 in Azure

483
Ebook

OpenGL Data Visualization Cookbook. Over 35 hands-on recipes to create impressive, stunning visuals for a wide range of real-time, interactive applications using OpenGL

William Lo, Raymond Chun Hing Lo

OpenGL is a great multi-platform, cross-language, and hardware-accelerated graphics interface for visualizing large 2D and 3D datasets. Data visualization has become increasingly challenging using conventional approaches as datasets become larger and larger, especially with the Big Data evolution. From a mobile device to a sophisticated high-performance computing cluster, OpenGL libraries provide developers with an easy-to-use interface to create stunning visuals in 3D in real time for a wide range of interactive applications. This book provides a series of easy-to-follow, hands-on tutorials to create appealing OpenGL-based visualization tools with minimal development time. We will first illustrate how to quickly set up the development environment in Windows, Mac OS X, and Linux. Next, we will demonstrate how to visualize data for a wide range of applications using OpenGL, starting from simple 2D datasets to increasingly complex 3D datasets with more advanced techniques. Each chapter addresses different visualization problems encountered in real life and introduces the relevant OpenGL features and libraries in a modular fashion. By the end of this book, you will be equipped with the essential skills to develop a wide range of impressive OpenGL-based applications for your unique data visualization needs, on platforms ranging from conventional computers to the latest mobile/wearable devices.

484
Ebook

OpenStack Sahara Essentials. Integrate, deploy, rapidly configure, and successfully manage your own big data-intensive clusters in the cloud using OpenStack Sahara

Omar Khedher

The Sahara project is a module that aims to simplify the building of data processing capabilities on OpenStack. The goal of this book is to provide a focused, fast-paced guide to installing, configuring, and getting started with integrating Hadoop with OpenStack, using Sahara. The book explains how to deploy data-intensive Hadoop and Spark clusters on top of OpenStack. It also covers how to use the Sahara REST API, how to develop applications for Elastic Data Processing on OpenStack, and how to set up Hadoop or Spark clusters on OpenStack.

485
Ebook

Optimizing Databricks Workloads. Harness the power of Apache Spark in Azure and maximize the performance of modern big data workloads

Anirudh Kala, Anshul Bhatnagar, Sarthak Sarbahi

Databricks is an industry-leading, cloud-based platform for data analytics, data science, and data engineering, supporting thousands of organizations across the world in their data journey. It is a fast, easy, and collaborative Apache Spark-based big data analytics platform for data science and data engineering in the cloud. In Optimizing Databricks Workloads, you will get started with a brief introduction to Azure Databricks and quickly begin to understand the important optimization techniques. The book covers how to select the optimal Spark cluster configuration for running big data processing and workloads in Databricks, useful optimization techniques for Spark DataFrames, best practices for optimizing Delta Lake, and techniques to optimize Spark jobs through Spark core. You will also learn about real-world scenarios where optimizing workloads in Databricks has helped organizations increase performance and save costs across various domains. By the end of this book, you will be prepared with the necessary toolkit to speed up your Spark jobs and process your data more efficiently.

487
Ebook

Oracle Database 12c Backup and Recovery Survival Guide. A comprehensive guide for every DBA to learn recovery and backup solutions

Francisco Munoz Alvarez, Aman Sharma

The three main responsibilities of a successful DBA are to ensure the availability, recoverability, and performance of any database. To ensure the recoverability of any database, a DBA needs a strong backup and recovery skill set. Every DBA is looking for a reference book that will help them solve any backup and recovery situation they may come across in their professional life. Oracle Database 12c Backup and Recovery Survival Guide has the unique advantage of being a reference to all available Oracle backup and recovery options, making it essential for any DBA. If you are new to Oracle Database, this book will introduce you to the world of backup and recovery that is vital to your success. If you are an experienced DBA, this book will become a reference guide and will also help you learn new skills or give you ideas you were not aware of. It will also help you easily find solutions to some of the best-known problems you could encounter during your career as a DBA. The book contains useful screenshots, scripts, and examples. Most of the books currently on the market concentrate only on the RMAN utility for backup and recovery. This book is an exception to the rule and will become a must-have reference, allowing you to design a real and complete backup and recovery strategy. It covers the most important topics on Oracle Database, such as backup strategies, Nologging operations, new features in 12c, user-managed backups and recoveries, RMAN (including reporting, catalog management, troubleshooting, and performance tuning), advanced Data Pump, Oracle Enterprise Manager 12c, and SQL Developer. Oracle Database 12c Backup and Recovery Survival Guide contains everything a DBA needs to know to keep data safe and recoverable, using real-life scenarios.

488
Ebook

Pandas Basics. Mastering Data Analysis with Pandas

Mercury Learning and Information, Oswald Campesato

This book is essential for aspiring data scientists and anyone needing to perform data cleaning using Pandas and NumPy. It offers numerous code samples and comprehensive coverage of NumPy and Pandas features, including writing regular expressions. Chapter 3 introduces fundamental statistical concepts, while Chapter 7 delves into data visualization using Matplotlib and Seaborn. Companion files with code are available for download from the publisher. Starting with an introduction to Python, the course progresses through working with data, and then moves into Pandas, covering its functionalities in three detailed chapters. The statistical concepts provided are crucial for analyzing data effectively, while the visualization techniques enhance the ability to present data insights clearly. By the end of this course, users will have a solid foundation in data manipulation and cleaning, statistical analysis, and data visualization, enabling them to tackle real-world data science tasks confidently and efficiently.