Big data

753
Wird geladen...
E-BOOK

Numerical Computing with Python. Harness the power of Python to analyze and find hidden patterns in the data

Pratap Dangeti, Allen Yu, Claire Chung, Aldrin...

Data mining, or parsing the data to extract useful insights, is a niche skill that can transform your career as a data scientist Python is a flexible programming language that is equipped with a strong suite of libraries and toolkits, and gives you the perfect platform to sift through your data and mine the insights you seek. This Learning Path is designed to familiarize you with the Python libraries and the underlying statistics that you need to get comfortable with data mining.You will learn how to use Pandas, Python's popular library to analyze different kinds of data, and leverage the power of Matplotlib to generate appealing and impressive visualizations for the insights you have derived. You will also explore different machine learning techniques and statistics that enable you to build powerful predictive models.By the end of this Learning Path, you will have the perfect foundation to take your data mining skills to the next level and set yourself on the path to become a sought-after data science professional.This Learning Path includes content from the following Packt products:• Statistics for Machine Learning by Pratap Dangeti• Matplotlib 2.x By Example by Allen Yu, Claire Chung, Aldrin Yim• Pandas Cookbook by Theodore Petrou

755
Wird geladen...
E-BOOK

Odsłaniamy SQL Server 2019: Klastry Big Data i uczenie maszynowe

Bob Ward

Nabierz prędkości dzięki przełomowym zmianom w SQL Server 2019. Nie jest to już jedynie silnik bazodanowy, ale nowatorskie narzędzie wyposażone we wsparcie dla uczenia maszynowego, analiz Big Data, możliwość działania w systemie Linux, kontenery, Kubernetes, Javę czy wirtualizację danych w Azure. Ta książka nie zajmuje się tradycyjną administracją bazami danych w środowisku SQL Server. Koncentruje się na tym wszystkim, co nowe w jednej z najskuteczniej modernizowanych platform danych w branży. To książka dla profesjonalistów danych, którzy znają już podstawy SQL Server i chcą się rozwijać, rozbudowując umiejętności w najgorętszych obszarach nowych technologii. Zagłębimy się w szczegóły kluczowych nowych możliwości SQL Server 2019 przy użyciu podejścia „nauka przez przykład”. Zajmiemy się zagadnieniami Intelligent Performance, zabezpieczeń, dostępności i funkcjonalności oczekiwanych przez współczesnych programistów. Omówimy usprawnienia w SQL Server 2019 dla systemu Linux oraz wykorzystanie kontenerów i klastrów Kubernetes. Pokażemy, jak zwirtualizować dostęp do danych przy użyciu Polybase dla Oracle, MongoDB, Hadoop i Azure, co pozwala zredukować potrzebę stosowania kosztownych aplikacji ETL. Nauczymy się również, jak budować wszechstronne rozwiązania Big Data Clusters, sztandarowej funkcjonalności wydania 2019, zapewniającej dostęp do środowisk Spark, SQL Server HDFS i dowiemy się, jak wbudować inteligencję w nasze własne dane i wdrażać kompletne aplikacje uczenia maszynowego. Dowiedz się, jak: • Implementować Big Data Clusters przy użyciu SQL Server, Spark i HDFS • Tworzyć węzły danych z połączeniami do Oracle, Azure, Hadoop i innych źródeł • Łączyć SQL i Spark w celu zbudowania platformy uczenia maszynowego dla aplikacji AI • Zwiększyć wydajność bez zmieniania aplikacji przy użyciu Intelligent Performance • Podnieść zabezpieczenia SQL Server dzięki mechanizmom Secure Enclaves i Data Classification • Zmaksymalizować czas działania bazy danych poprzez indeksowanie online i Accelerated Database Recovery • Budować nowoczesne aplikacje przy użyciu narzędzi Graph, ML Services i T-SQL Extensibility dla języka Java • Zwiększyć możliwości wdrażania SQL Server w systemie Linux • Uruchamiać SQL Server w kontenerach i Kubernetes • Korzystać z najnowszych narzędzi i metod migracji bazy danych do SQL Server 2019 • Zastosować wiedzę na temat SQL Server 2019 w środowisku Azure

756
Wird geladen...
E-BOOK

Odsłaniamy SQL Server 2019: Klastry Big Data i uczenie maszynowe

Bob Ward

Nabierz prędkości dzięki przełomowym zmianom w SQL Server 2019. Nie jest to już jedynie silnik bazodanowy, ale nowatorskie narzędzie wyposażone we wsparcie dla uczenia maszynowego, analiz Big Data, możliwość działania w systemie Linux, kontenery, Kubernetes, Javę czy wirtualizację danych w Azure. Ta książka nie zajmuje się tradycyjną administracją bazami danych w środowisku SQL Server. Koncentruje się na tym wszystkim, co nowe w jednej z najskuteczniej modernizowanych platform danych w branży. To książka dla profesjonalistów danych, którzy znają już podstawy SQL Server i chcą się rozwijać, rozbudowując umiejętności w najgorętszych obszarach nowych technologii. Zagłębimy się w szczegóły kluczowych nowych możliwości SQL Server 2019 przy użyciu podejścia „nauka przez przykład”. Zajmiemy się zagadnieniami Intelligent Performance, zabezpieczeń, dostępności i funkcjonalności oczekiwanych przez współczesnych programistów. Omówimy usprawnienia w SQL Server 2019 dla systemu Linux oraz wykorzystanie kontenerów i klastrów Kubernetes. Pokażemy, jak zwirtualizować dostęp do danych przy użyciu Polybase dla Oracle, MongoDB, Hadoop i Azure, co pozwala zredukować potrzebę stosowania kosztownych aplikacji ETL. Nauczymy się również, jak budować wszechstronne rozwiązania Big Data Clusters, sztandarowej funkcjonalności wydania 2019, zapewniającej dostęp do środowisk Spark, SQL Server HDFS i dowiemy się, jak wbudować inteligencję w nasze własne dane i wdrażać kompletne aplikacje uczenia maszynowego. Dowiedz się, jak: • Implementować Big Data Clusters przy użyciu SQL Server, Spark i HDFS • Tworzyć węzły danych z połączeniami do Oracle, Azure, Hadoop i innych źródeł • Łączyć SQL i Spark w celu zbudowania platformy uczenia maszynowego dla aplikacji AI • Zwiększyć wydajność bez zmieniania aplikacji przy użyciu Intelligent Performance • Podnieść zabezpieczenia SQL Server dzięki mechanizmom Secure Enclaves i Data Classification • Zmaksymalizować czas działania bazy danych poprzez indeksowanie online i Accelerated Database Recovery • Budować nowoczesne aplikacje przy użyciu narzędzi Graph, ML Services i T-SQL Extensibility dla języka Java • Zwiększyć możliwości wdrażania SQL Server w systemie Linux • Uruchamiać SQL Server w kontenerach i Kubernetes • Korzystać z najnowszych narzędzi i metod migracji bazy danych do SQL Server 2019 • Zastosować wiedzę na temat SQL Server 2019 w środowisku Azure

757
Wird geladen...
E-BOOK

OpenCV 3 Computer Vision Application Programming Cookbook. Recipes to make your applications see - Third Edition

Robert Laganiere

Making your applications see has never been easier with OpenCV. With it, you can teach your robot how to follow your cat, write a program to correctly identify the members of One Direction, or even help you find the right colors for your redecoration.OpenCV 3 Computer Vision Application Programming Cookbook Third Edition provides a complete introduction to the OpenCV library and explains how to build your first computer vision program. You will be presented with a variety of computer vision algorithms and exposed to important concepts in image and video analysis that will enable you to build your own computer vision applications.This book helps you to get started with the library, and shows you how to install and deploy the OpenCV library to write effective computer vision applications following good programming practices. You will learn how to read and write images and manipulate their pixels. Different techniques for image enhancement and shape analysis will be presented. You will learn how to detect specific image features such as lines, circles or corners. You will be introduced to the concepts of mathematical morphology and image filtering. The most recent methods for image matching and object recognition are described, and you’ll discover how to process video from files or cameras, as well as how to detect and track moving objects. Techniques to achieve camera calibration and perform multiple-view analysis will also be explained. Finally, you’ll also get acquainted with recent approaches in machine learning and object classification.

758
Wird geladen...
E-BOOK

OpenCV 3.x with Python By Example. Make the most of OpenCV and Python to build applications for object recognition and augmented reality - Second Edition

Gabriel Garrido Calvo, Prateek Joshi

Computer vision is found everywhere in modern technology. OpenCV for Python enables us to run computer vision algorithms in real time. With the advent of powerful machines, we have more processing power to work with. Using this technology, we can seamlessly integrate our computer vision applications into the cloud. Focusing on OpenCV 3.x and Python 3.6, this book will walk you through all the building blocks needed to build amazing computer vision applications with ease.We start off by manipulating images using simple filtering and geometric transformations. We then discuss affine and projective transformations and see how we can use them to apply cool advanced manipulations to your photos like resizing them while keeping the content intact or smoothly removing undesired elements. We will then cover techniques of object tracking, body part recognition, and object recognition using advanced techniques of machine learning such as artificial neural network. 3D reconstruction and augmented reality techniques are also included. The book covers popular OpenCV libraries with the help of examples.This book is a practical tutorial that covers various examples at different levels, teaching you about the different functions of OpenCV and their actual implementation. By the end of this book, you will have acquired the skills to use OpenCV and Python to develop real-world computer vision applications.

759
Wird geladen...
E-BOOK

OpenGL Data Visualization Cookbook. Over 35 hands-on recipes to create impressive, stunning visuals for a wide range of real-time, interactive applications using OpenGL

William Lo, Raymond Chun Hing Lo

OpenGL is a great multi-platform, cross-language, and hardware-accelerated graphics interface for visualizing large 2D and 3D datasets. Data visualization has become increasingly challenging using conventional approaches as datasets become larger and larger, especially with the Big Data evolution. From a mobile device to a sophisticated high-performance computing cluster, OpenGL libraries provide developers with an easy-to-use interface to create stunning visuals in 3D in real time for a wide range of interactive applications.This book provides a series of easy-to-follow, hands-on tutorials to create appealing OpenGL-based visualization tools with minimal development time. We will first illustrate how to quickly set up the development environment in Windows, Mac OS X, and Linux. Next, we will demonstrate how to visualize data for a wide range of applications using OpenGL, starting from simple 2D datasets to increasingly complex 3D datasets with more advanced techniques. Each chapter addresses different visualization problems encountered in real life and introduces the relevant OpenGL features and libraries in a modular fashion.By the end of this book, you will be equipped with the essential skills to develop a wide range of impressive OpenGL-based applications for your unique data visualization needs, on platforms ranging from conventional computers to the latest mobile/wearable devices.

760
Wird geladen...
E-BOOK

OpenStack Sahara Essentials. Integrate, deploy, rapidly configure, and successfully manage your own big data-intensive clusters in the cloud using OpenStack Sahara

Omar Khedher

The Sahara project is a module that aims to simplify the building of data processing capabilities on OpenStack.The goal of this book is to provide a focused, fast paced guide to installing, configuring, and getting started with integrating Hadoop with OpenStack, using Sahara.The book should explain to users how to deploy their data-intensive Hadoop and Spark clusters on top of OpenStack. It will also cover how to use the Sahara REST API, how to develop applications for Elastic Data Processing on Openstack, and setting up hadoop or spark clusters on Openstack.

761
Wird geladen...
E-BOOK

Optimization Using Linear Programming. A Practical Guide to Mastering Linear Programming Techniques

Mercury Learning and Information, A. J. Metei,...

This book is designed for engineers, mathematicians, computer scientists, financial analysts, and anyone interested in using numerical linear algebra, matrix theory, and game theory to solve applied problems efficiently. It emphasizes solving linear programming problems with software like MS-Excel, Mathematica, MATLAB, WinQSB, and LINDO, while providing the necessary definitions and theorems for mastering theoretical aspects.The journey begins with basics of linear algebra using MS-Excel, followed by an introduction to linear programming problems and the graphical method. It then delves into the simplex method, duality, and sensitivity analysis. The course covers transportation, transshipment, assignment problems, and concludes with game theory. Each chapter builds on the previous one, ensuring a comprehensive understanding of the topics.Understanding these concepts is crucial for solving complex applied problems. This book transitions readers from basic to advanced techniques in numerical linear algebra and linear programming, combining theoretical knowledge with practical applications. It is an essential resource for mastering these topics and maximizing efficiency in problem-solving.

762
Wird geladen...
E-BOOK

Optimizing Databricks Workloads. Harness the power of Apache Spark in Azure and maximize the performance of modern big data workloads

Anirudh Kala, Anshul Bhatnagar, Sarthak Sarbahi

Databricks is an industry-leading, cloud-based platform for data analytics, data science, and data engineering supporting thousands of organizations across the world in their data journey. It is a fast, easy, and collaborative Apache Spark-based big data analytics platform for data science and data engineering in the cloud.In Optimizing Databricks Workloads, you will get started with a brief introduction to Azure Databricks and quickly begin to understand the important optimization techniques. The book covers how to select the optimal Spark cluster configuration for running big data processing and workloads in Databricks, some very useful optimization techniques for Spark DataFrames, best practices for optimizing Delta Lake, and techniques to optimize Spark jobs through Spark core. It contains an opportunity to learn about some of the real-world scenarios where optimizing workloads in Databricks has helped organizations increase performance and save costs across various domains.By the end of this book, you will be prepared with the necessary toolkit to speed up your Spark jobs and process your data more efficiently.

764
Wird geladen...
E-BOOK

Oracle Database 12c Backup and Recovery Survival Guide. A comprehensive guide for every DBA to learn recovery and backup solutions

Francisco Munoz Alvarez, Aman Sharma

The three main responsibilities of a successful DBA are to ensure the availability, recoverability, and performance of any database. To ensure the recoverability of any database, a DBA needs to have a strong backup and recovery skills set. Every DBA is always looking for a reference book that will help them to solve any possible backup and recovery situation that they can come across in their professional life.Oracle Database 12c Backup and Recovery Survival Guide has the unique advantage to be a reference to all Oracle backup and recovery options available, making it essential for any DBA in the world. If you are new to Oracle Database, this book will introduce you to the fantastic world of backup and recovery that is vital to your success. If you are an experienced DBA, this book will become a reference guide and will also help you to learn some possible new skills, or give you some new ideas you were never aware about. It will also help you to easily find the solution to some of the most well known problems you could find during your career as a DBA. This book contains useful screenshots, scripts, and examples that you will find more than useful.Most of the books currently available in the market concentrate only on the RMAN utility to backup and recovery. This book will be an exception to the rule and will become a must-have reference, allowing you to design a real and complete backup and recovery strategy. It covers the most important topics on Oracle database such as backup strategies, Nologging operations, new features in 12c, user managed backups and recoveries, RMAN (including reporting, catalog management, troubleshooting, and performance tuning), advanced data pump, Oracle Enterprise Manager 12c and SQL Developer.Oracle Database 12c Backup and Recovery Survival Guide contains everything a DBA needs to know to keep data safe and recoverable, using real-life scenarios.

765
Wird geladen...
E-BOOK

OSINT w praktyce. Jak gromadzić i analizować dane dostępne w sieci

Dale Meredith

Pojęcie OSINT pochodzi od angielskiego wyrażenia open source intelligence i oznacza biały wywiad. Polega na pozyskiwaniu danych z publicznie dostępnych źródeł. Okazuje się, że niezwykle cenne informacje są dostępne na wyciągnięcie ręki, ale trzeba wiedzieć, w jaki sposób do nich dotrzeć. A potrafi to być niezwykle wciągające zajęcie, przy okazji którego można poznać podstawy cyberbezpieczeństwa, zrozumieć czyhające w internecie zagrożenia i nauczyć się zabezpieczać swoją cyfrową obecność. Z tą książką krok po kroku zagłębisz się w metody OSINT, a także powiązane z nim zagadnienia natury prawnej i etycznej. Poznasz sposoby gromadzenia i analizowania informacji z wykorzystaniem wyszukiwarek, portali społecznościowych i innych zasobów internetowych. Zrozumiesz wagę anonimowości i technik gwarantujących bezpieczne poruszanie się po sieci, ułatwiających zarządzanie cyfrowym śladem czy tworzenie fikcyjnych tożsamości internetowych. Zdobędziesz również doświadczenie w korzystaniu z popularnych narzędzi OSINT, takich jak Recon-ng, Maltego, Shodan czy Aircrack-ng. Dowiesz się też, jak ograniczać ryzyko, przewidywać cyberataki, zapobiegać im i na nie reagować - wszystko dzięki technikom opartym na OSINT. W książce: działanie OSINT i najlepsze praktyki automatyzacja zbierania i analizy danych dane z mediów społecznościowych a OSINT zarządzanie swoim cyfrowym śladem, ograniczanie ryzyka i ochrona prywatności skuteczny program analizy ryzyka na bazie OSINT zwiększanie bezpieczeństwa firmy technikami OSINT Dołącz potężne narzędzia OSINT do swojego arsenału!

766
Wird geladen...
E-BOOK

Pandas Basics. Mastering Data Analysis with Pandas

Mercury Learning and Information, Oswald Campesato

This book is essential for aspiring data scientists and anyone needing to perform data cleaning using Pandas and NumPy. It offers numerous code samples and comprehensive coverage of NumPy and Pandas features, including writing regular expressions. Chapter 3 introduces fundamental statistical concepts, while Chapter 7 delves into data visualization using Matplotlib and Seaborn. Companion files with code are available for download from the publisher.Starting with an introduction to Python, the course progresses through working with data, and then moves into Pandas, covering its functionalities in three detailed chapters. The statistical concepts provided are crucial for analyzing data effectively, while the visualization techniques enhance the ability to present data insights clearly.By the end of this course, users will have a solid foundation in data manipulation and cleaning, statistical analysis, and data visualization, enabling them to tackle real-world data science tasks confidently and efficiently.

767
Wird geladen...
E-BOOK

Pandas Cookbook. Practical recipes for scientific computing, time series, and exploratory data analysis using Python - Third Edition

William Ayd, Matthew Harrison, Wes McKinney

Unlock the full power of pandas 2.x with this hands-on cookbook, designed for Python developers, data analysts, and data scientists who need fast, efficient solutions for real-world data challenges. This book provides practical, ready-to-use recipes to streamline your workflow. With step-by-step guidance, you'll master data wrangling, visualization, performance optimization, and scalable data analysis using pandas’ most powerful features. From importing and merging large datasets to advanced time series analysis and SQL-like operations, this cookbook equips you with the tools to analyze, manipulate, and visualize data like a pro. Learn how to boost efficiency, optimize memory usage, and seamlessly integrate pandas with NumPy, PyArrow, and databases. This book will help you transform raw data into actionable insights with ease.*Email sign-up and proof of purchase required

768
Wird geladen...
E-BOOK

Pandas Cookbook. Recipes for Scientific Computing, Time Series Analysis and Data Visualization using Python

Theodore Petrou

This book will provide you with unique, idiomatic, and fun recipes for both fundamental and advanced data manipulation tasks with pandas 0.20. Some recipes focus on achieving a deeper understanding of basic principles, or comparing and contrasting two similar operations. Other recipes will dive deep into a particular dataset, uncovering new and unexpected insights along the way.The pandas library is massive, and it's common for frequent users to be unaware of many of its more impressive features. The official pandas documentation, while thorough, does not contain many useful examples of how to piece together multiple commands like one would do during an actual analysis. This book guides you, as if you were looking over the shoulder of an expert, through practical situations that you are highly likely to encounter.Many advanced recipes combine several different features across the pandas 0.20 library to generate results.

769
Wird geladen...
E-BOOK

Pentaho 8 Reporting for Java Developers. Create pixel-perfect analytical reports using reporting tools

Francesco Corti

This hands-on tutorial, filled with exercises and examples, introduces the reader to a variety of concepts within Pentaho Reporting. With screenshots that show you how reports look at design time as well as how they should look when rendered as PDF, Excel, HTML, Text, Rich-Text-File, XML, and CSV, this book also contains complete example source code that you can copy and paste into your environment to get up-and-running quickly. Updated to cover the features of Pentaho 8, this book will teach you everything you need to know to build fast, efficient reports using Pentaho. If your interest lies in the technical details of creating reports and you want to see how to solve common reporting problems with a minimum of fuss, this is the book for you.