Big data

761
E-BOOK

OpenGL Data Visualization Cookbook. Over 35 hands-on recipes to create impressive, stunning visuals for a wide range of real-time, interactive applications using OpenGL

Raymond Chun Hing Lo, William Lo

OpenGL is a great multi-platform, cross-language, and hardware-accelerated graphics interface for visualizing large 2D and 3D datasets. Data visualization has become increasingly challenging using conventional approaches as datasets become larger and larger, especially with the Big Data evolution. From a mobile device to a sophisticated high-performance computing cluster, OpenGL libraries provide developers with an easy-to-use interface to create stunning visuals in 3D in real time for a wide range of interactive applications. This book provides a series of easy-to-follow, hands-on tutorials to create appealing OpenGL-based visualization tools with minimal development time. We will first illustrate how to quickly set up the development environment in Windows, Mac OS X, and Linux. Next, we will demonstrate how to visualize data for a wide range of applications using OpenGL, starting from simple 2D datasets to increasingly complex 3D datasets with more advanced techniques. Each chapter addresses different visualization problems encountered in real life and introduces the relevant OpenGL features and libraries in a modular fashion. By the end of this book, you will be equipped with the essential skills to develop a wide range of impressive OpenGL-based applications for your unique data visualization needs, on platforms ranging from conventional computers to the latest mobile/wearable devices.

762
E-BOOK

OpenStack Sahara Essentials. Integrate, deploy, rapidly configure, and successfully manage your own big data-intensive clusters in the cloud using OpenStack Sahara

Omar Khedher

The Sahara project is a module that aims to simplify the building of data processing capabilities on OpenStack. The goal of this book is to provide a focused, fast-paced guide to installing, configuring, and getting started with integrating Hadoop with OpenStack, using Sahara. The book explains how to deploy data-intensive Hadoop and Spark clusters on top of OpenStack. It also covers how to use the Sahara REST API, how to develop applications for Elastic Data Processing on OpenStack, and how to set up Hadoop or Spark clusters on OpenStack.

763
E-BOOK

Optimization Using Linear Programming. A Practical Guide to Mastering Linear Programming Techniques

Mercury Learning and Information, A. J. Metei,...

This book is designed for engineers, mathematicians, computer scientists, financial analysts, and anyone interested in using numerical linear algebra, matrix theory, and game theory to solve applied problems efficiently. It emphasizes solving linear programming problems with software like MS-Excel, Mathematica, MATLAB, WinQSB, and LINDO, while providing the necessary definitions and theorems for mastering theoretical aspects. The journey begins with basics of linear algebra using MS-Excel, followed by an introduction to linear programming problems and the graphical method. It then delves into the simplex method, duality, and sensitivity analysis. The course covers transportation, transshipment, assignment problems, and concludes with game theory. Each chapter builds on the previous one, ensuring a comprehensive understanding of the topics. Understanding these concepts is crucial for solving complex applied problems. This book transitions readers from basic to advanced techniques in numerical linear algebra and linear programming, combining theoretical knowledge with practical applications. It is an essential resource for mastering these topics and maximizing efficiency in problem-solving.

764
E-BOOK

Optimizing Databricks Workloads. Harness the power of Apache Spark in Azure and maximize the performance of modern big data workloads

Anirudh Kala, Anshul Bhatnagar, Sarthak Sarbahi

Databricks is an industry-leading, cloud-based platform for data analytics, data science, and data engineering supporting thousands of organizations across the world in their data journey. It is a fast, easy, and collaborative Apache Spark-based big data analytics platform for data science and data engineering in the cloud. In Optimizing Databricks Workloads, you will get started with a brief introduction to Azure Databricks and quickly begin to understand the important optimization techniques. The book covers how to select the optimal Spark cluster configuration for running big data processing and workloads in Databricks, useful optimization techniques for Spark DataFrames, best practices for optimizing Delta Lake, and techniques to optimize Spark jobs through Spark core. You will also learn about real-world scenarios where optimizing workloads in Databricks has helped organizations increase performance and save costs across various domains. By the end of this book, you will be prepared with the necessary toolkit to speed up your Spark jobs and process your data more efficiently.

765
E-BOOK

Optimizing Hadoop for MapReduce. This book is the perfect introduction to sophisticated concepts in MapReduce and will ensure you have the knowledge to optimize job performance. This is not an academic treatise; it's an example-driven tutorial for the real world

Khaled Tannir

This book is an example-based tutorial that deals with optimizing Hadoop for MapReduce job performance. If you are a Hadoop administrator, developer, MapReduce user, or beginner, this book is the best choice available if you wish to optimize your clusters and applications. Prior knowledge of creating MapReduce applications is not necessary, but it will help you better understand the concepts and snippets of MapReduce class template code.

766
E-BOOK

Oracle Database 12c Backup and Recovery Survival Guide. A comprehensive guide for every DBA to learn recovery and backup solutions

Francisco Munoz Alvarez, Aman Sharma

The three main responsibilities of a successful DBA are to ensure the availability, recoverability, and performance of any database. To ensure the recoverability of any database, a DBA needs to have a strong backup and recovery skill set. Every DBA is always looking for a reference book that will help them solve any backup and recovery situation they may come across in their professional life. Oracle Database 12c Backup and Recovery Survival Guide has the unique advantage of being a reference to all available Oracle backup and recovery options, making it essential for any DBA in the world. If you are new to Oracle Database, this book will introduce you to the fantastic world of backup and recovery that is vital to your success. If you are an experienced DBA, this book will become a reference guide and will also help you learn some new skills or give you some new ideas you were never aware of. It will also help you easily find the solution to some of the most well-known problems you could encounter during your career as a DBA. This book contains screenshots, scripts, and examples that you will find more than useful. Most of the books currently available on the market concentrate only on the RMAN utility for backup and recovery. This book is an exception to the rule and will become a must-have reference, allowing you to design a real and complete backup and recovery strategy. It covers the most important topics on Oracle Database such as backup strategies, Nologging operations, new features in 12c, user-managed backups and recoveries, RMAN (including reporting, catalog management, troubleshooting, and performance tuning), advanced Data Pump, Oracle Enterprise Manager 12c, and SQL Developer. Oracle Database 12c Backup and Recovery Survival Guide contains everything a DBA needs to know to keep data safe and recoverable, using real-life scenarios.

767
E-BOOK

OSINT w praktyce. Jak gromadzić i analizować dane dostępne w sieci (OSINT in Practice. How to Gather and Analyze Data Available Online)

Dale Meredith

The term OSINT comes from the English phrase open source intelligence. It means gathering data from publicly available sources. Extremely valuable information turns out to be within easy reach, but you need to know how to get to it. This can be a highly absorbing activity, and along the way you can learn the basics of cybersecurity, understand the threats lurking on the internet, and learn to protect your digital presence. With this book you will explore OSINT methods step by step, along with the related legal and ethical issues. You will learn ways of gathering and analyzing information using search engines, social networks, and other online resources. You will understand the importance of anonymity and of techniques that keep you safe online and make it easier to manage your digital footprint or create fictitious online identities. You will also gain experience with popular OSINT tools such as Recon-ng, Maltego, Shodan, and Aircrack-ng. You will learn how to reduce risk and how to anticipate, prevent, and respond to cyberattacks, all with OSINT-based techniques. In the book: how OSINT works and best practices; automating data collection and analysis; social media data and OSINT; managing your digital footprint, reducing risk, and protecting privacy; an effective OSINT-based risk analysis program; improving company security with OSINT techniques. Add powerful OSINT tools to your arsenal!

768
E-BOOK

Pandas Basics. Mastering Data Analysis with Pandas

Mercury Learning and Information, Oswald Campesato

This book is essential for aspiring data scientists and anyone needing to perform data cleaning using Pandas and NumPy. It offers numerous code samples and comprehensive coverage of NumPy and Pandas features, including writing regular expressions. Chapter 3 introduces fundamental statistical concepts, while Chapter 7 delves into data visualization using Matplotlib and Seaborn. Companion files with code are available for download from the publisher. Starting with an introduction to Python, the course progresses through working with data, and then moves into Pandas, covering its functionalities in three detailed chapters. The statistical concepts provided are crucial for analyzing data effectively, while the visualization techniques enhance the ability to present data insights clearly. By the end of this course, users will have a solid foundation in data manipulation and cleaning, statistical analysis, and data visualization, enabling them to tackle real-world data science tasks confidently and efficiently.