Data analysis
Real-Time Big Data Analytics. Design, process, and analyze large sets of complex data in real time
Shilpi Saxena
Enterprises have been striving hard to deal with the challenges of data arriving in real time or near real time. Although there are technologies such as Storm and Spark (and many more) that solve the challenges of real-time data, using the appropriate technology or framework for the right business use case is the key to success. This book provides you with the skills required to quickly design, implement, and deploy your real-time analytics using real-world examples of big data use cases. From the beginning of the book, we will cover the basics of varied real-time data processing frameworks and technologies. We will discuss and explain the differences between batch and real-time processing in detail, and will also explore the techniques and programming concepts using Apache Storm. Moving on, we’ll familiarize you with Amazon Kinesis for real-time data processing in the cloud. We will further develop your understanding of real-time analytics through a comprehensive review of Apache Spark along with the high-level architecture and the building blocks of a Spark program. You will learn how to transform your data, get an output from transformations, and persist your results using Spark RDDs, and how to use an interface called Spark SQL to work with Spark. At the end of this book, we will introduce Spark Streaming, the streaming library of Spark, and will walk you through the emerging Lambda Architecture (LA), which provides a hybrid platform for big data processing by combining real-time and precomputed batch data to provide a near real-time view of incoming data.
Redash v5 Quick Start Guide. Create and share interactive dashboards using Redash
Alexander Leibzon, Yael Leibzon
Data exploration and visualization are vital to Business Intelligence, the backbone of almost every enterprise or organization. Redash is a querying and visualization tool developed to simplify how marketing and business development departments are exposed to data. If you want to learn to create interactive dashboards with Redash, explore different visualizations, and share the insights with your peers, then this is the ideal book for you. The book starts with essential Business Intelligence concepts that are at the heart of data visualizations. You will learn how to find your way around Redash and its rich array of data visualization options for building interactive dashboards. You will learn how to create data stories and share them with peers. You will see how to connect to different data sources to process complex data, and then visualize this data to reveal valuable insights. By the end of this book, you will be confident using the Redash dashboarding tool to provide insight and tell stories with data.
Luca Massaron, Alberto Boschetti
Regression is the process of learning relationships between inputs and continuous outputs from example data, which enables predictions for novel inputs. There are many kinds of regression algorithms, and the aim of this book is to explain which is the right one to use for each set of problems and how to prepare real-world data for it. With this book you will learn to define a simple regression problem and evaluate its performance. The book will help you understand how to properly parse a dataset, clean it, and create an output matrix optimally built for regression. You will begin with a simple regression algorithm to solve some data science problems and then progress to more complex algorithms. The book will enable you to use regression models to predict outcomes and make critical business decisions. Through the book, you will gain the knowledge to use Python for building fast, better linear models and to apply the results in Python or in any computer language you prefer.
Giuseppe Ciaburro, Pierre Paquay, Manoj Kumar, Shaikh...
Regression analysis is a statistical process which enables prediction of relationships between variables. The predictions are based on the causal effect of one variable upon another. Regression techniques for modeling and analysis are applied to large sets of data in order to reveal hidden relationships among the variables. This book will give you a rundown of what regression analysis is, explaining the process from scratch. The first few chapters give an understanding of the different types of learning, supervised and unsupervised, and how they differ from each other. We then move on to cover supervised learning in detail, including the various aspects of regression analysis. The chapters are arranged in a way that gives a feel for all the steps covered in a data science process: loading the training dataset, handling missing values, EDA on the dataset, transformations and feature engineering, model building, assessing the model fit and performance, and finally making predictions on unseen datasets. Each chapter starts by explaining the theoretical concepts, and once the reader is comfortable with the theory, we move to practical examples to support the understanding. The practical examples are illustrated using R code, including different R packages such as R Stats, Caret and so on. Each chapter is a mix of theory and practical examples. By the end of this book you will know all the concepts and pain points related to regression analysis, and you will be able to apply what you have learned in your projects.
Svetlana Karslioglu
Pachyderm is an open source project that enables data scientists to run reproducible data pipelines and scale them to an enterprise level. This book will teach you how to implement Pachyderm to create collaborative data science workflows and reproduce your ML experiments at scale. You’ll begin your journey by exploring the importance of data reproducibility and comparing different data science platforms. Next, you’ll explore how Pachyderm fits into the picture and its significance, followed by learning how to install Pachyderm locally on your computer or a cloud platform of your choice. You’ll then discover the architectural components and Pachyderm's main pipeline principles and concepts. The book demonstrates how to use Pachyderm components to create your first data pipeline and advances to cover common operations involving data, such as moving data into and out of Pachyderm, to create more complex pipelines. Based on what you've learned, you'll develop an end-to-end ML workflow, before trying out the hyperparameter tuning technique and the different supported Pachyderm language clients. Finally, you’ll learn how to use a SaaS version of Pachyderm with Pachyderm Notebooks. By the end of this book, you will have learned all aspects of running your data pipelines in Pachyderm and be able to manage them on a day-to-day basis.
Robo-Advisor with Python. A hands-on guide to building and operating your own Robo-advisor
Aki Ranin
Robo-advisors are becoming table stakes for the wealth management industry across all segments, from retail to high-net-worth investors. Robo-advisors enable you to manage your own portfolios, and enable financial institutions to create automated platforms for effective digital wealth management. This book is your hands-on guide to understanding how Robo-advisors work, and how to build one efficiently. The chapters are designed to help you get a comprehensive grasp of what Robo-advisors do and how they are structured with an end-to-end workflow. You’ll begin by learning about the key decisions that influence the building of a Robo-advisor, along with considerations on building and licensing a platform. As you advance, you’ll find out how to build all the core capabilities of a Robo-advisor using Python, including goals, risk questionnaires, portfolios, and projections. The book also shows you how to create orders, as well as open accounts and perform KYC verification for transacting. Finally, you’ll be able to implement capabilities such as performance reporting and rebalancing for operating a Robo-advisor with ease. By the end of this book, you’ll have gained a solid understanding of how Robo-advisors work and be well on your way to building one for yourself or your business.
Lentin Joseph, Carol Fairchild, Dr. Thomas L....
ROS is a robust robotics framework that works regardless of hardware architecture or hardware origin. It standardizes most layers of robotics functionality, from device drivers to process control and from message passing to software package management. But apart from just plain functionality, ROS is a great platform to learn about robotics itself and to simulate, as well as actually build, your first robots. This does not mean that ROS is a platform only for students and other beginners; on the contrary, ROS is used all over the robotics industry to implement flying, walking and diving robots, yet implementation is always straightforward, and never dependent on the hardware itself. ROS Robotics has been the standard introduction to ROS for potential professionals and hobbyists alike since the original edition came out; the second edition adds a gradual introduction to all the goodness available with the Kinetic Kame release. By providing you with step-by-step examples including manipulator arms and flying robots, the authors introduce you to the new features. The book is intensely practical, with space given to theory only when absolutely necessary. By the end of this book, you will have hands-on experience in controlling robots with the best possible framework.
Enrico Murru
The Salesforce Advanced Administrator certification extends beyond administrator certification, covering advanced platform features and functions such as configuration, automation, security, and customization. Complete with comprehensive coverage of all these topics and exam-oriented questions and mock tests, this Salesforce book will help you earn advanced administrator credentials. You'll start your journey by mastering data access security, monitoring and auditing, and understanding best practices for handling change management and data across organizations. The book then delves into data model management for improving data quality and lets you explore Sales features such as products, schedules, quotes, and forecasting capabilities. As you progress, this book will guide you in working with content management to set up and maintain Salesforce content. You'll also master organizing your files and data using reports and dashboards. Finally, you'll learn how to use a combination of automation tools to solve business problems. By the end of the book, you will have developed the skills required to get your advanced administrator credentials.