Analiza danych

361
Ebook

Learning PySpark. Click here to enter text

Tomasz Drabas, Denny Lee

Apache Spark is an open source framework for efficient cluster computing with a strong interface for data parallelism and fault tolerance. This book will show you how to leverage the power of Python and put it to use in the Spark ecosystem. You will start by getting a firm understanding of the Spark 2.0 architecture and how to set up a Python environment for Spark. You will get familiar with the modules available in PySpark. You will learn how to abstract data with RDDs and DataFrames and understand the streaming capabilities of PySpark. Also, you will get a thorough overview of machine learning capabilities of PySpark using ML and MLlib, graph processing using GraphFrames, and polyglot persistence using Blaze. Finally, you will learn how to deploy your applications to the cloud using the spark-submit command. By the end of this book, you will have established a firm understanding of the Spark Python API and how it can be used to build data-intensive applications.

362
Ebook

Learning QGIS. Create great maps and perform geoprocessing tasks with ease - Third Edition

QGIS is a user-friendly open source geographic information system (GIS) that runs on Linux, Unix, Mac OS X, and Windows. The popularity of open source geographic information systems and QGIS in particular has been growing rapidly over the last few years.Learning QGIS Third Edition is a practical, hands-on guide updated for QGIS 2.14 that provides you with clear, step-by-step exercises to help you apply your GIS knowledge to QGIS. Through clear, practical exercises, this book will introduce you to working with QGIS quickly and painlessly.This book takes you from installing and configuring QGIS to handling spatial data to creating great maps. You will learn how to load and visualize existing spatialdata and create data from scratch. You will get to know important plugins, perform common geoprocessing and spatial analysis tasks and automate them with Processing.We will cover how to achieve great cartographic output and print maps. Finally, you will learn how to extend QGIS using Python and even create your own plugin.

363
Ebook

Learning Quantitative Finance with R. Implement machine learning, time-series analysis, algorithmic trading and more

Dr. Param Jeet, PRASHANT VATS

The role of a quantitative analyst is very challenging, yet lucrative, so there is a lot of competition for the role in top-tier organizations and investment banks. This book is your go-to resource if you want to equip yourself with the skills required to tackle any real-world problem in quantitative finance using the popular R programming language.You'll start by getting an understanding of the basics of R and its relevance in the field of quantitative finance. Once you've built this foundation, we'll dive into the practicalities of building financialmodels in R. This will help you have a fair understanding of the topics as well as their implementation, as the authors have presented some use cases along with examples that are easy to understand and correlate.We'll also look at risk management and optimization techniques for algorithmic trading. Finally, the book will explain some advanced concepts, such as trading using machine learning, optimizations, exotic options, and hedging.By the end of this book, you will have a firm grasp of the techniques required to implement basic quantitative finance models in R.

364
Ebook

Learning R Programming. Language, tools, and practical techniques

Kun Ren

R is a high-level functional language and one of the must-know tools for data science and statistics. Powerful but complex, R can be challenging for beginners and those unfamiliar with its unique behaviors. Learning R Programming is the solution - an easy and practical way to learn R and develop a broad and consistent understanding of the language. Through hands-on examples you'll discover powerful R tools, and R best practices that will give you a deeper understanding of working with data. You'll get to grips with R's data structures and data processing techniques, as well as the most popular R packages to boost your productivity from the offset.Start with the basics of R, then dive deep into the programming techniques and paradigms to make your R code excel. Advance quickly to a deeper understanding of R's behavior as you learn common tasks including data analysis, databases, web scraping, high performance computing, and writing documents. By the end of the book, you'll be a confident R programmer adept at solving problems with the right techniques.

365
Ebook

Learning Responsive Data Visualization. Create stunning data visualizations that look awesome on every device and screen resolutions

Christoph Körner

Using D3.js and Responsive Design principles, you will not just be able to implement visualizations that look and feel awesome across all devices and screen resolutions, but you will also boost your productivity and reduce development time by making use of Bootstrap—the most popular framework for developing responsive web applications.This book teaches the basics of scalable vector graphics (SVG), D3.js, and Bootstrap while focusing on Responsive Design as well as mobile-first visualizations; the reader will start by discovering Bootstrap and how it can be used for creating responsive applications, and then implement a basic bar chart in D3.js. You will learn about loading, parsing, and filtering data in JavaScript and then dive into creating a responsive visualization by using Media Queries, responsive interactions for Mobile and Desktop devices, and transitions to bring the visualization to life. In the following chapters, we build a fully responsive interactive map to display geographic data using GeoJSON and set up integration testing with Protractor to test the application across real devices using a mobile API gateway such as AWS Device Farm. You will finish the journey by discovering the caveats of mobile-first applications and learn how to master cross-browser complications.

366
Ebook

Learning SAP Analytics Cloud. Collaborate, predict and solve business intelligence problems with cloud computing

Riaz Ahmed

The book starts with the basics of SAP Analytics Cloud (formerly known as SAP BusinessObjects Cloud) and exposes almost every significant feature a beginner needs to master. Packed with illustrations and short, essential, to-the-point descriptions, the book provides a unique learning experience. Your journey of exploration starts with a basic introduction to the SAP Analytics Cloud platform. You will then learn about different segments of the product, such as Models, Stories, Digital Boardroom, and so on. Then, you are introduced to the product's interface: the Home screen, the main menu, and more. Then comes the hands-on aspect of the book, which starts with model creation. Next, you learn how to utilize a model to prepare different types of stories(reports) with the help of charts, tables, Geo Maps, and more. In the final chapters of this book, you will learn about Digital Boardroom, Collaboration, and Administration.

367
Ebook

Learning Social Media Analytics with R. Transform data from social media platforms into actionable business insights

Dipanjan Sarkar, Raghav Bali, Tushar Sharma

The Internet has truly become humongous, especially with the rise of various forms of social media in the last decade, which give users a platform to express themselves and also communicate and collaborate with each other. This book will help the reader to understand the current social media landscape and to learn how analytics can be leveraged to derive insights from it. This data can be analyzed to gain valuable insights into the behavior and engagement of users, organizations, businesses, and brands. It will help readers frame business problems and solve them using social data.The book will also cover several practical real-world use cases on social media using R and its advanced packages to utilize data science methodologies such as sentiment analysis, topic modeling, text summarization, recommendation systems, social network analysis, classification, and clustering. This will enable readers to learn different hands-on approaches to obtain data from diverse social media sources such as Twitter and Facebook. It will also show readers how to establish detailed workflows to process, visualize, and analyze data to transform social data into actionable insights.

368
Ebook

Learning Spark SQL. Architect streaming analytics and machine learning solutions

Aurobindo Sarkar

In the past year, Apache Spark has been increasingly adopted for the development of distributed applications. Spark SQL APIs provide an optimized interface that helps developers build such applications quickly and easily. However, designing web-scale production applications using Spark SQL APIs can be a complex task. Hence, understanding the design and implementation best practices before you start your project will help you avoid these problems.This book gives an insight into the engineering practices used to design and build real-world, Spark-based applications. The book's hands-on examples will give you the required confidence to work on any future projects you encounter in Spark SQL.It starts by familiarizing you with data exploration and data munging tasks using Spark SQL and Scala. Extensive code examples will help you understand the methods used to implement typical use-cases for various types of applications. You will get a walkthrough of the key concepts and terms that are common to streaming, machine learning, and graph applications. You will also learn key performance-tuning details including Cost Based Optimization (Spark 2.2) in Spark SQL applications. Finally, you will move on to learning how such systems are architected and deployed for a successful delivery of your project.

369
Ebook

Learning Splunk Web Framework. Create, extend and publish real time Splunk applications

Vincent Sesto

Building rich applications on the Web using Splunk is now simpler than ever before with the Splunk Web Framework. It empowers developers to build their own web applications with custom dashboards, tables, charts, form searches, and other functionalities in the datasets at their disposal. The book will start with the fundamentals of the Splunk Web Framework, teaching you the secrets of building interesting and user-friendly applications. In the first application, you will learn to analyze and monitor traffic hitting the NASA website and learn to create dashboards for it. You will then learn additional, and more detailed, techniques to enhance the functionalities of the app such as dashboards and forms, editing simple XML, using simple XML extensions, tokens, post-process searches, dynamic drill-downs, the Splunk Web Framework and REST API, and much more. The second app will use historical stock market data and will create custom dashboards using Splunk Web Framework; the book will now cover important topics such as creating HTML dashboards, enhancing the visual appeal of the app using CSS, and moving your app with SplunkJS.The book will provide different and interesting examples instead of the usual “Log, Index, Search, and Graph” so that Splunk will be the first tool readers think of to resolve a problem.

370
Ebook

Learning Tableau 2019. Tools for Business Intelligence, data prep, and visual analytics - Third Edition

Joshua N. Milligan

Tableau is the gold standard of business intelligence and visual analytics tools in every industry. It enables rapid data visualization and interpretation with charts, graphs, dashboards, and much more. Updated with the latest features of Tableau, this book takes you from the foundations of the Tableau 2019 paradigm through to advanced topics.This third edition of the bestselling guide by Tableau Zen Master, Joshua Milligan, will help you come to grips with updated features, such as set actions and transparent views. Beginning with installation, you'll create your first visualizations with Tableau and then explore practical examples and advanced techniques. You'll create bar charts, tree maps, scatterplots, time series, and a variety of other visualizations. Next, you'll discover techniques to overcome challenges presented by data structure and quality and engage in effective data storytelling and decision making with business critical information. Finally, you'll be introduced to Tableau Prep, and learn how to use it to integrate and shape data for analysis.By the end of this book, you will be equipped to leverage the powerful features of Tableau 2019 for decision making.

371
Ebook
372
Ebook

Learning YARN. Moving beyond MapReduce - learn resource management and big data processing using YARN

Akhil Arora, Shrey Mehrotra, Shreyank Gupta

Today enterprises generate huge volumes of data. In order to provide effective services and to make smarter and more intelligent decisions from these huge volumes of data, enterprises use big-data analytics. In recent years, Hadoop has been used for massive data storage and efficient distributed processing of data. The Yet Another Resource Negotiator (YARN) framework solves the design problems related to resource management faced by the Hadoop 1.x framework by providing a more scalable, efficient, flexible, and highly available resource management framework for distributed data processing.This book starts with an overview of the YARN features and explains how YARN provides a business solution for growing big data needs. You will learn to provision and manage single, as well as multi-node, Hadoop-YARN clusters in the easiest way. You will walk through the YARN administration, life cycle management, application execution, REST APIs, schedulers, security framework and so on. You will gain insights about the YARN components and features such as ResourceManager, NodeManager, ApplicationMaster, Container, Timeline Server, High Availability, Resource Localisation and so on.The book explains Hadoop-YARN commands and the configurations of components and explores topics such as High Availability, Resource Localization and Log aggregation. You will then be ready to develop your own ApplicationMaster and execute it over a Hadoop-YARN cluster.Towards the end of the book, you will learn about the security architecture and integration of YARN with big data technologies like Spark and Storm. This book promises conceptual as well as practical knowledge of resource management using YARN.

373
Ebook

Limitless Analytics with Azure Synapse. An end-to-end analytics service for data processing, management, and ingestion for BI and ML

Prashant Kumar Mishra, Mukesh Kumar

Azure Synapse Analytics, which Microsoft describes as the next evolution of Azure SQL Data Warehouse, is a limitless analytics service that brings enterprise data warehousing and big data analytics together. With this book, you'll learn how to discover insights from your data effectively using this platform.The book starts with an overview of Azure Synapse Analytics, its architecture, and how it can be used to improve business intelligence and machine learning capabilities. Next, you'll go on to choose and set up the correct environment for your business problem. You'll also learn a variety of ways to ingest data from various sources and orchestrate the data using transformation techniques offered by Azure Synapse. Later, you'll explore how to handle both relational and non-relational data using the SQL language. As you progress, you'll perform real-time streaming and execute data analysis operations on your data using various languages, before going on to apply ML techniques to derive accurate and granular insights from data. Finally, you'll discover how to protect sensitive data in real time by using security and privacy features.By the end of this Azure book, you'll be able to build end-to-end analytics solutions while focusing on data prep, data management, data warehousing, and AI tasks.

374
Ebook

Linear Regression With Python. A Tutorial Introduction to the Mathematics of Regression Analysis

James V Stone

This book offers a detailed yet approachable introduction to linear regression, blending mathematical theory with Python-based practical applications. Beginning with fundamentals, it explains the best-fitting line, regression and causation, and statistical measures like variance, correlation, and the coefficient of determination. Clear examples and Python code ensure readers can connect theory to implementation.As the journey continues, readers explore statistical significance through concepts like t-tests, z-tests, and p-values, understanding how to assess slopes, intercepts, and overall model fit. Advanced chapters cover multivariate regression, introducing matrix formulations, the best-fitting plane, and methods to handle multiple variables. Topics such as Bayesian regression, nonlinear models, and weighted regression are explored in depth, with step-by-step coding guides for hands-on practice.The final sections tie together these techniques with maximum likelihood estimation and practical summaries. Appendices provide resources such as matrix tutorials, key equations, and mathematical symbols. Designed for both beginners and professionals, this book ensures a structured learning experience. Basic mathematical knowledge or foundation is recommended.

375
Ebook

LLM Prompt Engineering for Developers. The Art and Science of Unlocking LLMs' True Potential

Aymen El Amri

LLM Prompt Engineering For Developers begins by laying the groundwork with essential principles of natural language processing (NLP), setting the stage for more complex topics. It methodically guides readers through the initial steps of understanding how large language models work, providing a solid foundation that prepares them for the more intricate aspects of prompt engineering.As you proceed, the book transitions into advanced strategies and techniques that reveal how to effectively interact with and utilize these powerful models. From crafting precise prompts that enhance model responses to exploring innovative methods like few-shot and zero-shot learning, this resource is designed to unlock the full potential of language model technology.This book not only teaches the technical skills needed to excel in the field but also addresses the broader implications of AI technology. It encourages thoughtful consideration of ethical issues and the impact of AI on society. By the end of this book, readers will master the technical aspects of prompt engineering & appreciate the importance of responsible AI development, making them well-rounded professionals ready to focus on the advancement of this cutting-edge technology.

376
Ebook

Machine Learning Algorithms. A reference guide to popular algorithms for data science and machine learning

Giuseppe Bonaccorso

In this book, you will learn all the important machine learning algorithms that are commonly used in the field of data science. These algorithms can be used for supervised as well as unsupervised learning, reinforcement learning, and semi-supervised learning. The algorithms that are covered in this book are linear regression, logistic regression, SVM, naïve Bayes, k-means, random forest, TensorFlow and feature engineering.In this book, you will how to use these algorithms to resolve your problems, and how they work. This book will also introduce you to natural language processing and recommendation systems, which help you to run multiple algorithms simultaneously.On completion of the book, you will know how to pick the right machine learning algorithm for clustering, classification, or regression for your problem

377
Ebook

Machine Learning Automation with TPOT. Build, validate, and deploy fully automated machine learning models with Python

Dario Radečić

The automation of machine learning tasks allows developers more time to focus on the usability and reactivity of the software powered by machine learning models. TPOT is a Python automated machine learning tool used for optimizing machine learning pipelines using genetic programming. Automating machine learning with TPOT enables individuals and companies to develop production-ready machine learning models cheaper and faster than with traditional methods.With this practical guide to AutoML, developers working with Python on machine learning tasks will be able to put their knowledge to work and become productive quickly. You'll adopt a hands-on approach to learning the implementation of AutoML and associated methodologies. Complete with step-by-step explanations of essential concepts, practical examples, and self-assessment questions, this book will show you how to build automated classification and regression models and compare their performance to custom-built models. As you advance, you'll also develop state-of-the-art models using only a couple of lines of code and see how those models outperform all of your previous models on the same datasets.By the end of this book, you'll have gained the confidence to implement AutoML techniques in your organization on a production level.

378
Ebook

Machine Learning for Developers. Uplift your regular applications with the power of statistics, analytics, and machine learning

Rodolfo Bonnin

Most of us have heard about the term Machine Learning, but surprisingly the question frequently asked by developers across the globe is, “How do I get started in Machine Learning?”. One reason could be attributed to the vastness of the subject area because people often get overwhelmed by the abstractness of ML and terms such as regression, supervised learning, probability density function, and so on. This book is a systematic guide teaching you how to implement various Machine Learning techniques and their day-to-day application and development. You will start with the very basics of data and mathematical models in easy-to-follow language that you are familiar with; you will feel at home while implementing the examples. The book will introduce you to various libraries and frameworks used in the world of Machine Learning, and then, without wasting any time, you will get to the point and implement Regression, Clustering, classification, Neural networks, and more with fun examples. As you get to grips with the techniques, you’ll learn to implement those concepts to solve real-world scenarios for ML applications such as image analysis, Natural Language processing, and anomaly detections of time series data. By the end of the book, you will have learned various ML techniques to develop more efficient and intelligent applications.

379
Ebook

Machine Learning for the Web. Gaining insight and intelligence from the internet with Python

Andrea Isoni

Python is a general purpose and also a comparatively easy to learn programming language. Hence it is the language of choice for data scientists to prototype, visualize, and run data analyses on small and medium-sized data sets. This is a unique book that helps bridge the gap between machine learning and web development. It focuses on the difficulties of implementing predictive analytics in web applications. We focus on the Python language, frameworks, tools, and libraries, showing you how to build a machine learning system. You will explore the core machine learning concepts and then develop and deploy the data into a web application using the Django framework. You will also learn to carry out web, document, and server mining tasks, and build recommendation engines. Later, you will explore Python’s impressive Django framework and will find out how to build a modern simple web app with machine learning features.

380
Ebook

Machine Learning in Microservices. Productionizing microservices architecture for machine learning solutions

Mohamed Abouahmed, Omar Ahmed

With the rising need for agile development and very short time-to-market system deployments, incorporating machine learning algorithms into decoupled fine-grained microservices systems provides the perfect technology mix for modern systems. Machine Learning in Microservices is your essential guide to staying ahead of the curve in this ever-evolving world of technology.The book starts by introducing you to the concept of machine learning microservices architecture (MSA) and comparing MSA with service-based and event-driven architectures, along with how to transition into MSA. Next, you’ll learn about the different approaches to building MSA and find out how to overcome common practical challenges faced in MSA design. As you advance, you’ll get to grips with machine learning (ML) concepts and see how they can help better design and run MSA systems. Finally, the book will take you through practical examples and open source applications that will help you build and run highly efficient, agile microservices systems.By the end of this microservices book, you’ll have a clear idea of different models of microservices architecture and machine learning and be able to combine both technologies to deliver a flexible and highly scalable enterprise system.

381
Ebook

Machine Learning Model Serving Patterns and Best Practices. A definitive guide to deploying, monitoring, and providing accessibility to ML models in production

Md Johirul Islam

Serving patterns enable data science and ML teams to bring their models to production. Most ML models are not deployed for consumers, so ML engineers need to know the critical steps for how to serve an ML model.This book will cover the whole process, from the basic concepts like stateful and stateless serving to the advantages and challenges of each. Batch, real-time, and continuous model serving techniques will also be covered in detail. Later chapters will give detailed examples of keyed prediction techniques and ensemble patterns. Valuable associated technologies like TensorFlow severing, BentoML, and RayServe will also be discussed, making sure that you have a good understanding of the most important methods and techniques in model serving. Later, you’ll cover topics such as monitoring and performance optimization, as well as strategies for managing model drift and handling updates and versioning. The book will provide practical guidance and best practices for ensuring that your model serving pipeline is robust, scalable, and reliable. Additionally, this book will explore the use of cloud-based platforms and services for model serving using AWS SageMaker with the help of detailed examples.By the end of this book, you'll be able to save and serve your model using state-of-the-art techniques.

382
Ebook

Machine Learning Quick Reference. Quick and essential machine learning hacks for training smart data models

Rahul Kumar

Machine learning makes it possible to learn about the unknowns and gain hidden insights into your datasets by mastering many tools and techniques. This book guides you to do just that in a very compact manner.After giving a quick overview of what machine learning is all about, Machine Learning Quick Reference jumps right into its core algorithms and demonstrates how they can be applied to real-world scenarios. From model evaluation to optimizing their performance, this book will introduce you to the best practices in machine learning. Furthermore, you will also look at the more advanced aspects such as training neural networks and work with different kinds of data, such as text, time-series, and sequential data. Advanced methods and techniques such as causal inference, deep Gaussian processes, and more are also covered.By the end of this book, you will be able to train fast, accurate machine learning models at your fingertips, which you can easily use as a point of reference.

383
Ebook

Machine Learning Techniques for Text. Apply modern techniques with Python for text processing, dimensionality reduction, classification, and evaluation

Nikos Tsourakis

With the ever-increasing demand for machine learning and programming professionals, it's prime time to invest in the field. This book will help you in this endeavor, focusing specifically on text data and human language by steering a middle path among the various textbooks that present complicated theoretical concepts or focus disproportionately on Python code.A good metaphor this work builds upon is the relationship between an experienced craftsperson and their trainee. Based on the current problem, the former picks a tool from the toolbox, explains its utility, and puts it into action. This approach will help you to identify at least one practical use for each method or technique presented. The content unfolds in ten chapters, each discussing one specific case study. For this reason, the book is solution-oriented. It's accompanied by Python code in the form of Jupyter notebooks to help you obtain hands-on experience. A recurring pattern in the chapters of this book is helping you get some intuition on the data and then implement and contrast various solutions.By the end of this book, you'll be able to understand and apply various techniques with Python for text preprocessing, text representation, dimensionality reduction, machine learning, language modeling, visualization, and evaluation.

384
Ebook

Machine Learning with Apache Spark Quick Start Guide. Uncover patterns, derive actionable insights, and learn from big data using MLlib

Jillur Quddus

Every person and every organization in the world manages data, whether they realize it or not. Data is used to describe the world around us and can be used for almost any purpose, from analyzing consumer habits to fighting disease and serious organized crime. Ultimately, we manage data in order to derive value from it, and many organizations around the world have traditionally invested in technology to help process their data faster and more efficiently.But we now live in an interconnected world driven by mass data creation and consumption where data is no longer rows and columns restricted to a spreadsheet, but an organic and evolving asset in its own right. With this realization comes major challenges for organizations: how do we manage the sheer size of data being created every second (think not only spreadsheets and databases, but also social media posts, images, videos, music, blogs and so on)? And once we can manage all of this data, how do we derive real value from it?The focus of Machine Learning with Apache Spark is to help us answer these questions in a hands-on manner. We introduce the latest scalable technologies to help us manage and process big data. We then introduce advanced analytical algorithms applied to real-world use cases in order to uncover patterns, derive actionable insights, and learn from this big data.