Big data

665
EBOOK

Mastering Social Media Mining with R. Extract valuable data from your social media sites and make better business decisions using R

Vikram Garg, Sharan Kumar Ravindran

With an increase in the number of users on the web, the content generated has increased substantially, bringing in the need to gain insights into the untapped gold mine that is social media data. For computational statistics, R has an advantage over other languages in providing readily available data extraction and transformation packages, making it easier to carry out your ETL tasks. Along with this, its data visualization packages help users get a better understanding of the underlying data distributions, while its range of standard statistical packages simplifies analysis of the data. This book will teach you how powerful business cases are solved by applying machine learning techniques to social media data. You will learn about important and recent developments in the field of social media, along with a few advanced topics such as Open Authorization (OAuth). Through practical examples, you will access data from R using APIs of various social media sites such as Twitter, Facebook, Instagram, GitHub, Foursquare, LinkedIn, Blogger, and other networks. We will provide you with detailed explanations on the implementation of various use cases using R programming. With this handy guide, you will be ready to embark on your journey as an independent social media analyst.

666
EBOOK

Mastering spaCy. An end-to-end practical guide to implementing NLP applications using the Python ecosystem

Duygu Altinok

spaCy is an industrial-grade, efficient NLP Python library. It offers various pre-trained models and ready-to-use features. Mastering spaCy provides you with end-to-end coverage of spaCy's features and real-world applications. You'll begin by installing spaCy and downloading models, before progressing to spaCy's features and prototyping real-world NLP apps. Next, you'll get familiar with visualizing with spaCy's popular visualizer displaCy. The book also equips you with practical illustrations for pattern matching and helps you advance into the world of semantics with word vectors. Statistical information extraction methods are also explained in detail. Later, you'll cover an interactive business case study that shows you how to combine all spaCy features for creating a real-world NLP pipeline. You'll implement ML models such as sentiment analysis, intent recognition, and context resolution. The book further focuses on classification with popular frameworks such as TensorFlow's Keras API together with spaCy. You'll cover popular topics, including intent classification and sentiment analysis, apply them to popular datasets, and interpret the classification results. By the end of this book, you'll be able to confidently use spaCy, including its linguistic features, word vectors, and classifiers, to create your own NLP apps.

667
EBOOK

Mastering Spark for Data Science. Lightning fast and scalable data science solutions

Andrew Morgan, Antoine Amend, Matthew Hallett, David...

Data science seeks to transform the world using data, and this is typically achieved through disrupting and changing real processes in real industries. In order to operate at this level you need to build data science solutions of substance: solutions that solve real problems. Spark has emerged as the big data platform of choice for data scientists due to its speed, scalability, and easy-to-use APIs. This book deep dives into using Spark to deliver production-grade data science solutions. This process is demonstrated by exploring the construction of a sophisticated global news analysis service that uses Spark to generate continuous geopolitical and current affairs insights. You will learn all about the core Spark APIs and take a comprehensive tour of advanced libraries, including Spark SQL, Spark Streaming, MLlib, and more. You will be introduced to advanced techniques and methods that will help you to construct commercial-grade data products. Focusing on a sequence of tutorials that deliver a working news intelligence service, you will learn about advanced Spark architectures, how to work with geographic data in Spark, and how to tune Spark algorithms so they scale linearly.

668
EBOOK

Mastering Tableau 2019.1. An expert guide to implementing advanced business intelligence and analytics with Tableau 2019.1 - Second Edition

Marleen Meier, David Baldwin

Tableau is one of the leading business intelligence (BI) tools used to solve BI and analytics challenges. With this book, you will master Tableau's features and offerings in various paradigms of the BI domain. This book is also the second edition of the popular Mastering Tableau series, with new features, examples, and updated code. The book covers essential Tableau concepts and its advanced functionalities. Using Tableau Hyper and Tableau Prep, you’ll be able to handle and prepare data easily. You’ll gear up to perform complex joins, spatial joins, unions, and data blending tasks using practical examples. Following this, you’ll learn how to perform data densification to make displaying granular data easier. Next, you’ll explore expert-level examples to help you with advanced calculations, mapping, and visual design using various Tableau extensions. With the help of examples, you’ll also learn about improving dashboard performance, connecting to Tableau Server, and understanding data visualizations. In the final chapters, you’ll cover advanced use cases such as Self-Service Analytics, Time Series Analytics, and Geo-Spatial Analytics, and learn to connect Tableau to R, Python, and MATLAB. By the end of this book, you’ll have mastered the advanced offerings of Tableau and be able to tackle common and not-so-common challenges faced in the BI domain.

669
EBOOK

Mastering Tableau 2021. Implement advanced business intelligence techniques and analytics with Tableau - Third Edition

Marleen Meier, David Baldwin, Kate Strachnyi

Tableau is one of the leading business intelligence (BI) tools that can help you solve data analysis challenges. With this book, you will master Tableau's features and offerings in various paradigms of the BI domain. Updated with fresh topics including Quick Level of Detail expressions, the newest Tableau Server features, Einstein Discovery, and more, this book covers essential Tableau concepts and advanced functionalities. Leveraging Tableau Hyper files and using Prep Builder, you’ll be able to perform data preparation and handling easily. You’ll gear up to perform complex joins, spatial joins, unions, and data blending tasks using practical examples. Next, you’ll learn how to execute data densification and further explore expert-level examples to help you with calculations, mapping, and visual design using Tableau extensions. You’ll also learn about improving dashboard performance, connecting to Tableau Server, and understanding data visualization with examples. Finally, you'll cover advanced use cases such as self-service analysis, time series analysis, and geo-spatial analysis, and connect Tableau to Python and R to implement programming functionalities within it. By the end of this Tableau book, you’ll have mastered the advanced offerings of Tableau 2021 and be able to tackle common and advanced challenges in the BI domain.

670
EBOOK

Mastering Tableau 2026. Implement advanced data visualizations, BI techniques and AI-powered analytics with Tableau - Fifth Edition

Marleen Meier

This edition of the bestselling Tableau guide will teach you how to leverage Tableau's newest features and offerings in various paradigms of the BI domain. Updated with fresh topics, including the newest features in Tableau Server, Tableau Prep Builder, and Desktop, and up-to-date examples, this book will take you from mastering essential Tableau concepts to advanced functionalities. A chapter on AI-powered analytics has also been added. Throughout this book, you'll learn how to use Tableau Prep Builder to easily perform data preparation and handling, as well as complex joins, spatial joins, and unions, using practical examples. You'll get to grips with OData or Google BigQuery, executing data densification and exploring other expert-level examples to help you with calculations, mappings, and visual designs. Later chapters will teach you all about improving dashboard performance, Tableau Server, Tableau Exchange, data governance, and AI-powered features. Finally, you'll cover advanced use cases, such as self-service analysis, time series analysis, and geo-spatial analysis, and learn how to connect Tableau to Python and R to implement programming functionalities within Tableau. By the end of this book, you'll have mastered Tableau 2026 and be able to tackle common and advanced challenges in the BI domain.

671
EBOOK

Mastering TensorFlow 1.x. Advanced machine learning and deep learning concepts using TensorFlow 1.x and Keras

Armando Fandango

TensorFlow is the most popular numerical computation library built from the ground up for distributed, cloud, and mobile environments. TensorFlow represents the data as tensors and the computation as graphs. This book is a comprehensive guide that lets you explore the advanced features of TensorFlow 1.x. Gain insight into TensorFlow Core, Keras, TF Estimators, TFLearn, TF Slim, Pretty Tensor, and Sonnet. Leverage the power of TensorFlow and Keras to build deep learning models, using concepts such as transfer learning, generative adversarial networks, and deep reinforcement learning. Throughout the book, you will obtain hands-on experience with varied datasets, such as MNIST, CIFAR-10, PTB, text8, and COCO-Images. You will learn the advanced features of TensorFlow 1.x, such as distributed TensorFlow with TF Clusters, deploying production models with TensorFlow Serving, and building and deploying TensorFlow models for mobile and embedded devices on Android and iOS platforms. You will see how to call the TensorFlow and Keras APIs within the R statistical software, and learn the required techniques for debugging when TensorFlow API-based code does not work as expected. The book helps you obtain in-depth knowledge of TensorFlow, making you the go-to person for solving artificial intelligence problems. By the end of this guide, you will have mastered the offerings of TensorFlow and Keras, and gained the skills you need to build smarter, faster, and more efficient machine learning and deep learning systems.

672
EBOOK

Mastering Text Mining with R. Extract and recognize your text data

Avinash Paul, KUMAR ASHISH

Text mining (also called text data mining or text analytics) is the process of extracting useful, high-quality information from text by identifying patterns and trends. R provides an extensive ecosystem to mine text through its many frameworks and packages. Starting with basic information about the statistics concepts used in text mining, this book will teach you how to access, cleanse, and process text using the R language and will equip you with the tools and the associated knowledge about different tagging, chunking, and entailment approaches and their usage in natural language processing. Moving on, this book will teach you different dimensionality reduction techniques and their implementation in R. Next, we will cover pattern recognition in text data utilizing classification mechanisms, perform entity recognition, and develop an ontology learning framework. By the end of the book, you will develop a practical application from the concepts learned, and will understand how text mining can be leveraged to analyze the massive amounts of data available on social media.