Analiza danych

Analiza danych jest ekscytującą dyscypliną, która umożliwia zrozumienie pewnych zjawisk, uzyskanie wglądu i wiedzy na podstawie surowych danych. Pojęcie to oznacza dokładnie przetwarzanie danych za pomocą technik matematycznych i statystycznych w celu uzyskania cennych wniosków, podjęcia ważnych decyzji i opracowania przydatnych produktów. Termin ten wywodzi się od angielskiego data science, często traktowanego jako synonim takich terminów, jak analityka biznesowa, badania operacyjne, business intelligence, wywiad konkurencyjny, analiza i modelowanie danych, a także pozyskiwanie wiedzy. Dzięki takim technologiom, jak języki Python czy R, platformy Hadoop i Spark masz szansę wyciągnąć maksimum wniosków, dostrzec szanse na rozwój swojej organizacji albo przewidzieć i zapobiec zagrożeniom.

409
Ładowanie...
EBOOK

Machine Learning with scikit-learn Quick Start Guide. Classification, regression, and clustering techniques in Python

Kevin Jolly

Scikit-learn is a robust machine learning library for the Python programming language. It provides a set of supervised and unsupervised learning algorithms. This book is the easiest way to learn how to deploy, optimize, and evaluate all of the important machine learning algorithms that scikit-learn provides.This book teaches you how to use scikit-learn for machine learning. You will start by setting up and configuring your machine learning environment with scikit-learn. To put scikit-learn to use, you will learn how to implement various supervised and unsupervised machine learning models. You will learn classification, regression, and clustering techniques to work with different types of datasets and train your models.Finally, you will learn about an effective pipeline to help you build a machine learning project from scratch. By the end of this book, you will be confident in building your own machine learning models for accurate predictions.

410
Ładowanie...
EBOOK

Machine Learning with Spark. Develop intelligent, distributed machine learning systems - Second Edition

Rajdeep Dua, Manpreet Singh Ghotra

This book will teach you about popular machine learning algorithms and their implementation. You will learn how various machine learning concepts are implemented in the context of Spark ML. You will start by installing Spark in a single and multinode cluster. Next you'll see how to execute Scala and Python based programs for Spark ML. Then we will take a few datasets and go deeper into clustering, classification, and regression. Toward the end, we will also cover text processing using Spark ML.Once you have learned the concepts, they can be applied to implement algorithms in either green-field implementations or to migrate existing systems to this new platform. You can migrate from Mahout or Scikit to use Spark ML.By the end of this book, you will acquire the skills to leverage Spark's features to create your own scalable machine learning applications and power a modern data-driven business.

411
Ładowanie...
EBOOK

Machine Learning with TensorFlow 1.x. Second generation machine learning with Google's brainchild - TensorFlow 1.x

Saif Ahmed, Quan Hua, Shams Ul Azeem

Google's TensorFlow is a game changer in the world of machine learning. It has made machine learning faster, simpler, and more accessible than ever before. This book will teach you how to easily get started with machine learning using the power of Python and TensorFlow 1.x. Firstly, you’ll cover the basic installation procedure and explore the capabilities of TensorFlow 1.x. This is followed by training and running the first classifier, and coverage of the unique features of the library including data ?ow graphs, training, and the visualization of performance with TensorBoard—all within an example-rich context using problems from multiple industries. You’ll be able to further explore text and image analysis, and be introduced to CNN models and their setup in TensorFlow 1.x. Next, you’ll implement a complete real-life production system from training to serving a deep learning model. As you advance you’ll learn about Amazon Web Services (AWS) and create a deep neural network to solve a video action recognition problem. Lastly, you’ll convert the Caffe model to TensorFlow and be introduced to the high-level TensorFlow library, TensorFlow-Slim.By the end of this book, you will be geared up to take on any challenges of implementing TensorFlow 1.x in your machine learning environment.

412
Ładowanie...
EBOOK

Machine Learning with the Elastic Stack. Expert techniques to integrate machine learning with distributed search and analytics

Rich Collier, Bahaaldine Azarmi

Machine Learning with the Elastic Stack is a comprehensive overview of the embedded commercial features of anomaly detection and forecasting. The book starts with installing and setting up Elastic Stack. You will perform time series analysis on varied kinds of data, such as log files, network flows, application metrics, and financial data.As you progress through the chapters, you will deploy machine learning within the Elastic Stack for logging, security, and metrics. In the concluding chapters, you will see how machine learning jobs can be automatically distributed and managed across the Elasticsearch cluster and made resilient to failure.By the end of this book, you will understand the performance aspects of incorporating machine learning within the Elastic ecosystem and create anomaly detection jobs and view results from Kibana directly.

413
Ładowanie...
EBOOK

Managing Data as a Product. Design and build data-product-centered socio-technical architectures

Andrea Gioia, Giulio Scotti

Traditional monolithic data platforms struggle with scalability and burden central data teams with excessive cognitive load, leading to challenges in managing technological debt. As maintenance costs escalate, these platforms lose their ability to provide sustained value over time. With two decades of hands-on experience implementing data solutions and his pioneering work in the Open Data Mesh Initiative, Andrea Gioia brings practical insights and proven strategies for transforming how organizations manage their data assets.Managing Data as a Product introduces a modular and distributed approach to data platform development, centered on the concept of data products. In this book, you’ll explore the rationale behind this shift, understand the core features and structure of data products, and learn how to identify, develop, and operate them in a production environment. The book guides you through designing and implementing an incremental, value-driven strategy for adopting data product-centered architectures, including strategies for securing buy-in from stakeholders. It also covers data modeling in distributed environments and its role in enabling modern generative AI.By the end of this book, you’ll understand product-centric data architecture and how to adopt it.*Email sign-up and proof of purchase required

414
Ładowanie...
EBOOK

Managing Data Integrity for Finance. Discover practical data quality management strategies for finance analysts and data professionals

Jane Sarah Lat

Data integrity management plays a critical role in the success and effectiveness of organizations trying to use financial and operational data to make business decisions. Unfortunately, there is a big gap between the analysis and management of finance data along with the proper implementation of complex data systems across various organizations.The first part of this book covers the important concepts for data quality and data integrity relevant to finance, data, and tech professionals. The second part then focuses on having you use several data tools and platforms to manage and resolve data integrity issues on financial data. The last part of this the book covers intermediate and advanced solutions, including managed cloud-based ledger databases, database locks, and artificial intelligence, to manage the integrity of financial data in systems and databases.After finishing this hands-on book, you will be able to solve various data integrity issues experienced by organizations globally.

415
Ładowanie...
EBOOK

Managing Data Science. Effective strategies to manage data science projects and build a sustainable team

Kirill Dubovikov

Data science and machine learning can transform any organization and unlock new opportunities. However, employing the right management strategies is crucial to guide the solution from prototype to production. Traditional approaches often fail as they don't entirely meet the conditions and requirements necessary for current data science projects. In this book, you'll explore the right approach to data science project management, along with useful tips and best practices to guide you along the way. After understanding the practical applications of data science and artificial intelligence, you'll see how to incorporate them into your solutions. Next, you will go through the data science project life cycle, explore the common pitfalls encountered at each step, and learn how to avoid them. Any data science project requires a skilled team, and this book will offer the right advice for hiring and growing a data science team for your organization. Later, you'll be shown how to efficiently manage and improve your data science projects through the use of DevOps and ModelOps. By the end of this book, you will be well versed with various data science solutions and have gained practical insights into tackling the different challenges that you'll encounter on a daily basis.

416
Ładowanie...
EBOOK

Managing Microsoft Teams: MS-700 Exam Guide. Configure and manage Microsoft Teams workloads and achieve Microsoft 365 certification with ease

Peter Rising, Nate Chamberlain

Do you want to build and test your proficiency in the deployment, management, and monitoring of Microsoft Teams features within the Microsoft 365 platform? Managing Microsoft Teams: MS-700 Exam Guide will help you to effectively plan and implement Microsoft Teams using the Microsoft 365 Teams admin center and Windows PowerShell. You’ll also discover best practices for rolling out and managing MS services for Teams users within your Microsoft 365 tenant. The chapters are divided into three easy-to-follow parts: planning and design, feature policies and administration, and team management, while aligning with the official MS-700 exam objectives to help you prepare effectively for the exam.The book starts by taking you through planning and design, where you’ll learn how to plan migrations, make assessments for network readiness, and plan and implement governance tasks such as configuring guest access and monitoring usage. Later, you’ll understand feature administration, focusing on collaboration, meetings, live events, phone numbers, and the phone system, along with applicable policy configurations. Finally, the book shows you how to manage Teams and membership settings and create app policies.By the end of this book, you'll have learned everything you need to pass the MS-700 certification exam and have a handy reference guide for MS Teams.

417
Ładowanie...
EBOOK

Market Research and Analysis. Mastering Market Research: Advanced Methods, Design, and Data Analysis

Mercury Learning and Information, Marcus Goncalves

This book offers an in-depth exploration of market research and analysis, guiding readers through the entire process from defining research objectives to communicating results. Begin by understanding the purpose and ethics of market research, laying a strong groundwork for your studies. Progress to defining precise research objectives and exploring secondary research methods to gather existing information.Next, engage with primary research methods, focusing on both quantitative and qualitative approaches. Learn how to develop and distribute surveys, choose the right sampling techniques, and utilize tools for data mining and web scraping. Gain insights into focus groups and observation studies, understanding how these qualitative methods can provide depth to your research.Finally, master the art of data analysis and result communication. Explore descriptive statistics, hypothesis testing, and inferential statistics to make sense of your data. Learn to effectively present your findings to stakeholders, ensuring your research translates into actionable insights. By the end of the course, you will be well-equipped to conduct thorough market research and communicate your results effectively.

418
Ładowanie...
EBOOK

Marketing i analityka biznesowa dla początkujących. Poznaj najważniejsze narzędzia i wykorzystaj ich możliwości

Kinga Sroka

Witaj w świecie fascynujących danych! Biznes nie istnieje bez twardych danych, założeń, KPI i ich realizacji. Także ta jego część, która jest związana z promocją. Szczególnie online. Dziś nie można być marketerem z prawdziwego zdarzenia i nie znać chociażby narzędzi oferowanych przez Google ― Analytics, Tag Manager, Search Console i Trends. Te nazwy kojarzą Ci się z czymś niezrozumiałym albo budzą obawy? Nie martw się i sięgnij po tę książkę! Dowiesz się z niej, jak efektywnie korzystać z internetowych rozwiązań analitycznych. Ten poradnik stanowi świetne wprowadzenie do marketingu i analityki biznesowej online dla osób, które dopiero zapoznają się z tym tematem. To nie tylko przegląd narzędzi współczesnego analityka. Autorka opisuje również kompetencje, które będą potrzebne osobom z branży w najbliższej przyszłości, wskazuje miejsca, gdzie już można je zdobywać, wreszcie podpowiada, jakie umiejętności trzeba mieć, by otrzymać wymarzoną pracę w firmach zajmujących się danymi cyfrowymi.

419
Ładowanie...
EBOOK

Master Your Data with Power Query in Excel and Power BI. Leveraging Power Query to Get & Transform Your Task Flow

MrExcel's Holy Macro! Books, Miguel Escobar, Ken...

This book equips you with the essential skills to master Power Query in Excel and Power BI. Starting with the basics, you'll learn query management, data types, and error handling, establishing a solid foundation. You'll explore techniques to move queries between Excel and Power BI, ensuring seamless workflow integration. As the guide progresses, you'll delve into data import methods from flat files, Excel, web-based, and relational sources, while performing key transformations like appending, combining, and reshaping data.Advanced topics such as conditional logic, Power Query values, and M Language fundamentals will enhance your ability to customize and optimize queries. The book also covers the creation of parameters and custom functions, alongside applying sophisticated date and time techniques.Finally, you'll learn to optimize query performance and automate data refreshes, ensuring your analysis remains current. By the end of this guide, you'll have the confidence and expertise to effectively transform and manage data using Power Query, significantly enhancing your data analysis capabilities in Excel and Power BI.

420
Ładowanie...
EBOOK

Mastering Apache Solr 7.x. An expert guide to advancing, optimizing, and scaling your enterprise search

Sandeep Nair, Chintan Mehta, Dharmesh Vasoya

Apache Solr is the only standalone enterprise search server with a REST-like application interface. providing highly scalable, distributed search and index replication for many of the world's largest internet sites.To begin with, you would be introduced to how you perform full text search, multiple filter search, perform dynamic clustering and so on helping you to brush up the basics of Apache Solr. You will also explore the new features and advanced options released in Apache Solr 7.x which will get you numerous performance aspects and making data investigation simpler, easier and powerful. You will learn to build complex queries, extensive filters and how are they compiled in your system to bring relevance in your search tools. You will learn to carry out Solr scoring, elements affecting the document score and how you can optimize or tune the score for the application at hand. You will learn to extract features of documents, writing complex queries in re-ranking the documents. You will also learn advanced options helping you to know what content is indexed and how the extracted content is indexed. Throughout the book, you would go through complex problems with solutions along with varied approaches to tackle your business needs. By the end of this book, you will gain advanced proficiency to build out-of-box smart search solutions for your enterprise demands.

421
Ładowanie...
EBOOK

Mastering Apache Spark 2.x. Advanced techniques in complex Big Data processing, streaming analytics and machine learning - Second Edition

Romeo Kienzler

Apache Spark is an in-memory, cluster-based Big Data processing system that provides a wide range of functionalities such as graph processing, machine learning, stream processing, and more. This book will take your knowledge of Apache Spark to the next level by teaching you how to expand Spark’s functionality and build your data flows and machine/deep learning programs on top of the platform.The book starts with a quick overview of the Apache Spark ecosystem, and introduces you to the new features and capabilities in Apache Spark 2.x. You will then work with the different modules in Apache Spark such as interactive querying with Spark SQL, using DataFrames and DataSets effectively, streaming analytics with Spark Streaming, and performing machine learning and deep learning on Spark using MLlib and external tools such as H20 and Deeplearning4j. The book also contains chapters on efficient graph processing, memory management and using Apache Spark on the cloud.By the end of this book, you will have all the necessary information to master Apache Spark, and use it efficiently for Big Data processing and analytics.

422
Ładowanie...
EBOOK

Mastering Apache Storm. Real-time big data streaming using Kafka, Hbase and Redis

Ankit Jain

Apache Storm is a real-time Big Data processing framework that processes large amounts of data reliably, guaranteeing that every message will be processed. Storm allows you to scale your data as it grows, making it an excellent platform to solve your big data problems. This extensive guide will help you understand right from the basics to the advanced topics of Storm.The book begins with a detailed introduction to real-time processing and where Storm fits in to solve these problems. You’ll get an understanding of deploying Storm on clusters by writing a basic Storm Hello World example. Next we’ll introduce you to Trident and you’ll get a clear understanding of how you can develop and deploy a trident topology. We cover topics such as monitoring, Storm Parallelism, scheduler and log processing, in a very easy to understand manner. You will also learn how to integrate Storm with other well-known Big Data technologies such as HBase, Redis, Kafka, and Hadoop to realize the full potential of Storm.With real-world examples and clear explanations, this book will ensure you will have a thorough mastery of Apache Storm. You will be able to use this knowledge to develop efficient, distributed real-time applications to cater to your business needs.

423
Ładowanie...
EBOOK

Mastering Blockchain. Deeper insights into decentralization, cryptography, Bitcoin, and popular Blockchain frameworks

Imran Bashir

Blockchain is a distributed database that enables permanent, transparent, and secure storage of data. The blockchain technology is the backbone of cryptocurrency – in fact, it’s the shared public ledger upon which the entire Bitcoin network relies – and it’s gaining popularity with people who work in finance, government, and the arts.Blockhchain technology uses cryptography to keep data secure. This book gives a detailed description of this leading technology and its implementation in the real world.This book begins with the technical foundations of blockchain, teaching you the fundamentals of cryptography and how it keeps data secure. You will learn about the mechanisms behind cryptocurrencies and how to develop applications using Ethereum, a decentralized virtual machine. You will explore different blockchain solutions and get an exclusive preview into Hyperledger, an upcoming blockchain solution from IBM and the Linux Foundation. You will also be shown how to implement blockchain beyond currencies, scability with blockchain, and the future scope of this fascinating and powerful technology.

424
Ładowanie...
EBOOK

Mastering Business Intelligence with MicroStrategy. Master Business Intelligence with Microstrategy 10

Dmitry Anoshin, Himani Rana, Ning Ma, Neil...

Business intelligence is becoming more important by the day, with cloud offerings and mobile devices gaining wider acceptance and achieving better market penetration. MicroStrategy Reporting Suite is an absolute leader in the BI market and offers rich capabilities from basic data visualizations to predictive analytics. It lets you various delivery methods such as the Web, desktops, and mobiles.Using real-world BI scenarios, this book helps you to implement Business Analytics solutions in big e-commerce companies. It kicks off with MicroStrategy 10 features and then covers schema design models and techniques. Building upon your existing knowledge, the book will teach you advanced techniques for building documents and dashboards. It further teaches various graphical techniques for presenting data for analysis using maps, graphs, and advanced charts. Although MicroStrategy has rich functionality, the book will show how to customize it in order to meet your business requirements. You will also become familiar with the native analytical functions that will help you to maximize the impact of BI solutions with powerful predictive analytics. Furthermore, the book will focus on MicroStrategy Mobile Analytics along with data discovery and desktop capabilities such as connecting various data sources and building interactive dashboards. The book will also uncover best practices, troubleshooting techniques for MicroStrategy system administration, and also security and authentication techniques. Lastly, you will learn to use Hadoop for MicroStrategy reporting.By the end of the book, you will become proficient in evaluating any BI software in order to choose the best one that meets all business requirements.

426
428
Ładowanie...
EBOOK

Mastering Elastic Stack. Dive into data analysis with a pursuit of mastering ELK Stack on real-world scenarios

Ravi Kumar Gupta, Yuvraj Gupta

Even structured data is useless if it can’t help you to take strategic decisions and improve existing system. If you love to play with data, or your job requires you to process custom log formats, design a scalable analysis system, and manage logs to do real-time data analysis, this book is your one-stop solution. By combining the massively popular Elasticsearch, Logstash, Beats, and Kibana, elastic.co has advanced the end-to-end stack that delivers actionable insights in real time from almost any type of structured or unstructured data source. If your job requires you to process custom log formats, design a scalable analysis system, explore a variety of data, and manage logs, this book is your one-stop solution. You will learn how to create real-time dashboards and how to manage the life cycle of logs in detail through real-life scenarios.This book brushes up your basic knowledge on implementing the Elastic Stack and then dives deeper into complex and advanced implementations of the Elastic Stack. We’ll help you to solve data analytics challenges using the Elastic Stack and provide practical steps on centralized logging and real-time analytics with the Elastic Stack in production. You will get to grip with advanced techniques for log analysis and visualization. Newly announced features such as Beats and X-Pack are also covered in detail with examples.Toward the end, you will see how to use the Elastic stack for real-world case studies and we’ll show you some best practices and troubleshooting techniques for the Elastic Stack.

429
Ładowanie...
EBOOK

Mastering Elasticsearch 5.x. Master the intricacies of Elasticsearch 5 and use it to create flexible and scalable search solutions - Third Edition

Bharvi Dixit

Elasticsearch is a modern, fast, distributed, scalable, fault tolerant, and open source search and analytics engine. Elasticsearch leverages the capabilities of Apache Lucene, and provides a new level of control over how you can index and search even huge sets of data.This book will give you a brief recap of the basics and also introduce you to the new features of Elasticsearch 5. We will guide you through the intermediate and advanced functionalities of Elasticsearch, such as querying, indexing, searching, and modifying data. We’ll also explore advanced concepts, including aggregation, index control, sharding, replication, and clustering. We’ll show you the modules of monitoring and administration available in Elasticsearch, and will also cover backup and recovery. You will get an understanding of how you can scale your Elasticsearch cluster to contextualize it and improve its performance. We’ll also show you how you can create your own analysis plugin in Elasticsearch. By the end of the book, you will have all the knowledge necessary to master Elasticsearch and put it to efficient use.

430
Ładowanie...
EBOOK

Mastering Geospatial Analysis with Python. Explore GIS processing and learn to work with GeoDjango, CARTOframes and MapboxGL-Jupyter

Silas Toms, Paul Crickard, Eric van Rees

Python comes with a host of open source libraries and tools that help you work on professional geoprocessing tasks without investing in expensive tools. This book will introduce Python developers, both new and experienced, to a variety of new code libraries that have been developed to perform geospatial analysis, statistical analysis, and data management. This book will use examples and code snippets that will help explain how Python 3 differs from Python 2, and how these new code libraries can be used to solve age-old problems in geospatial analysis.You will begin by understanding what geoprocessing is and explore the tools and libraries that Python 3 offers. You will then learn to use Python code libraries to read and write geospatial data. You will then learn to perform geospatial queries within databases and learn PyQGIS to automate analysis within the QGIS mapping suite. Moving forward, you will explore the newly released ArcGIS API for Python and ArcGIS Online to perform geospatial analysis and create ArcGIS Online web maps. Further, you will deep dive into Python Geospatial web frameworks and learn to create a geospatial REST API.

431
Ładowanie...
EBOOK

Mastering Hadoop 3. Big data processing at scale to unlock unique business insights

Chanchal Singh, Manish Kumar

Apache Hadoop is one of the most popular big data solutions for distributed storage and for processing large chunks of data. With Hadoop 3, Apache promises to provide a high-performance, more fault-tolerant, and highly efficient big data processing platform, with a focus on improved scalability and increased efficiency.With this guide, you’ll understand advanced concepts of the Hadoop ecosystem tool. You’ll learn how Hadoop works internally, study advanced concepts of different ecosystem tools, discover solutions to real-world use cases, and understand how to secure your cluster. It will then walk you through HDFS, YARN, MapReduce, and Hadoop 3 concepts. You’ll be able to address common challenges like using Kafka efficiently, designing low latency, reliable message delivery Kafka systems, and handling high data volumes. As you advance, you’ll discover how to address major challenges when building an enterprise-grade messaging system, and how to use different stream processing systems along with Kafka to fulfil your enterprise goals.By the end of this book, you’ll have a complete understanding of how components in the Hadoop ecosystem are effectively integrated to implement a fast and reliable data pipeline, and you’ll be equipped to tackle a range of real-world problems in data pipelines.