Analiza danych
Pandas Basics. Mastering Data Analysis with Pandas
Mercury Learning and Information, Oswald Campesato
This book is essential for aspiring data scientists and anyone needing to perform data cleaning using Pandas and NumPy. It offers numerous code samples and comprehensive coverage of NumPy and Pandas features, including writing regular expressions. Chapter 3 introduces fundamental statistical concepts, while Chapter 7 delves into data visualization using Matplotlib and Seaborn. Companion files with code are available for download from the publisher.Starting with an introduction to Python, the course progresses through working with data, and then moves into Pandas, covering its functionalities in three detailed chapters. The statistical concepts provided are crucial for analyzing data effectively, while the visualization techniques enhance the ability to present data insights clearly.By the end of this course, users will have a solid foundation in data manipulation and cleaning, statistical analysis, and data visualization, enabling them to tackle real-world data science tasks confidently and efficiently.
William Ayd, Matthew Harrison, Wes McKinney
Unlock the full power of pandas 2.x with this hands-on cookbook, designed for Python developers, data analysts, and data scientists who need fast, efficient solutions for real-world data challenges. This book provides practical, ready-to-use recipes to streamline your workflow. With step-by-step guidance, you'll master data wrangling, visualization, performance optimization, and scalable data analysis using pandas’ most powerful features. From importing and merging large datasets to advanced time series analysis and SQL-like operations, this cookbook equips you with the tools to analyze, manipulate, and visualize data like a pro. Learn how to boost efficiency, optimize memory usage, and seamlessly integrate pandas with NumPy, PyArrow, and databases. This book will help you transform raw data into actionable insights with ease.*Email sign-up and proof of purchase required
Theodore Petrou
This book will provide you with unique, idiomatic, and fun recipes for both fundamental and advanced data manipulation tasks with pandas 0.20. Some recipes focus on achieving a deeper understanding of basic principles, or comparing and contrasting two similar operations. Other recipes will dive deep into a particular dataset, uncovering new and unexpected insights along the way.The pandas library is massive, and it's common for frequent users to be unaware of many of its more impressive features. The official pandas documentation, while thorough, does not contain many useful examples of how to piece together multiple commands like one would do during an actual analysis. This book guides you, as if you were looking over the shoulder of an expert, through practical situations that you are highly likely to encounter.Many advanced recipes combine several different features across the pandas 0.20 library to generate results.
Francesco Corti
This hands-on tutorial, filled with exercises and examples, introduces the reader to a variety of concepts within Pentaho Reporting. With screenshots that show you how reports look at design time as well as how they should look when rendered as PDF, Excel, HTML, Text, Rich-Text-File, XML, and CSV, this book also contains complete example source code that you can copy and paste into your environment to get up-and-running quickly. Updated to cover the features of Pentaho 8, this book will teach you everything you need to know to build fast, efficient reports using Pentaho. If your interest lies in the technical details of creating reports and you want to see how to solve common reporting problems with a minimum of fuss, this is the book for you.
María Carina Roldán
Capturing, manipulating, cleansing, transferring, and loading data effectively are the prime requirements in every IT organization. Achieving these tasks require people devoted to developing extensive software programs, or investing in ETL or data integration tools that can simplify this work.Pentaho Data Integration is a full-featured open source ETL solution that allows you to meet these requirements. Pentaho Data Integration has an intuitive, graphical, drag-and-drop design environment and its ETL capabilities are powerful. However, getting started with Pentaho Data Integration can be difficult or confusing.Pentaho Data Integration Beginner's Guide - Second Edition provides the guidance needed to overcome that difficulty, covering all the possible key features of Pentaho Data Integration.Pentaho Data Integration Beginner's Guide - Second Edition starts with the installation of Pentaho Data Integration software and then moves on to cover all the key Pentaho Data Integration concepts. Each chapter introduces new features, allowing you to gradually get involved with the tool. First, you will learn to do all kinds of data manipulation and work with plain files. Then, the book gives you a primer on databases and teaches you how to work with databases inside Pentaho Data Integration. Moreover, you will be introduced to data warehouse concepts and you will learn how to load data in a data warehouse. After that, you will learn to implement simple and complex processes. Finally, you will have the opportunity of applying and reinforcing all the learned concepts through the implementation of a simple datamart.With Pentaho Data Integration Beginner's Guide - Second Edition, you will learn everything you need to know in order to meet your data manipulation requirements.
Pentaho Data Integration Quick Start Guide. Create ETL processes using Pentaho
María Carina Roldán
Pentaho Data Integration(PDI) is an intuitive and graphical environment packed with drag and drop design and powerful Extract-Transform-Load (ETL) capabilities. Given its power and flexibility, initial attempts to use the Pentaho Data Integration tool can be difficult or confusing. This book is the ideal solution.This book reduces your learning curve with PDI. It provides the guidance needed to make you productive, covering the main features of Pentaho Data Integration. It demonstrates the interactive features of the graphical designer, and takes you through the main ETL capabilities that the tool offers.By the end of the book, you will be able to use PDI for extracting, transforming, and loading the types of data you encounter on a daily basis.
Podstawy wizualizacji danych. Zasady tworzenia atrakcyjnych wykresów
Claus O. Wilke
Aby skutecznie przekazać wiedzę płynącą z zebranych danych, szczególnie w przypadku nauk przyrodniczych, ekonomicznych i społecznych, warto sięgnąć po narzędzia do wizualizacji. Nie jest to trywialne zadanie: często trzeba korzystać z coraz większych i coraz bardziej złożonych zbiorów danych, a dostępne narzędzia do wizualizacji zapewniają oszałamiający wybór rozwiązań i opcji, w których łatwo się pogubić. Co więcej, podczas przygotowywania prezentacji należy wziąć pod uwagę szczególne właściwości ludzkiego umysłu w zakresie postrzegania i przyswajania informacji. Efektywna wizualizacja danych jest bardzo istotną sprawą: nierzadko od jakości przekazywanych w ten sposób informacji zależy trafność podejmowanych decyzji. Ten poradnik pomoże Ci uniknąć często spotykanych problemów z wizualizacją danych. Zawiera wskazówki, dzięki którym szybko zaczniesz tworzyć świetnie wyglądające i bogate w treść wykresy. Nauczysz się bezbłędnego dobierania najlepszego w danej sytuacji sposobu wizualizacji, poznasz reguły stosowania kolorów, wielkości i rodzaju czcionki, zachowywania odpowiednich proporcji poszczególnych części wykresu — i przekonasz się, że dobrymi decyzjami co do jego estetyki możesz zapewnić mu przejrzystość i elegancję. Znajdziesz tu również obszerny katalog wizualizacji, co ułatwi zapoznanie się z powszechnie używanymi metodami wizualizowania danych. Ważną częścią książki jest zestaw przykładów dobrze i źle przygotowanych wykresów z wyczerpującymi wyjaśnieniami. Najciekawsze koncepcje opisane w książce: kolor jako cenne narzędzie wyróżniania danych dostarczanie kluczowych informacji na wiele sposobów dobór technik wizualizacji do różnych rodzajów danych reguły pomocne w projektowaniu estetycznych wykresów technika stosowania wykresów w większym dokumencie A więc jak wygląda wykres, który decyduje o sukcesie prezentacji?