Categories
Ebooks
-
Business and economy
- Bitcoin
- Businesswoman
- Coaching
- Controlling
- E-business
- Economy
- Finances
- Stocks and investments
- Personal competence
- Computer in the office
- Communication and negotiation
- Small company
- Marketing
- Motivation
- Multimedia trainings
- Real estate
- Persuasion and NLP
- Taxes
- Social policy
- Guides
- Presentations
- Leadership
- Public Relation
- Reports, analyses
- Secret
- Social Media
- Sales
- Start-up
- Your career
- Management
- Project management
- Human Resources
-
For children
-
For youth
-
Education
-
Encyclopedias, dictionaries
-
E-press
- Architektura i wnętrza
- Biznes i Ekonomia
- Home and garden
- E-business
- Finances
- Personal finance
- Business
- Photography
- Computer science
- HR & Payroll
- Computers, Excel
- Accounts
- Culture and literature
- Scientific and academic
- Environmental protection
- Opinion-forming
- Education
- Taxes
- Travelling
- Psychology
- Religion
- Agriculture
- Book and press market
- Transport and Spedition
- Healthand beauty
-
History
-
Computer science
- Office applications
- Data bases
- Bioinformatics
- IT business
- CAD/CAM
- Digital Lifestyle
- DTP
- Electronics
- Digital photography
- Computer graphics
- Games
- Hacking
- Hardware
- IT w ekonomii
- Scientific software package
- School textbooks
- Computer basics
- Programming
- Mobile programming
- Internet servers
- Computer networks
- Start-up
- Operational systems
- Artificial intelligence
- Technology for children
- Webmastering
-
Other
-
Foreign languages
-
Culture and art
-
School reading books
-
Literature
- Antology
- Ballade
- Biographies and autobiographies
- For adults
- Dramas
- Diaries, memoirs, letters
- Epic, epopee
- Essay
- Fantasy and science fiction
- Feuilletons
- Work of fiction
- Humour and satire
- Other
- Classical
- Crime fiction
- Non-fiction
- Fiction
- Mity i legendy
- Nobelists
- Novellas
- Moral
- Okultyzm i magia
- Short stories
- Memoirs
- Travelling
- Narrative poetry
- Poetry
- Politics
- Popular science
- Novel
- Historical novel
- Prose
- Adventure
- Journalism, publicism
- Reportage novels
- Romans i literatura obyczajowa
- Sensational
- Thriller, Horror
- Interviews and memoirs
-
Natural sciences
-
Social sciences
-
School textbooks
-
Popular science and academic
- Archeology
- Bibliotekoznawstwo
- Cinema studies
- Philology
- Polish philology
- Philosophy
- Finanse i bankowość
- Geography
- Economy
- Trade. World economy
- History and archeology
- History of art and architecture
- Cultural studies
- Linguistics
- Literary studies
- Logistics
- Maths
- Medicine
- Humanities
- Pedagogy
- Educational aids
- Popular science
- Other
- Psychology
- Sociology
- Theatre studies
- Theology
- Economic theories and teachings
- Transport i spedycja
- Physical education
- Zarządzanie i marketing
-
Guides
-
Game guides
-
Professional and specialist guides
-
Law
- Health and Safety
- History
- Road Code. Driving license
- Law studies
- Healthcare
- General. Compendium of knowledge
- Academic textbooks
- Other
- Construction and local law
- Civil law
- Financial law
- Economic law
- Economic and trade law
- Criminal law
- Criminal law. Criminal offenses. Criminology
- International law
- International law
- Health care law
- Educational law
- Tax law
- Labor and social security law
- Public, constitutional and administrative law
- Family and Guardianship Code
- agricultural law
- Social law, labour law
- European Union law
- Industry
- Agricultural and environmental
- Dictionaries and encyclopedia
- Public procurement
- Management
-
Tourist guides and travel
- Africa
- Albums
- Southern America
- North and Central America
- Australia, New Zealand, Oceania
- Austria
- Asia
- Balkans
- Middle East
- Bulgary
- China
- Croatia
- The Czech Republic
- Denmark
- Egipt
- Estonia
- Europe
- France
- Mountains
- Greece
- Spain
- Holand
- Iceland
- Lithuania
- Latvia
- Mapy, Plany miast, Atlasy
- Mini travel guides
- Germany
- Norway
- Active travelling
- Poland
- Portugal
- Other
- Russia
- Romania
- Slovakia
- Slovenia
- Switzerland
- Sweden
- World
- Turkey
- Ukraine
- Hungary
- Great Britain
- Italy
-
Psychology
- Philosophy of life
- Kompetencje psychospołeczne
- Interpersonal communication
- Mindfulness
- General
- Persuasion and NLP
- Academic psychology
- Psychology of soul and mind
- Work psychology
- Relacje i związki
- Parenting and children psychology
- Problem solving
- Intellectual growth
- Secret
- Sexapeal
- Seduction
- Appearance and image
- Philosophy of life
-
Religion
-
Sport, fitness, diets
-
Technology and mechanics
Audiobooks
-
Business and economy
- Bitcoin
- Businesswoman
- Coaching
- Controlling
- E-business
- Economy
- Finances
- Stocks and investments
- Personal competence
- Communication and negotiation
- Small company
- Marketing
- Motivation
- Real estate
- Persuasion and NLP
- Taxes
- Guides
- Presentations
- Leadership
- Public Relation
- Secret
- Social Media
- Sales
- Start-up
- Your career
- Management
- Project management
- Human Resources
-
For children
-
For youth
-
Education
-
Encyclopedias, dictionaries
-
History
-
Computer science
-
Other
-
Foreign languages
-
Culture and art
-
School reading books
-
Literature
- Antology
- Ballade
- Biographies and autobiographies
- For adults
- Dramas
- Diaries, memoirs, letters
- Epic, epopee
- Essay
- Fantasy and science fiction
- Feuilletons
- Work of fiction
- Humour and satire
- Other
- Classical
- Crime fiction
- Non-fiction
- Fiction
- Mity i legendy
- Nobelists
- Novellas
- Moral
- Okultyzm i magia
- Short stories
- Memoirs
- Travelling
- Poetry
- Politics
- Popular science
- Novel
- Historical novel
- Prose
- Adventure
- Journalism, publicism
- Reportage novels
- Romans i literatura obyczajowa
- Sensational
- Thriller, Horror
- Interviews and memoirs
-
Natural sciences
-
Social sciences
-
Popular science and academic
-
Guides
-
Professional and specialist guides
-
Law
-
Tourist guides and travel
-
Psychology
- Philosophy of life
- Interpersonal communication
- Mindfulness
- General
- Persuasion and NLP
- Academic psychology
- Psychology of soul and mind
- Work psychology
- Relacje i związki
- Parenting and children psychology
- Problem solving
- Intellectual growth
- Secret
- Sexapeal
- Seduction
- Appearance and image
- Philosophy of life
-
Religion
-
Sport, fitness, diets
-
Technology and mechanics
Videocourses
-
Data bases
-
Big Data
-
Biznes, ekonomia i marketing
-
Cybersecurity
-
Data Science
-
DevOps
-
For children
-
Electronics
-
Graphics/Video/CAX
-
Games
-
Microsoft Office
-
Development tools
-
Programming
-
Personal growth
-
Computer networks
-
Operational systems
-
Software testing
-
Mobile devices
-
UX/UI
-
Web development
-
Management
Podcasts
- Ebooks
- Big data
- Data analysis
- Pig Design Patterns. Simplify Hadoop programming to create complex end-to-end Enterprise Big Data solutions with Pig
E-book details
Log in, If you're interested in the contents of the item.
Pig Design Patterns. Simplify Hadoop programming to create complex end-to-end Enterprise Big Data solutions with Pig
Ebook
- Pig Design Patterns
- Table of Contents
- Pig Design Patterns
- Credits
- Foreword
- About the Author
- Acknowledgments
- About the Reviewers
- www.PacktPub.com
- Support files, eBooks, discount offers and more
- Why Subscribe?
- Free Access for Packt account holders
- Support files, eBooks, discount offers and more
- Preface
- What this book covers
- Motivation for this book
- What you need for this book
- Who this book is for
- Conventions
- Reader feedback
- Customer support
- Downloading the example code
- Third-party libraries
- Datasets
- Errata
- Piracy
- Questions
- Downloading the example code
- What this book covers
- 1. Setting the Context for Design Patterns in Pig
- Understanding design patterns
- The scope of design patterns in Pig
- Hadoop demystified a quick reckoner
- The enterprise context
- Common challenges of distributed systems
- The advent of Hadoop
- Hadoop under the covers
- Understanding the Hadoop Distributed File System
- HDFS design goals
- Working of HDFS
- Understanding MapReduce
- Understanding how MapReduce works
- The MapReduce internals
- Pig a quick intro
- Understanding the rationale of Pig
- Understanding the relevance of Pig in the enterprise
- Working of Pig an overview
- Firing up Pig
- The use case
- Code listing
- The dataset
- Understanding Pig through the code
- Pigs extensibility
- Operators used in code
- The EXPLAIN operator
- Understanding Pig's data model
- Primitive types
- Complex types
- The relevance of schemas
- Summary
- 2. Data Ingest and Egress Patterns
- The context of data ingest and egress
- Types of data in the enterprise
- Ingest and egress patterns for multistructured data
- Considerations for log ingestion
- The Apache log ingestion pattern
- Background
- Motivation
- Use cases
- Pattern implementation
- Code snippets
- Code for the CommonLogLoader class
- Code for the CombinedLogLoader class
- Results
- Additional information
- The Custom log ingestion pattern
- Background
- Motivation
- Use cases
- Pattern implementation
- Code snippets
- Results
- Additional information
- The image ingress and egress pattern
- Background
- Motivation
- Use cases
- Pattern implementation
- The image Ingress Implementation
- The image egress implementation
- Code snippets
- The image ingress
- Pig script
- Image to a sequence UDF snippet
- The image egress
- Pig script
- Sequence to an image UDF
- The image ingress
- Results
- Additional information
- Considerations for log ingestion
- The ingress and egress patterns for the NoSQL data
- MongoDB ingress and egress patterns
- Background
- Motivation
- Use cases
- Pattern implementation
- The ingress implementation
- The egress implementation
- Code snippets
- The ingress code
- The egress code
- Results
- Additional information
- The HBase ingress and egress pattern
- Background
- Motivation
- Use cases
- Pattern implementation
- The ingress implementation
- The egress implementation
- Code snippets
- The ingress code
- The egress code
- Results
- Additional information
- MongoDB ingress and egress patterns
- The ingress and egress patterns for structured data
- The Hive ingress and egress patterns
- Background
- Motivation
- Use cases
- Pattern implementation
- The ingress implementation
- The egress implementation
- Code snippets
- The ingress Code
- Importing data using RCFile
- Importing data using HCatalog
- The egress code
- The ingress Code
- Results
- Additional information
- The Hive ingress and egress patterns
- The ingress and egress patterns for semi-structured data
- The mainframe ingestion pattern
- Background
- Motivation
- Use cases
- Pattern implementation
- Code snippets
- Results
- Additional information
- XML ingest and egress patterns
- Background
- Motivation
- Motivation for ingesting raw XML
- Motivation for ingesting binary XML
- Motivation for egression of XML
- Use cases
- Pattern implementation
- The implementation of the XML raw ingestion
- The implementation of the XML binary ingestion
- Code snippets
- The XML raw ingestion code
- The XML binary ingestion code
- The XML egress code
- Pig script
- The XML storage
- Results
- Additional information
- The mainframe ingestion pattern
- JSON ingress and egress patterns
- Background
- Motivation
- Use cases
- Pattern implementation
- The ingress implementation
- The egress implementation
- Code snippets
- The ingress code
- The code for simple JSON
- The code for nested JSON
- The egress code
- The ingress code
- Results
- Additional information
- Background
- Summary
- 3. Data Profiling Patterns
- Data profiling for Big Data
- Big Data profiling dimensions
- Sampling considerations for profiling Big Data
- Sampling support in Pig
- Rationale for using Pig in data profiling
- The data type inference pattern
- Background
- Motivation
- Use cases
- Pattern implementation
- Code snippets
- Pig script
- Java UDF
- Results
- Additional information
- The basic statistical profiling pattern
- Background
- Motivation
- Use cases
- Pattern implementation
- Code snippets
- Pig script
- Macro
- Results
- Additional information
- The pattern-matching pattern
- Background
- Motivation
- Use cases
- Pattern implementation
- Code snippets
- Pig script
- Macro
- Results
- Additional information
- The string profiling pattern
- Background
- Motivation
- Use cases
- Pattern implementation
- Code snippets
- Pig script
- Macro
- Results
- Additional information
- The unstructured text profiling pattern
- Background
- Motivation
- Use cases
- Pattern implementation
- Code snippets
- Pig script
- Java UDF for stemming
- Java UDF for generating TF-IDF
- Results
- Additional information
- Summary
- Data profiling for Big Data
- 4. Data Validation and Cleansing Patterns
- Data validation and cleansing for Big Data
- Choosing Pig for validation and cleansing
- The constraint validation and cleansing design pattern
- Background
- Motivation
- Use cases
- Pattern implementation
- Code snippets
- Results
- Additional information
- The regex validation and cleansing design pattern
- Background
- Motivation
- Use cases
- Pattern implementation
- Code snippets
- Results
- Additional information
- The corrupt data validation and cleansing design pattern
- Background
- Motivation
- Use cases
- Pattern implementation
- Code snippets
- Results
- Additional information
- The unstructured text data validation and cleansing design pattern
- Background
- Motivation
- Use cases
- Pattern implementation
- Code snippets
- Results
- Additional information
- Summary
- 5. Data Transformation Patterns
- Data transformation processes
- The structured-to-hierarchical transformation pattern
- Background
- Motivation
- Use cases
- Pattern implementation
- Code snippets
- Results
- Additional information
- The data normalization pattern
- Background
- Motivation
- Use cases
- Pattern implementation
- Code snippets
- Results
- Additional information
- The data integration pattern
- Background
- Motivation
- Use cases
- Pattern implementation
- Code snippets
- Results
- Additional information
- The aggregation pattern
- Background
- Motivation
- Use cases
- Pattern implementation
- Code snippets
- Results
- Additional information
- The data generalization pattern
- Background
- Motivation
- Use cases
- Pattern implementation
- Code snippets
- Results
- Additional information
- Summary
- 6. Understanding Data Reduction Patterns
- Data reduction a quick introduction
- Data reduction considerations for Big Data
- Dimensionality reduction the Principal Component Analysis design pattern
- Background
- Motivation
- Use cases
- Pattern implementation
- Limitations of PCA implementation
- Code snippets
- Results
- Additional information
- Numerosity reduction the histogram design pattern
- Background
- Motivation
- Use cases
- Pattern implementation
- Code snippets
- Results
- Additional information
- Numerosity reduction sampling design pattern
- Background
- Motivation
- Use cases
- Pattern implementation
- Code snippets
- Results
- Additional information
- Numerosity reduction clustering design pattern
- Background
- Motivation
- Use cases
- Pattern implementation
- Code snippets
- Results
- Additional information
- Summary
- 7. Advanced Patterns and Future Work
- The clustering pattern
- Background
- Motivation
- Use cases
- Pattern implementation
- Code snippets
- Results
- Additional information
- The topic discovery pattern
- Background
- Motivation
- Use cases
- Pattern implementation
- Code snippets
- Results
- Additional information
- The natural language processing pattern
- Background
- Motivation
- Use cases
- Pattern implementation
- Code snippets
- Results
- Additional information
- The classification pattern
- Background
- Motivation
- Use cases
- Pattern implementation
- Code snippets
- Results
- Additional information
- Future trends
- Emergence of data-driven patterns
- The emergence of solution-driven patterns
- Patterns addressing programmability constraints
- Summary
- The clustering pattern
- Index
- Title: Pig Design Patterns. Simplify Hadoop programming to create complex end-to-end Enterprise Big Data solutions with Pig
- Author: Pradeep Pasupuleti
- Original title: Pig Design Patterns. Simplify Hadoop programming to create complex end-to-end Enterprise Big Data solutions with Pig.
- ISBN: 9781783285563, 9781783285563
- Date of issue: 2014-04-17
- Format: Ebook
- Item ID: e_3bdv
- Publisher: Packt Publishing