Pig in Action
Munging Big Data
M. Tim Jones
MEAP Began: May 2013
Softbound print: March 2014 (est.) | 325 pages
Order now and start reading Pig in Action today through MEAP
MEAP + Ebook only - $35.99
MEAP + Print book (includes Ebook) when available - $44.99
Table of Contents, MEAP Chapters & Resources
1 Introducing Pig and the Hadoop Platform - FREE
2 Problem Solving with Pig - AVAILABLE
3 Exploring Data with Grunt and Pig Latin - AVAILABLE
4 Getting Comfortable with Pig Data Types - AVAILABLE
5 Data Movement: Loading and Storing Data - AVAILABLE
6 Pig's Relational Operators - AVAILABLE
7 Pig's Built-in Functions - AVAILABLE
8 Extending Pig with User-defined Functions
9 Advanced Topics: Embedding Pig
10 Troubleshooting: When Things Go Awry
11 Optimizing: Getting the Most out of Pig
A Installing and Configuring Pig
B Pig Latin Grammar and Structure
C Taming Big Data with Map/Reduce
It's notoriously difficult to query Hadoop data using standard Map/Reduce programming techniques. Pig and its Pig Latin scripting language provide a SQL-like platform that simplifies query construction against data sets in Hadoop. You can use Pig both for immediate ad-hoc queries and for batch processing. Pig lowers the barrier that Map/Reduce presents and opens large-scale data processing, including experimentation on data sets, to casual users. And it stands up well under stress: Yahoo uses Pig for over half the queries it runs on the world's largest Hadoop cluster.
Pig in Action introduces Pig and the Pig Latin language while teaching you the fundamentals of big data processing. You'll explore the intersection of business and data science as you walk through practical tasks like executing standard queries, establishing automated data management processes and policies, and developing useful reports. Most importantly, you'll learn techniques to extract valuable insights from your data as you master the features of Pig.
- Pig and the Pig Latin language from the ground up
- Executing ad-hoc queries for instant results
- Automating repetitive processes and reports
- Patterns and best practices
- Working with modest data sets and massive clusters
Written for data professionals using Hadoop. No prior experience with Pig is required.
About the Author
M. Tim Jones is a firmware architect with deep Hadoop and Pig experience. He's the author of Artificial Intelligence: A Systems Approach, GNU/Linux Application Programming, AI Application Programming, BSD Sockets Programming from a Multilanguage Perspective, and more than 100 articles on a range of technical topics including Linux, open source, Hadoop, the Hadoop ecosystem, and data science and visualization.
About the Early Access Version
This Early Access version of Pig in Action enables you to receive new chapters as they are being written. You can also interact with the author to ask questions, provide feedback and errata, and help shape the final manuscript on the Author Online