about this book

Doug Cutting, Hadoop’s creator, likes to call Hadoop the kernel for big data, and I’d tend to agree. With its distributed storage and compute capabilities, Hadoop is fundamentally an enabling technology for working with huge datasets. Hadoop, to me, provides a bridge between structured (RDBMS) and unstructured (log files, XML, text) data, and allows these datasets to be easily joined together. This has evolved from traditional use cases, such as combining OLTP and log files, to more sophisticated uses, such as using Hadoop for data warehousing (exemplified by Facebook) and the field of data science, which studies and makes new discoveries about data.

This book collects a number of intermediate and advanced Hadoop examples and presents them in a problem/solution format. Each of the 85 techniques addresses a specific task you’ll face, like using Flume to move log files into Hadoop or using Mahout for predictive analytics. Each problem is explored step by step and, as you work through them, you’ll find yourself growing more comfortable with Hadoop and at home in the world of big data.

This hands-on book targets users who have some practical experience with Hadoop and understand the basic concepts of MapReduce and HDFS. Manning’s Hadoop in Action by Chuck Lam contains the necessary prerequisites to understand and apply the techniques covered in this book.

Many techniques in this book are Java-based, which means readers are expected to possess an intermediate-level knowledge of Java. An excellent text for all levels of Java users is Effective Java, Second Edition, by Joshua Bloch (Addison-Wesley, 2008).

Roadmap

This book has 13 chapters divided into five parts.

Part 1 contains a single chapter that’s the introduction to this book. It reviews Hadoop basics and looks at how to get Hadoop up and running on a single host. It wraps up with a walk-through on how to write and execute a MapReduce job.

Part 2, “Data logistics,” consists of two chapters that cover the data fundamentals: getting data in and out of Hadoop, and working with various data formats. Getting data into Hadoop is one of the first roadblocks commonly encountered when working with Hadoop, and chapter 2 is dedicated to looking at a variety of tools that work with common enterprise data sources. Chapter 3 covers how to work with ubiquitous data formats such as XML and JSON in MapReduce, before going on to look at data formats better suited to working with big data.

Part 3, “Big data patterns,” looks at techniques to help you work effectively with large volumes of data. Chapter 4 examines how to optimize MapReduce join and sort operations, and chapter 5 covers working with large numbers of small files, as well as compression. Chapter 6 looks at how to debug MapReduce performance issues, and also covers a number of techniques to help make your jobs run faster.

Part 4 is all about “Data science,” and delves into the tools and methods that help you make sense of your data. Chapter 7 covers how to represent data such as graphs for use with MapReduce, and looks at several algorithms that operate on graph data. Chapter 8 describes how R, a popular statistical and data mining platform, can be integrated with Hadoop. Chapter 9 describes how Mahout can be used in conjunction with MapReduce for massively scalable predictive analytics.

Part 5 is titled “Taming the elephant,” and examines a number of technologies that make it easier to work with MapReduce. Chapters 10 and 11 cover Hive and Pig respectively, both of which are MapReduce domain-specific languages (DSLs) geared toward providing high-level abstractions. Chapter 12 looks at Crunch and Cascading, which are Java libraries that offer their own MapReduce abstractions, and chapter 13 covers techniques for writing unit tests and debugging MapReduce problems.

The appendixes start with appendix A, which contains instructions for installing both Hadoop and all the other related technologies covered in the book. Appendix B covers low-level Hadoop ingress/egress mechanisms that the tools covered in chapter 2 leverage. Appendix C looks at how HDFS supports reads and writes, and appendix D covers a couple of MapReduce join frameworks written by the author and utilized in chapter 4.

Code conventions and downloads

All source code in listings or in text is in a fixed-width font like this to separate it from ordinary text. Code annotations accompany many of the listings, highlighting important concepts.

All of the text and examples in this book work with Hadoop 0.20.x (and 1.x), and most of the code is written using the newer org.apache.hadoop.mapreduce MapReduce APIs. The few examples that leverage the older org.apache.hadoop.mapred package are usually the result of working with a third-party library or a utility that only works with the old API.
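To make the distinction concrete, here’s a minimal sketch of a mapper written against the newer API; the class name and logic are my own illustration, not one of the book’s examples:

import java.io.IOException;

import org.apache.hadoop.io.IntWritable;
import org.apache.hadoop.io.LongWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Mapper;

// With the newer org.apache.hadoop.mapreduce API you extend the Mapper
// class and emit output through a Context object; the older
// org.apache.hadoop.mapred API instead has you implement an interface
// and write to an OutputCollector.
public class LineLengthMapper
    extends Mapper<LongWritable, Text, Text, IntWritable> {

  @Override
  protected void map(LongWritable offset, Text line, Context context)
      throws IOException, InterruptedException {
    // Emit each input line as the key, with its length as the value.
    context.write(line, new IntWritable(line.getLength()));
  }
}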

All of the code used in this book is available on GitHub at https://github.com/alexholmes/hadoop-book as well as from the publisher’s website at www.manning.com/HadoopinPractice.

Building the code depends on Java 1.6 or newer, git, and Maven 3.0 or newer. Git is a source control management system, and GitHub provides hosted git repository services. Maven is used as the build system.

You can clone (download) my GitHub repository with the following command:

$ git clone git://github.com/alexholmes/hadoop-book.git

After the sources are downloaded you can build the code:

$ cd hadoop-book
$ mvn package

This will create a Java JAR file, target/hadoop-book-1.0.0-SNAPSHOT-jar-with-dependencies.jar. Running the code is equally simple with the included bin/run.sh.

If you’re running on a CDH distribution, the scripts will run configuration-free. If you’re running on any other distribution, you’ll need to set the HADOOP_HOME environment variable to point to your Hadoop installation directory.

The bin/run.sh script takes as the first argument the fully qualified Java class name of the example, followed by any arguments expected by the example class. As an example, to run the inverted index MapReduce code from chapter 1, you’d run the following:

$ hadoop fs -mkdir /tmp
$ hadoop fs -put test-data/ch1/* /tmp/

# replace the path below with the location of your Hadoop installation
# this isn't required if you are running CDH3
export HADOOP_HOME=/usr/local/hadoop

$ bin/run.sh com.manning.hip.ch1.InvertedIndexMapReduce \
  /tmp/file1.txt /tmp/file2.txt output

The previous code won’t work if you don’t have Hadoop installed. Please refer to chapter 1 for CDH installation instructions, or appendix A for Apache installation instructions.

Third-party libraries

I use a number of third-party libraries for the sake of convenience. They’re included in the Maven-built JAR, so no extra work is required to use them. The following table lists the libraries that are in prevalent use throughout the code examples.

Common third-party libraries

Library: Apache Commons IO
Link: http://commons.apache.org/io/
Details: Helper functions for working with input and output streams in Java. You’ll make frequent use of the IOUtils class to close connections and to read the contents of files into strings.

Library: Apache Commons Lang
Link: http://commons.apache.org/lang/
Details: Helper functions for working with strings, dates, and collections. You’ll make frequent use of the StringUtils class for tokenization.
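As a quick sketch of how these two libraries typically show up in the code (the file path here is hypothetical; substitute any local text file):

import java.io.FileInputStream;
import java.io.InputStream;

import org.apache.commons.io.IOUtils;
import org.apache.commons.lang.StringUtils;

public class ThirdPartyLibsExample {
  public static void main(String[] args) throws Exception {
    // The path here is hypothetical, not one of the book's data files.
    InputStream in = new FileInputStream("/tmp/file1.txt");
    try {
      // Commons IO: slurp an entire stream into a String.
      String contents = IOUtils.toString(in);
      // Commons Lang: tokenize on whitespace, ignoring repeated separators.
      for (String token : StringUtils.split(contents)) {
        System.out.println(token);
      }
    } finally {
      // Commons IO: close the stream without throwing on failure.
      IOUtils.closeQuietly(in);
    }
  }
}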

Datasets

Throughout this book you’ll work with three datasets to provide some variety for the examples. All the datasets are small to make them easy to work with. Copies of the exact data used are available in the GitHub repository under https://github.com/alexholmes/hadoop-book/tree/master/test-data. Data that’s specific to a single chapter exists in chapter-specific subdirectories under the same GitHub location.

NASDAQ FINANCIAL STOCKS

I downloaded the NASDAQ daily exchange data from Infochimps (see http://mng.bz/xjwc). I filtered this huge dataset down to just five stocks and their start-of-year values from 2000 through 2009. The data used for this book is available on GitHub at https://github.com/alexholmes/hadoop-book/blob/master/test-data/stocks.txt.

The data is in CSV form, and the fields are in the following order:

Symbol,Date,Open,High,Low,Close,Volume,Adj Close
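For example, here’s a minimal sketch of parsing one record in this field order with plain Java; the sample line is invented to match the documented layout, not taken from stocks.txt:

public class StockLineParser {
  public static void main(String[] args) {
    // An invented line in the documented order:
    // Symbol,Date,Open,High,Low,Close,Volume,Adj Close
    String line = "AAPL,2008-01-02,199.27,200.26,192.55,194.84,38542100,194.84";
    String[] fields = line.split(",");
    String symbol = fields[0];
    String date = fields[1];
    double close = Double.parseDouble(fields[5]);
    long volume = Long.parseLong(fields[6]);
    System.out.println(symbol + " closed at " + close
        + " on " + date + " (volume " + volume + ")");
  }
}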

APACHE LOG DATA

I created a sample log file in Apache Common Log Format (see http://mng.bz/L4S3) with some fake Class E IP addresses and some dummy resources and response codes. The file is available on GitHub at https://github.com/alexholmes/hadoop-book/blob/master/test-data/apachelog.txt.
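As a hedged sketch of what parsing this format looks like, the following regular expression splits a Common Log Format line into its fields; the sample line is invented (using a fake Class E address, like the book’s data), and the pattern is a common idiom rather than code from the book:

import java.util.regex.Matcher;
import java.util.regex.Pattern;

public class CommonLogParser {
  // Common Log Format: host identd user [timestamp] "request" status bytes
  private static final Pattern CLF = Pattern.compile(
      "^(\\S+) (\\S+) (\\S+) \\[([^\\]]+)\\] \"([^\"]*)\" (\\d{3}) (\\S+)$");

  public static void main(String[] args) {
    // An invented line with a fake Class E IP address.
    String line = "240.12.0.2 - - [10/Oct/2000:13:55:36 -0700] "
        + "\"GET /index.html HTTP/1.0\" 200 2326";
    Matcher m = CLF.matcher(line);
    if (m.matches()) {
      System.out.println("host=" + m.group(1)
          + " request=" + m.group(5) + " status=" + m.group(6));
    }
  }
}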

NAMES

Names were retrieved from U.S. government census data (see http://mng.bz/LuFB), and the file is available at https://github.com/alexholmes/hadoop-book/blob/master/test-data/names.txt.

Getting help

You’ll no doubt have questions when working with Hadoop. Luckily, between the wikis and a vibrant user community, your needs should be well covered.

The main wiki is located at http://wiki.apache.org/hadoop/, and contains useful presentations, setup instructions, and troubleshooting instructions.

The Hadoop Common, HDFS, and MapReduce mailing lists can all be found on http://hadoop.apache.org/mailing_lists.html.

Search Hadoop is a useful website that indexes all of Hadoop and its ecosystem projects, and it provides full-text search capabilities: http://search-hadoop.com/.

You’ll also find many useful blogs worth subscribing to in order to keep on top of current events in Hadoop.

There are a plethora of active Hadoop Twitter users who you may want to follow, including Arun Murthy (@acmurthy), Tom White (@tom_e_white), Eric Sammer (@esammer), Doug Cutting (@cutting), and Todd Lipcon (@tlipcon). The Hadoop project itself tweets on @hadoop.

Author Online

Purchase of Hadoop in Practice includes free access to a private web forum run by Manning Publications where you can make comments about the book, ask technical questions, and receive help from the author and other users. To access and subscribe to the forum, point your web browser to www.manning.com/HadoopinPractice or www.manning.com/holmes/. These pages provide information on how to get on the forum after you are registered, what kind of help is available, and the rules of conduct on the forum.

Manning’s commitment to our readers is to provide a venue where a meaningful dialogue between individual readers and between readers and the author can take place. It’s not a commitment to any specific amount of participation on the part of the author, whose contribution to the book’s forum remains voluntary (and unpaid). We suggest you try asking him some challenging questions, lest his interest stray!

The Author Online forum and the archives of previous discussions will be accessible from the publisher’s website as long as the book is in print.

About the author

Alex Holmes is a senior software engineer with over 15 years of experience developing large-scale distributed Java systems. For the last four years he has gained expertise in Hadoop, solving big data problems across a number of projects. He has presented at JavaOne and Jazoon and is currently a technical lead at VeriSign.

Alex maintains a Hadoop-related blog at http://grepalex.com, and is on Twitter at https://twitter.com/grep_alex.

About the cover illustration

The figure on the cover of Hadoop in Practice is captioned “A young man from Kistanja, Dalmatia.” The illustration is taken from a reproduction of an album of Croatian traditional costumes from the mid-nineteenth century by Nikola Arsenovic, published by the Ethnographic Museum in Split, Croatia, in 2003. The illustrations were obtained from a helpful librarian at the Ethnographic Museum in Split, itself situated in the Roman core of the medieval center of the town: the ruins of Emperor Diocletian’s retirement palace from around AD 304. The book includes finely colored illustrations of figures from different regions of Croatia, accompanied by descriptions of the costumes and of everyday life.

Kistanja is a small town located in Bukovica, a geographical region in Croatia. It is situated in northern Dalmatia, an area rich in Roman and Venetian history. The word momak in Croatian means a bachelor, beau, or suitor—a single young man who is of courting age—and the young man on the cover, looking dapper in a crisp, white linen shirt and a colorful, embroidered vest, is clearly dressed in his finest clothes, which would be worn to church and for festive occasions—or to go calling on a young lady.

Dress codes and lifestyles have changed over the last 200 years, and the diversity by region, so rich at the time, has faded away. It is now hard to tell apart the inhabitants of different continents, let alone of different hamlets or towns separated by only a few miles. Perhaps we have traded cultural diversity for a more varied personal life, certainly for a more varied and fast-paced technological life.

Manning celebrates the inventiveness and initiative of the computer business with book covers based on the rich diversity of regional life of two centuries ago, brought back to life by illustrations from old books and collections like this one.