Hadoop in Action
Second edition of this book is now available

Chuck Lam

December 2010 | 336 pages | B&W
ISBN: 9781935182191

$44.99 pBook + eBook (includes PDF, ePub, and Kindle)
$35.99 eBook only (includes PDF, ePub, and Kindle)
Browse all our mobile format eBooks.


 Look Inside Resources Downloads


Big data can be difficult to handle using traditional databases. Apache Hadoop is a NoSQL applications framework that runs on distributed clusters. This lets it scale to huge datasets. If you need analytic information from your data, Hadoop's the way to go.

Hadoop in Action introduces the subject and teaches you how to write programs in the MapReduce style. It starts with a few easy examples and then moves quickly to show Hadoop use in more complex data analysis tasks. Included are best practices and design patterns of MapReduce programming.

This book requires basic Java skills. Knowing basic statistical concepts can help with the more advanced examples.


About the Author

Chuck Lam is a Senior Engineer at RockYou! He has a PhD in pattern recognition from Stanford University.


“Hadoop in Action was the only resource I used to learn Hadoop and I know enough to be dangerous :) I recommend the book for anyone who wants to get into distributed systems and get a great understanding of map reduce algorithms.”
Chris Coley

“I really love this book, is made for normal people just trying to get something done. The streaming coverage is perty good, it's the best book for python type of people I've seen.”
Patrick Faith, review on Amazon.com