Hadoop in Action
A second edition of this book is now available.
December 2010 | 336 pages
- $44.99: pBook + eBook (includes PDF, ePub, and Kindle)
- $35.99: eBook only (includes PDF, ePub, and Kindle)
Big data can be difficult to handle using traditional databases. Apache Hadoop is an open source framework for running applications on distributed clusters, which lets it scale to huge datasets. If you need analytic information from your data, Hadoop's the way to go.
Hadoop in Action introduces the subject and teaches you how to write programs in the MapReduce style. It starts with a few easy examples and then moves quickly to show how Hadoop is used in more complex data analysis tasks. It also covers best practices and design patterns for MapReduce programming.
This book requires basic Java skills. Knowing basic statistical concepts can help with the more advanced examples.
- Introduction to MapReduce
- Examples illustrating ideas in practice
- Hadoop's Streaming API
- Other related tools, like Pig and Hive
About the Author
Chuck Lam is a Senior Engineer at RockYou! He has a PhD in pattern recognition from Stanford University.
WHAT REVIEWERS ARE SAYING
“Hadoop in Action was the only resource I used to learn Hadoop and I know enough to be dangerous :) I recommend the book for anyone who wants to get into distributed systems and get a great understanding of map reduce algorithms.”
“I really love this book, is made for normal people just trying to get something done. The streaming coverage is perty good, it's the best book for python type of people I've seen.”
—Patrick Faith, review on Amazon.com