contents


foreword
letter to the HBase community
preface
acknowledgments
about this book
about the authors
about the cover illustration

Part 1 HBase fundamentals

1 Introducing HBase
1.1 Data-management systems: a crash course
1.2 HBase use cases and success stories
1.3 Hello HBase
1.4 Summary
2 Getting started
2.1 Starting from scratch
2.2 Data manipulation
2.3 Data coordinates
2.4 Putting it all together
2.5 Data models
2.6 Table scans
2.7 Atomic operations
2.8 ACID semantics
2.9 Summary
3 Distributed HBase, HDFS, and MapReduce
3.1 A case for MapReduce
3.2 An overview of Hadoop MapReduce
3.3 HBase in distributed mode
3.4 HBase and MapReduce
3.5 Putting it all together
3.6 Availability and reliability at scale
3.7 Summary

Part 2 Advanced concepts

4 HBase table design
4.1 How to approach schema design
4.2 De-normalization is the word in HBase land
4.3 Heterogeneous data in the same table
4.4 Rowkey design strategies
4.5 I/O considerations
4.6 From relational to non-relational
4.7 Advanced column family configurations
4.8 Filtering data
4.9 Summary
5 Extending HBase with coprocessors
5.1 The two kinds of coprocessors
5.2 Implementing an observer
5.3 Implementing an endpoint
5.4 Summary
6 Alternative HBase clients
6.1 Scripting the HBase shell from UNIX
6.2 Programming the HBase shell using JRuby
6.3 HBase over REST
6.4 Using the HBase Thrift gateway from Python
6.5 Asynchbase: an alternative Java HBase client
6.6 Summary

Part 3 Example applications

7 HBase by example: OpenTSDB
7.1 An overview of OpenTSDB
7.2 Designing an HBase application
7.3 Implementing an HBase application
7.4 Summary
8 Scaling GIS on HBase
8.1 Working with geographic data
8.2 Designing a spatial index
8.3 Implementing the nearest-neighbors query
8.4 Pushing work server-side
8.5 Summary

Part 4 Operationalizing HBase

9 Deploying HBase
9.1 Planning your cluster
9.2 Deploying software
9.3 Distributions
9.4 Configuration
9.5 Managing the daemons
9.6 Summary
10 Operations
10.1 Monitoring your cluster
10.2 Performance of your HBase cluster
10.3 Cluster management
10.4 Backup and replication
10.5 Summary

appendix A Exploring the HBase system
appendix B More about the workings of HDFS
index