Exploring Streaming Data Analysis
With chapters selected by Alexander Dean
  • March 2020
  • ISBN 9781617298097
  • 94 pages
Modern large-scale systems need to be able to respond nimbly to multiple streams of data. Structuring your digital business around a centralized “event firehose” that collects, stores, and processes continuous event streams is the best way to reach that goal. This approach—the Unified Log Paradigm (ULP)—along with tools like Apache Kafka and Amazon Kinesis will help get you there. And this book will get you started!

About the book

Exploring Streaming Data Analysis is a timely primer that gives you a taste of performing analytics on event streams using a Lambda function on AWS (Amazon Web Services) and deploying and testing an AWS Lambda function. You’ll learn the algorithmic side of stream processing, focusing on the what and why of streaming analysis algorithms. You’ll cover common constraints, approaches for thinking about time, and techniques for summarization. Finally, you’ll take a look at how the Kafka Streams framework uses local state to extract the maximum amount of information from event streams. This mini ebook provides the well-rounded introduction you need to get up to speed in the basics of streaming data analysis!
Table of Contents detailed table of contents


Part 1: Analytics-on-write


11.1 Back to OOPS

11.2 Building our Lambda function

11.3 Running our Lambda function

Part 2: Algorithms for data analysis

Algorithms for data analysis

5.1 Accepting constraints and relaxing

5.2 Thinking about time

5.3 Summarization techniques

Part 3: Streams and state

Streams and state

4.1 Thinking of events

4.2 Applying stateful operations to Kafka Streams

4.3 Using state stores for lookups and previously seen data

4.4 Joining streams for added insight

4.5 Timestamps in Kafka Streams


What's inside

  • “Analytics-on-write” – Chapter 11 from Event Streams in Action by Alexander Dean and Valentin Crettaz
  • “Algorithms for data analysis” – Chapter 5 from Streaming Data by Andrew G. Psaltis
  • “Streams and state” – Chapter 4 from Kafka Streams in Action by William P. Bejeck Jr.

About the author

Alexander Dean is an experienced technologist with a passion for functional programming, cloud-based architectures, and big data technologies. He is the co-founder of Snowplow Analytics, an open source event processing and analytics platform.

placing your order...

Don't refresh or navigate away from the page.
eBook $0.00 PDF only + liveBook
Exploring Streaming Data Analysis (eBook) added to cart
continue shopping
go to cart

Prices displayed in rupees will be charged in USD when you check out.

FREE domestic shipping on three or more pBooks