table of content

1 Thinking in distributed systems: Models, mindsets, and mechanics

1.1 Software engineering and mental models

1.1.1 Mental models: The foundation of reasoning

1.1.2 Correct mental models

1.1.3 Complete mental models

1.2 Mental model of software systems

1.3 Different types of models

1.3.1 Different models describing the same aspects

1.3.2 Different models describing different aspects of a system

1.4 Thinking about distributed systems

1.4.1 Correctness

1.4.2 Scalability and reliability

1.4.3 Responsiveness

1.5 Two big ideas

1.5.1 Systems of systems

1.5.2 Global view vs. local view

1.6 Distributed Systems Incorporated

1.7 Navigating complexity

1.7.1 Simple yet complex

1.7.2 Emergent behavior

1.7.3 Changing perspective

1.7.4 Think globally; act locally

1.8 Thinking above the code

2 System models, order, and time

2.1 System models

2.1.1 Theory and practice

2.1.2 Synchronous distributed systems

2.1.3 Asynchronous distributed systems

2.1.4 Partially synchronous systems

2.1.5 Component and network behavior

2.1.6 Realistic system models

2.2 Order and time

2.2.1 The happened-before relationship

2.2.2 Time and clocks

2.2.3 Physical time and physical clocks

2.2.4 Logical time and logical clocks

2.2.5 Physical clocks vs. logical clocks

3 Failure tolerance

3.1 In theory

3.2 Types of failure tolerance

3.2.1 Masking failure tolerance

3.2.2 Nonmasking failure tolerance

3.2.3 Fail-safe failure tolerance

3.2.4 None of the above

3.3 In practice

3.3.1 System model

3.3.2 Failure handling

3.3.3 Failure classification

3.3.4 Failure detection

3.3.5 Failure mitigation

3.3.6 Putting everything together

4 Message delivery and processing

4.1 Exchanging messages

4.2 The uncertainty principle of message delivery and processing

4.2.1 Before sending the request

4.2.2 After sending the request and before receiving a response

4.2.3 After receiving a response

4.3 Silence and chatter

4.4 Exactly-once processing semantics

4.5 Idempotence

4.6 Case study: Charging a credit card

5 Transactions

5.1 Abstractions

5.2 The magic of transactions

5.2.1 Concurrency

5.2.2 Failure

5.3 The model of transactions

5.3.1 Correctness

5.3.2 Serializability

5.3.3 Completeness

5.3.4 Application-level abort

5.3.5 Platform-level abort

6 Distributed transactions

6.1 Atomic commitment: From a single RM to multiple RMs

6.1.1 Transaction on a single RM

6.1.2 Transaction on multiple RMs

6.1.3 Blocking and nonblocking

6.2 The essence of distributed transactions

6.3 Two-Phase Commit protocol

6.3.1 In the absence of failure

6.3.2 In the presence of failure

6.3.3 Improvement

7 Partitioning

7.1 Encyclopedias and volumes

7.2 Thinking in partitions

7.3 The mechanics of partitioning and balancing

7.4 (Re)partitioning

7.4.1 Types of partitioning

7.4.2 Data item to partition assignment strategies

7.5 Common item-based assignment strategies

7.5.1 Range partitioning

7.5.2 Hash partitioning

7.6 Repartitioning

7.6.1 Range partitioning

7.6.2 Hash partitioning

7.7 Consistent hashing

7.8 (Re)balancing and overpartitioning

8 Replication

8.1 Redundancy

8.2 Thinking about replication and consistency

8.3 Replication

8.4 The mechanics of replication

8.4.1 System model

8.4.2 Replication lag

8.4.3 Synchronous vs. asynchronous replication

8.4.4 State-based vs. log-based replication

8.4.5 Single-leader, multileader, and leaderless systems

9 Consistency

9.1 Consistency models

9.1.1 Common consistency models

9.1.2 Virtues and limitations

9.2 Linearizability

9.2.1 Queue and stack

9.2.2 Formal definition of linearizability

9.3 Eventual consistency

9.3.1 The shopping cart

9.3.2 Variants of eventual consistency

9.3.3 Implementation

9.4 Consistency, availability, and partition tolerance

9.4.1 History

9.4.2 Conjecture vs. theorem

9.4.3 CAP theorem

10 Distributed consensus

10.1 The challenge of reaching agreement

10.2 System model

10.3 State machine replication

10.4 The origin—and irony—of consensus

10.5 Implementing consensus

10.5.1 Leader-based consensus

10.5.2 Quorum-based consensus

10.5.3 Combining leader and quorum

10.6 Raft

10.6.1 The log

10.6.2 Terms

10.6.3 Leader Election protocol

10.6.4 Log Replication protocol

10.6.5 State machine safety

10.7 Raft puzzles

10.7.1 Puzzle 1

10.7.2 Puzzle 2

10.7.3 Puzzle 3

11 Durable executions

11.1 The pitfalls of partial executions

11.2 System model

11.2.1 Process definition

11.2.2 Process execution

11.3 The concept of failure-transparent recovery

11.4 Strategies of failure-transparent recovery

11.4.1 Restart

11.4.2 Resume

11.5 Implementation of failure-transparent recovery

11.5.1 Application-level implementation: Sagas

11.5.2 Platform-level implementation: Durable execution

12 Cloud and services

12.1 From proactive to reactive

12.2 Cloud computing

12.3 Cloud-native computing

12.4 Serverless computing

12.4.1 Traditional

12.4.2 Serverless

12.4.3 Cold path vs. hot path

12.5 Service

12.5.1 Global view vs. local view

12.5.2 Example recommendation service

12.6 Final thoughts

Overview

5 Transactions

Transactions are presented as one of software engineering’s most transformative abstractions: although born in databases, they were embraced by distributed systems because they let developers act as if concurrency and failure do not exist. The chapter reframes the common ACID view through the lens of correctness and completeness, positioning transactions as the API that translates higher-level, concurrency- and failure-agnostic intent into lower-level, concurrency- and failure-aware execution. This builds on a broader discussion of abstractions as domain-shaping tools that reduce complex, “ugly” interfaces into simpler, “beautiful” ones, and highlights the recursive layering from hardware to operating systems to databases—where transactions and tables become the clean interface atop messy realities.

The “magic” of transactions is illustrated with a money-transfer example that lacks visible guards for races or failures, yet still behaves correctly and completely because of transactional guarantees. Correctness is ensured by isolation via serializability: concurrent histories are valid only if they are equivalent to some serial order, preventing anomalies even when operations interleave. The chapter abstracts databases as collections of named objects and defines programs (design-time intent) versus transactions (runtime executions), introducing histories as interleavings of actions across transactions. Consistency is clarified as an application-level predicate (e.g., with or without overdraft via constraints), while atomicity, isolation, and durability are platform-level guarantees that the database system enforces regardless of domain semantics.

Completeness—doing all or nothing despite failures—is achieved through recovery mechanisms, sketched via Undo/Redo with a focus on Undo logging. Before each operation, an inverse is recorded in a transaction undo log, enabling two valid traces: a commit trace that applies all operations, and an abort trace that undoes partial progress. The chapter distinguishes application-level aborts from platform-level aborts (crashes and restarts), explaining recovery by scanning the log for uncommitted work and rolling it back. To be safe across crashes during both forward and rollback phases, undo operations must be restartable (noop + idempotent). Altogether, by shouldering concurrency control and failure recovery, transactions deliver the promised developer experience: concurrency- and failure-agnostic definitions that execute correctly, completely, and durably.

The transformative nature of abstractions, according to Tannenbaum and Bos

The transformative nature of abstractions, according to Helland

Equivalence between higher level and lower level

Possible effects of concurrency

Possible effects of failure

Definition and execution

The database system translates the failure-agnostic definition into a failure-aware and failure-tolerant execution.

Temporary consistency violation

Good and bad histories

Serializability

Summary

Abstractions are illusions, turning the ugly into the beautiful.
Abstractions are layered; an entity using the abstractions of a higher level is reduced into entities using the abstractions of the lower level.
Transactions are an abstraction that allows an application to pretend that concurrency and failure do not exist.
The database system translates the concurrency and failure-agnostic definitions into a concurrency and failure-aware and -tolerant executions.
Transactions are commonly introduced and discussed in the context of ACID.
By embracing a holistic viewpoint, we recognize that transactions create an encompassing world where the challenges of concurrency and failure become virtually non-existent.

FAQ

Why are transactions discussed in a distributed systems book?

Although born in databases, transactions were quickly adopted in distributed systems because they deliver a great developer experience. They hide the complexity of concurrency, partial synchrony, component failures, and unreliable networks, letting you build distributed applications (like web apps) as if those problems didn’t exist.

What do transactions guarantee in this chapter’s framing?

Transactions guarantee correctness and completeness. Practically, they let you pretend concurrency and failures do not exist by ensuring executions behave as if operations ran without interference and either apply fully or not at all.

How does this chapter’s view differ from the classic ACID-first explanation?

Instead of treating Atomicity, Consistency, Isolation, and Durability as separate end-goals, the chapter explains transactions through the unified lens of correctness (handling concurrency) and completeness (handling failures), with ACID as the underlying principles that enable those guarantees.

What are the ACID properties and which are application- vs platform-level?

- Atomicity: effects are all-or-nothing. - Consistency: moves the database from one consistent state to another (defined by application rules). - Isolation: executes as if it’s the only transaction (no interference). - Durability: once committed, effects persist. Consistency is application-level (depends on your invariants, like allowing or forbidding overdrafts). Atomicity, Isolation, and Durability are platform-level guarantees provided by the database.

What problems show up in the money transfer example, and how do transactions address them?

- Concurrency: interleaving read–write steps across concurrent transfers can produce incorrect balances (lost updates). - Failure: a crash between debiting the source and crediting the target yields a partial transfer. Transactions address both: isolation eliminates concurrency anomalies (correctness) and atomic commit/rollback eliminates partial effects (completeness).

What is serializability and why is it the correctness criterion here?

Serializability is a predicate on execution histories: a concurrent history is valid if it is equivalent to some serial order of the same transactions. It guarantees that even with concurrency, results and final state match a sequential execution, preventing anomalies.

How is serializability implemented at a high level?

The simple (but slow) way is a single lock over all objects, forcing sequential execution. More efficient approaches use finer-grained locking (and related techniques) to allow maximal concurrency while preserving serial (sequential) semantics.

How are programs and transactions modeled in this chapter?

A program (definition) becomes a transaction (execution). Operations on names become actions on objects. A transaction is modeled as a trace of triples ⟨t, a, o⟩. A history is an interleaving of such triples across multiple transactions, used to reason about correctness.

How do databases ensure completeness with Undo logging?

Before performing each operation, the database records its inverse in a Transaction Undo Log. - Commit trace: execute the regular operations (no failure). - Abort trace: execute some operations, then apply their recorded inverses in reverse order to restore the prior state. This enables recovery when failures occur.

What’s the difference between application-level and platform-level aborts, and why must Undo be restartable?

- Application-level abort: explicit abort or an implicit one (e.g., a constraint violation). The system applies the recorded undo operations. - Platform-level abort: a crash and restart at an arbitrary point. On recovery, the system scans the Undo Log to roll back any uncommitted transactions. Undo must be restartable—noop (applying op then undo equals just undo) and idempotent (repeating undo has no extra effect)—so recovery remains correct across repeated restarts.

pro $24.99 per month

access to all Manning books, MEAPs, liveVideos, liveProjects, and audiobooks!
choose one free eBook per month to keep
exclusive 50% discount on all purchases
renews monthly, pause or cancel renewal anytime

lite $19.99 per month

access to all Manning books, including MEAPs!

team

5, 10 or 20 seats+ for your team - learn more

eBook

pdf, ePub, online

$54.99 $41.24

you save $13.75 (25%)

include audio $24.99 $18.74

pro $24.99 per month

access to all Manning books, MEAPs, liveVideos, liveProjects, and audiobooks!
choose one free eBook per month to keep
exclusive 50% discount on all purchases
renews monthly, pause or cancel renewal anytime

lite $19.99 per month

access to all Manning books, including MEAPs!

team

5, 10 or 20 seats+ for your team - learn more

eBook

$54.99 $41.24

you save $13.75 (25%)

include audio $24.99 $18.74

eBook

pdf, ePub, online

$54.99 $41.24

you save $13.75 (25%)

include audio $24.99 $18.74

pro $24.99 per month

access to all Manning books, MEAPs, liveVideos, liveProjects, and audiobooks!
choose one free eBook per month to keep
exclusive 50% discount on all purchases
renews monthly, pause or cancel renewal anytime

lite $19.99 per month

access to all Manning books, including MEAPs!

team

5, 10 or 20 seats+ for your team - learn more