Fabulous Adventures in Data Structures and Algorithms you own this product

Eric Lippert

MEAP began October 2025
Last updated March 2026
Publication in Summer 2026 (estimated)

ISBN 9781633435032
400 pages (estimated)

Included with a Manning Online subscription

printed in black & white

catalog / Software Development / Software Engineering

resources: Source code Book forum Source code on GitHub

table of content

1 Starting a fabulous adventure

1.1 Defining data structures, algorithms and complexity

1.2 An immutable linked list

1.2.1 Performance so far

1.2.2 Reversing an immutable linked list

1.3 The challenges ahead

1.4 Summary

PART 1: EXTENDING THE BASICS

2 Immutable stacks and queues

2.1 Why immutability?

2.1.1 Correctness

2.1.2 Historical preservation

2.1.3 Security: TOCTOU me about it

2.1.4 Safer multithreading

2.1.5 Memoization for time performance

2.1.6 Persistence for space performance

2.1.7 The functional programming attitude

2.2 An immutable stack

2.2.1 A covariant immutable stack

2.3 A queue, a queue, an immutable queue

2.4 Mutable wrappers

2.5 Undo and redo

2.5.1 Create an army of clones

2.5.2 Use a different data structure and the command pattern

2.5.3 Undo and redo with mutable-over-immutable data structures

2.6 The Hughes List: build it for cheap, pay for it later

2.6.1 Reverse redux

2.6.2 Currying and partial application

2.6.3 Implementing the Hughes list

2.6.4 Complexity of the Hughes list

2.6.5 Where is the data in this data structure?

2.6.6 A couple more performance considerations

2.6.7 Reverse a linked list

2.7 Summary

3 An immutable deque

3.1 An immutable deque ADT

3.2 A bad naïve implementation

3.3 A finger tree

3.3.1 Step one: the mini-deque

3.3.2 A new definition of a deque

3.4 Visualizing the data structure

3.5 Amortized performance of the deque

3.6 Are we abusing the type system?

3.7 Concatenation of deques

3.7.1 Performance after adding concatenation

3.8 Why is this called a “finger tree”?

3.9 Summary

4 Memoizing immutable quadtrees to make a better Life

4.1 The rules of Life

4.2 A typical first attempt

4.2.1 Performance of the naïve implementation

4.2.2 Same algorithm, better constant factor

4.2.3 Change tracking improves the algorithm

4.3 An immutable quadtree

4.3.1 The IQuad interface

4.3.2 Implementing the 0-quad leaf cells

4.3.3 A strategy for compressing space

4.3.4 A general-purpose memoizer

4.3.5 A memoized quadtree implementation

4.3.6 Indexing an immutable quadtree like an array

4.3.7 A few more helpful extension methods

4.4 Gosper’s algorithm

4.4.1 The base case: stepping a 2-quad produces a 1-quad

4.4.2 A first attempt at a recursive algorithm

4.4.3 The grid always shrinks

4.4.4 Is this algorithm inefficient?

4.4.5 The big insight of Gosper’s Algorithm

4.4.6 Putting it all together

4.5 Summing up

5 What’s up with you, Directed Acyclic Word Graph?

5.1 Two problems, two weak solutions

5.1.1 Hash sets are a non-solution

5.1.2 Sorted lists are… fine?

5.2 The prefix tree

5.2.1 Nodes and edges

5.2.2 The trie building algorithm

5.2.3 Using a trie as a word list

5.2.4 How much memory are we saving with a trie?

5.3 Tries are not-so-secretly finite state automata

5.4 Building a DAWG

5.4.1 What problems must we solve to build a DAWG?

5.4.2 Equivalence of nodes

5.4.3 The DAWG building algorithm.

5.4.4 How much expense does optimizing the graph as you go add?

5.5 DAWG vs trie for ENABLE

5.6 Summing Up

6 Combinatorial algorithms

6.1 The Cartesian product

6.1.1 The Cartesian product of a few sets or sequences

6.1.2 The Cartesian product of arbitrarily many sequences

6.1.3 The connection to the integers

6.2 Permutations

6.2.1 Lexicographic permutations with repetitions

6.2.2 The factorial base representation of permutation numbers

6.2.3 The Fischer-Yates shuffling algorithm

6.2.4 A recursive “change ringing” algorithm

6.2.5 Even’s “change ringing” algorithm

6.3 Combinations

6.3.1 Lexicographic combinations, with a twist

6.3.2 Counting combinations

6.3.3 The combinatorial base representation of combinations

6.4 Summary

7 Interlude one: basic category theory for programmers

7.1 Covariant endofunctors in the category of types

7.2 Contravariant endofunctors in the category of types

7.3 Summary

PART 2: SEARCHING, SOLVING, INFERRING

8 Coloring graphs with backtracking search

8.1 Coloring South America

8.2 An immutable multi-dictionary

8.3 An immutable undirected graph

8.4 Coloring simple graphs

8.5 Solving sudokus with backtracking search

8.5.1 Graph coloring is NP-complete

8.5.2 Implementing a general backtracker

8.5.3 How could we improve?

8.5.4 Backtracking and the Cartesian product

8.6 Scheduling problems are graph coloring problems

8.7 Summary

9 Greedy iterative pretty printing

9.1 The pretty printing problem

9.2 Greedy algorithms, and making change

9.3 A greedy pretty printing algorithm

9.3.1 The doc data structure

9.3.2 Visualizing a doc; how deep does it get?

9.3.3 Phase two: implementing Fits() without recursion

9.3.4 Phase two continued: implementing Pretty() without recursion

9.4 Phase one: transforming parse trees with the visitor pattern

9.4.1 Implementing the visitor pattern

9.4.2 From parse tree to doc

9.5 Summary

10 Unification of binary terms

10.1 Unifying binary terms

10.1.1 Unifying binary terms, attempt one

10.1.2 Unifying binary terms, this time with an occurs check

10.2 The performance of binary term unification

10.3 Type inference and logic programming

10.4 Summary

11 Anti-unification of binary terms

11.1 Anti-unifying binary terms

11.2 The first-order binary term anti-unification algorithm

11.2.1 Performance of Reynolds’ anti-unification algorithm

11.3 Clone detection and fix deduction

11.4 Summary

12 Interlude two: monads

12.1 What’s the value for OO programmers?

12.2 How hard can it be to divide by two?

12.3 Generalizing to arbitrary functions with Map and Bind

12.4 The sequence monad is an additive monad

12.5 Summary

PART 3: PROBABILITIES

13 A better abstraction for randomness

14 The alias algorithm for the categorical distribution

15 Conditional probability with Bayes’ Theorem

16 Interlude three: the probability monad

17 Markov processes

18 Continuous distributions with the Metropolis algorithm

Overview

2 Immutable stacks and queues

This chapter motivates and demonstrates immutable data structures through practical, production-minded examples. It argues that immutability simplifies correctness and reasoning (facts don’t go stale), preserves history, mitigates TOCTOU security risks, and enables safe sharing across threads. It also highlights performance tradeoffs: memoization can speed computation but needs thread-safe caches, and persistence allows structural sharing that saves memory when multiple versions coexist. With a functional attitude, the chapter balances theory with pragmatic C# techniques, showing how immutability can reduce bugs, ease testing, and still fit into everyday, object-oriented codebases.

Starting from an ADT-first design, an immutable stack is built over a refined linked-list approach with a singleton empty instance and a non-empty node (item plus tail), yielding O(1) Push/Peek/Pop and linear enumeration, while avoiding nulls via a “null object” pattern. A neat C# variance trick makes the stack interface covariant by moving the input-taking Push off the interface into an extension method. The immutable queue follows by composing two immutable stacks—one for enqueues, one for dequeues—preserving the invariant that any non-empty queue has a non-empty dequeue stack. This design gives O(1) time for Enqueue, Peek, IsEmpty, and O(1) amortized Dequeue, with the worst case triggered when reversing the enqueue stack into the dequeue stack; enumeration is linear in time and can require linear extra memory in the worst case.

To add convenient mutability without sacrificing safety, the chapter wraps immutable structures in small mutable shells and then layers universal undo-redo support via an UndoRedo helper that snapshots states with persistent sharing. This delivers cheap snapshots, straightforward redo (often “for free”), and reasonable memory cost—but it also exposes a caveat: repeated worst-case operations (like repeatedly triggering a large reverse between undo/redo) defeat amortization. The finale introduces the Hughes list (difference list): a list represented by a function that concatenates its content onto a supplied tail. It makes Push, Append, and Concatenate O(1) by deferring work, but materializing (ToStack), Peek, and Pop become O(n). Internally, closures form a binary graph, which is elegant but raises stack-depth concerns on materialization. As a capstone, the chapter shows how ReverseOnto enables O(1) construction of a “reversed” Hughes list whose costs are paid only when realized—underscoring the broader theme: build flexibly and cheaply now, pay precisely when you actually need the data.

Each variable in the example code references a particular immutable stack. Stacks that have the same tail can both refer to the same immutable object.

There are six different immutable queues in the example and six different immutable stacks. Since the data structures are persistent and immutable, objects can be reused across different queues.

A Hughes list is secretly a binary graph! Delegates created from lambda bodies have automatically-created closure classes with fields hl1 and hl2 that refer to the left and right lists being concatenated.

Summary

Immutable data structures are easier to reason about and often safer than the equivalent mutable data structure
Persistence makes immutable data structures efficient in space
Immutable data structures can be made efficient in time as well, though there are pitfalls to be wary of when some operations are expensive
Undo-redo operations are straightforward when program state is immutable
The Hughes list changes the performance costs of seemingly expensive operations by performing them in the optimal order in the future, but it makes some normally cheap operations such as Pop expensive. There’s no free lunch, but costs can be pushed into the future.
We still haven’t got a truly double-ended queue (or “deque”) where pushing and popping on either end of the list is cheap. In the next chapter we’ll implement a deque using finger trees.

FAQ

What are the key benefits of immutable data structures highlighted in this chapter?

They make code easier to reason about (facts don’t change), improve correctness and testability, preserve history (e.g., ledgers, source control), mitigate TOCTOU security bugs, enable safe sharing across threads, unlock memoization (time) and persistence (space via structural sharing), and encourage a functional programming mindset while remaining usable from OO languages like C#.

How is an immutable stack implemented without using null, and how is the empty stack represented?

It uses the null object pattern: an EmptyStack singleton implements the same interface as non-empty nodes. Peek/Pop on the empty stack throw, enumeration yields nothing, and construction is only via pushing onto the empty stack. This avoids null checks and accidental dereferences.

Why can’t a straightforward immutable stack interface be covariant in C#, and how does the chapter make it covariant anyway?

Covariance requires the varying type parameter to appear only in output positions. A Push(T) method takes T as input, so the interface isn’t covariant. The workaround: remove Push from the interface, mark the interface out T, provide a static factory on the implementation and an extension method Push. Call sites stay the same, and covariance (e.g., IImStack<Tiger> to IImStack<Animal>) becomes legal.

How does the immutable queue use two stacks, and what are its time complexities?

It maintains two stacks: enqueues push onto the enqueue stack; dequeues pop from the dequeue stack. When the dequeue stack empties, it reverses the enqueue stack to become the new dequeue stack, preserving the invariant “non-empty queue ⇒ non-empty dequeue stack.” Amortized Dequeue is O(1), but the worst-case call that triggers reversal is O(n) time and space. Peek/IsEmpty are O(1), enumeration is O(n) time and can be O(n) extra memory during reversal.

Why is persistence a memory win with immutable stacks/queues?

New versions share structure with old versions (e.g., shared tails). Creating “modified” versions typically allocates only a few new nodes and reuses the rest, so each snapshot adds about O(1) extra memory on average instead of copying the whole structure.

What is a “mutable-over-immutable” wrapper and why use it?

It’s a thin mutable class holding a private reference to an immutable structure. “Mutating” methods just assign the field to a new immutable version. Benefits: ergonomic, thread-friendly snapshots, easy undo/redo, and clear reasoning while preserving the advantages of immutability under the hood.

How does the generic UndoRedo helper work, and what are its costs?

It keeps two immutable stacks of states: undo and redo. Do(newState) pushes the old state to undo and clears redo. Undo/Redo move the current state between these stacks. Extra time is O(1) per operation; extra memory is O(n) in the number of undoable actions. Thanks to persistence, each stored state shares most of its memory with neighbors.

What is a Hughes list (difference list), and when is it useful?

A Hughes list represents a list as a function (delegate) that, given a tail, produces the full list by concatenation. Building with Push, Append, and Concatenate is O(1) each, regardless of size, making it ideal for assembling large results in pieces before reading them.

Where does the work happen with Hughes lists, and what are the trade-offs?

The cost is deferred to materialization: ToStack, Peek, Pop, and enumeration traverse the captured closures and run in O(n) time (and use O(n) call stack depth). Pros: constant-time building/concatenation. Cons: deferred O(n) read, potential stack overflow for very large lists due to recursion depth.

What pitfalls and edge cases should I watch for (security, threading, amortization)?

- TOCTOU: mutable inputs can be changed between check/use; immutability mitigates this risk. - Memoization: internal caches mutate; ensure thread-safe caches if methods are called concurrently. - Amortization caveat: repeating the specific worst-case pattern (e.g., dequeue-after-undo in a loop) can repeatedly trigger O(n) reversals; mitigate by accepting it, choosing a different data structure (e.g., finger-tree deque), or memoizing reversals where appropriate.

pro $24.99 per month

access to all Manning books, MEAPs, liveVideos, liveProjects, and audiobooks!
choose one free eBook per month to keep
exclusive 50% discount on all purchases
renews monthly, pause or cancel renewal anytime

lite $19.99 per month

access to all Manning books, including MEAPs!

team

5, 10 or 20 seats+ for your team - learn more

eBook

pdf, ePub, online

$55.99 $27.99

you save $28.00 (50%)

pro $24.99 per month

access to all Manning books, MEAPs, liveVideos, liveProjects, and audiobooks!
choose one free eBook per month to keep
exclusive 50% discount on all purchases
renews monthly, pause or cancel renewal anytime

lite $19.99 per month

access to all Manning books, including MEAPs!

team

5, 10 or 20 seats+ for your team - learn more

eBook

$55.99 $27.99

you save $28.00 (50%)

eBook

pdf, ePub, online

$55.99 $27.99

you save $28.00 (50%)

pro $24.99 per month

access to all Manning books, MEAPs, liveVideos, liveProjects, and audiobooks!
choose one free eBook per month to keep
exclusive 50% discount on all purchases
renews monthly, pause or cancel renewal anytime

lite $19.99 per month

access to all Manning books, including MEAPs!

team

5, 10 or 20 seats+ for your team - learn more