Fabulous Adventures in Data Structures and Algorithms

Eric Lippert

MEAP began October 2025
Last updated May 2026
Publication in Fall 2026 (estimated)

ISBN 9781633435032
400 pages (estimated)

printed in black & white

Table of contents

1 Starting a fabulous adventure

1.1 Defining data structures, algorithms and complexity

1.2 An immutable linked list

1.2.1 Performance so far

1.2.2 Reversing an immutable linked list

1.3 The challenges ahead

1.4 Summary

PART 1: EXTENDING THE BASICS

2 Immutable stacks and queues

2.1 Why immutability?

2.1.1 Correctness

2.1.2 Historical preservation

2.1.3 Security: TOCTOU me about it

2.1.4 Safer multithreading

2.1.5 Memoization for time performance

2.1.6 Persistence for space performance

2.1.7 The functional programming attitude

2.2 An immutable stack

2.2.1 A covariant immutable stack

2.3 A queue, a queue, an immutable queue

2.4 Mutable wrappers

2.5 Undo and redo

2.5.1 Create an army of clones

2.5.2 Use a different data structure and the command pattern

2.5.3 Undo and redo with mutable-over-immutable data structures

2.6 The Hughes List: build it for cheap, pay for it later

2.6.1 Reverse redux

2.6.2 Currying and partial application

2.6.3 Implementing the Hughes list

2.6.4 Complexity of the Hughes list

2.6.5 Where is the data in this data structure?

2.6.6 A couple more performance considerations

2.6.7 Reverse a linked list

2.7 Summary

3 An immutable deque

3.1 An immutable deque ADT

3.2 A bad naïve implementation

3.3 A finger tree

3.3.1 Step one: the mini-deque

3.3.2 A new definition of a deque

3.4 Visualizing the data structure

3.5 Amortized performance of the deque

3.6 Are we abusing the type system?

3.7 Concatenation of deques

3.7.1 Performance after adding concatenation

3.8 Why is this called a “finger tree”?

3.9 Summary

4 Memoizing immutable quadtrees to make a better Life

4.1 The rules of Life

4.2 A typical first attempt

4.2.1 Performance of the naïve implementation

4.2.2 Same algorithm, better constant factor

4.2.3 Change tracking improves the algorithm

4.3 An immutable quadtree

4.3.1 The IQuad interface

4.3.2 Implementing the 0-quad leaf cells

4.3.3 A strategy for compressing space

4.3.4 A general-purpose memoizer

4.3.5 A memoized quadtree implementation

4.3.6 Indexing an immutable quadtree like an array

4.3.7 A few more helpful extension methods

4.4 Gosper’s algorithm

4.4.1 The base case: stepping a 2-quad produces a 1-quad

4.4.2 A first attempt at a recursive algorithm

4.4.3 The grid always shrinks

4.4.4 Is this algorithm inefficient?

4.4.5 The big insight of Gosper’s Algorithm

4.4.6 Putting it all together

4.5 Summing up

5 What’s up with you, Directed Acyclic Word Graph?

5.1 Two problems, two weak solutions

5.1.1 Hash sets are a non-solution

5.1.2 Sorted lists are… fine?

5.2 The prefix tree

5.2.1 Nodes and edges

5.2.2 The trie building algorithm

5.2.3 Using a trie as a word list

5.2.4 How much memory are we saving with a trie?

5.3 Tries are not-so-secretly finite state automata

5.4 Building a DAWG

5.4.1 What problems must we solve to build a DAWG?

5.4.2 Equivalence of nodes

5.4.3 The DAWG building algorithm

5.4.4 How much expense does optimizing the graph as you go add?

5.5 DAWG vs trie for ENABLE

5.6 Summing up

6 Combinatorial algorithms

6.1 The Cartesian product

6.1.1 The Cartesian product of a few sets or sequences

6.1.2 The Cartesian product of arbitrarily many sequences

6.1.3 The connection to the integers

6.2 Permutations

6.2.1 Lexicographic permutations with repetitions

6.2.2 The factorial base representation of permutation numbers

6.2.3 The Fisher-Yates shuffling algorithm

6.2.4 A recursive “change ringing” algorithm

6.2.5 Even’s “change ringing” algorithm

6.3 Combinations

6.3.1 Lexicographic combinations, with a twist

6.3.2 Counting combinations

6.3.3 The combinatorial base representation of combinations

6.4 Summary

7 Interlude one: basic category theory for programmers

7.1 Covariant endofunctors in the category of types

7.2 Contravariant endofunctors in the category of types

7.3 Summary

PART 2: SEARCHING, SOLVING, INFERRING

8 Coloring graphs with backtracking search

8.1 Coloring South America

8.2 An immutable multi-dictionary

8.3 An immutable undirected graph

8.4 Coloring simple graphs

8.5 Solving sudokus with backtracking search

8.5.1 Graph coloring is NP-complete

8.5.2 Implementing a general backtracker

8.5.3 How could we improve?

8.5.4 Backtracking and the Cartesian product

8.6 Scheduling problems are graph coloring problems

8.7 Summary

9 Greedy iterative pretty printing

9.1 The pretty printing problem

9.2 Greedy algorithms, and making change

9.3 A greedy pretty printing algorithm

9.3.1 The doc data structure

9.3.2 Visualizing a doc; how deep does it get?

9.3.3 Phase two: implementing Fits() without recursion

9.3.4 Phase two continued: implementing Pretty() without recursion

9.4 Phase one: transforming parse trees with the visitor pattern

9.4.1 Implementing the visitor pattern

9.4.2 From parse tree to doc

9.5 Summary

10 Unification and anti-unification

10.1 Unifying binary terms

10.1.1 Unifying binary terms, attempt one

10.1.2 Unifying binary terms, this time with an occurs check

10.2 The performance of binary term unification

10.3 Type inference and logic programming

10.4 Anti-unifying binary terms

10.5 The first-order binary term anti-unification algorithm

10.5.1 Performance of Reynolds’ anti-unification algorithm

10.6 Clone detection and fix deduction

10.7 Summary

11 Second abstract nonsense interlude: monads

11.1 What’s the value for OO programmers?

11.2 How hard can it be to divide by two?

11.3 Generalizing to arbitrary functions with Map and Bind

11.4 The sequence monad is an additive monad

11.5 Summary

PART 3: PROBABILITIES

12 A better abstraction for randomness

12.1 What are “probabilities”?

12.1.1 What are “discrete probability distributions”?

12.2 Generating uniform samples with Random

12.3 IDistribution<T> and IDiscreteDistribution<T>

12.4 Flipping an unfair coin with Bernoulli

12.5 Improving the ecosystem with extension methods

12.6 Representing unfair die rolls by adding a projection

12.7 Categorical algorithm one: make a big list

12.8 Categorical algorithm two: climb a ladder

12.9 Categorical algorithm three: rejecting rejection sampling

12.10 Categorical algorithm four: the alias algorithm

12.11 Filtering out a category

12.12 Summary

13 Conditional probability with Bayes’ Theorem

13.1 Bayes’ theorem

13.2 Likelihood functions and joint distributions

13.3 Updating priors by reasoning from effects to causes

13.4 Some applications of Bayesian reasoning

13.4.1 A few real-world examples

13.5 Unconditional likelihood functions are independent

13.6 Summary

14 Third abstract nonsense interlude: the probability monad

14.1 The requirements to be a monad

14.2 Probability distributions as additive monads

14.3 A critique

14.4 Summary

15 Sampling continuous distributions

15.1 What is a continuous probability distribution?

15.2 Sampling the continuous uniform distribution

15.3 Sampling the normal distribution

15.4 The inverse transform method

15.5 Rejection sampling

15.5.1 Clamping with rejection sampling

15.5.2 Problems with rejection sampling

15.6 Summary

16 Markov processes and the Metropolis algorithm

16.1 What is a Markov process?

16.2 Markov texts

16.3 Computing posteriors of continuous distributions

16.4 The Metropolis algorithm

16.5 Sampling posteriors with Metropolis

16.6 Summary

Appendix

Appendix A: Notes on C#

A.1 Value types and reference types

A.2 Generic types are fully instantiated at runtime

A.3 Nullable types

A.4 Type sugars and record types

A.5 Extension methods

A.6 Sequences, iterators, lambdas and LINQ

Appendix B: Further reading

Overview

7 Interlude one: basic category theory for programmers

This interlude introduces category theory as a way to think about programming at an even higher level of abstraction. The chapter frames software development as the construction and composition of abstractions, then presents category theory as the study of relationships between “stuff,” where the stuff can be anything: numbers, people, types, or even categories. A category is described as a collection of objects plus morphisms, which are arrow-like relationships between objects. Morphisms must include an identity arrow for every object and must be transitive: if you can get from one object to a second, and from the second to a third, then you can get from the first to the third.

The chapter then explains functors as functions between categories that preserve morphisms. A covariant functor maps objects from one category to another while keeping the direction and structure of the morphisms intact. When such a functor maps a category to itself, it is called an endofunctor. These ideas are connected to programming language type systems by treating types as objects in a category and assignment compatibility as the morphism: for example, a value of type Giraffe can be assigned to a variable of type Animal, so there is a morphism from Giraffe to Animal. Under this view, a generic interface such as IEnumerable<T> is covariant when the mapping from T to IEnumerable<T> preserves those assignment-compatibility arrows.

The chapter also explains contravariance through interfaces that consume values rather than produce them, such as a comparer interface, abbreviated here as IC<T>. An object that can compare any two animals can also compare any two giraffes, so an IC<Animal> can be used where an IC<Giraffe> is required. In category-theory terms, the mapping from T to IC<T> preserves the existence of morphisms but reverses their direction, making it a contravariant functor. The main takeaway is that language-design terms such as covariance and contravariance come directly from category theory, and that these abstract ideas help explain practical rules in modern type systems.

A small category with three objects, represented by the nodes, and six morphisms, represented by the arrows.

A strangely familiar small category with four objects and ten morphisms.

The function F is represented as dashed arrows mapping from one category to another.

The category of four types with the assignment compatibility morphism; the identity morphisms are elided for clarity. Note that both Giraffe and Tiger have morphisms to Animal but not to each other. A Giraffe can’t be assigned to a variable of type Tiger and a Tiger can’t be assigned to a variable of type Giraffe, but both can be assigned to a variable of type Animal or Object.

Adding four constructed generic types to our category. Notice how the structure of the morphisms of the interface types looks very similar to that of the original four types.

The mapping of a function from types to types, shown as dashed arrows.

Part of another infinite category of types, this time with a contravariant interface.

Summary

Category theory is a modern mathematical discipline that studies relationships between arbitrary objects. It’s cheekily characterized as “generalized abstract nonsense”.
A category is a collection of objects that have reflexive, transitive morphisms; we can think of a morphism as an arrow going from one object to another.
A function that maps objects of one category to another such that morphisms are preserved is a covariant functor. One that preserves the existence of morphisms but reverses their direction is a contravariant functor. If the two categories are the same, it is an endofunctor.
The assignment compatibility relation “an expression of type X may be assigned to a variable of type Y” is a common morphism on a category where types are objects.
Covariant and contravariant interfaces are called that because generic interfaces can be thought of as covariant or contravariant functors: functions from types to types that preserve assignment compatibility morphisms, either keeping or reversing their direction.
Posetal categories, which define a partial order on a set of objects, are common and particularly useful when objects are types in a programming language. Semilattices are posets that have the additional nice property that you can always find the most specific type that is assignment compatible with any two types.
The relationship between category theory and analysis of programming languages is very deep; we’ve just scratched the surface. We’ll go a little deeper in subsequent interludes.
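
The semilattice point in the summary can be made concrete with a small sketch. It assumes a hypothetical three-class hierarchy (Animal, Giraffe, Tiger) and looks only at classes, not interfaces; the helper name MostSpecificCommonBase is invented for illustration and is not taken from the book.

```csharp
using System;

// Hypothetical hierarchy, used only for illustration.
public class Animal { }
public class Giraffe : Animal { }
public class Tiger : Animal { }

public static class TypeLattice
{
    // Walks up a's base-class chain and returns the first type that a value
    // of type b can also be assigned to. Interfaces are ignored, so this is
    // a sketch of the idea rather than a full treatment of the type system.
    public static Type MostSpecificCommonBase(Type a, Type b)
    {
        for (var t = a; t != null; t = t.BaseType)
        {
            if (t.IsAssignableFrom(b))
            {
                return t;
            }
        }
        // Reached only when a has no base-class chain (an interface, say).
        return typeof(object);
    }
}
```

With these classes, TypeLattice.MostSpecificCommonBase(typeof(Giraffe), typeof(Tiger)) yields typeof(Animal), the most specific type to which both a Giraffe and a Tiger can be assigned.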

FAQ

What is category theory, and why is it jokingly called “generalized abstract nonsense”?

Category theory is a branch of mathematics that studies relationships between things rather than focusing on particular things like numbers. The “stuff” being related can be almost anything: numbers, sets, people, types in a programming language, or even categories themselves. Because it is highly abstract and general, category theorists jokingly call it “generalized abstract nonsense.”

Why is category theory useful to programmers?

Programming is largely about creating abstractions and composing smaller solutions into larger ones. Category theory studies composable relationships and operations, so its ideas can help programmers recognize design patterns, reason about type systems, and better understand how programming language features are structured.

What is a category?

A category consists of objects and morphisms. The objects can be anything. The morphisms are arrows connecting objects, representing some relationship such as “can get from here to there” or “can be assigned to.” Morphisms must obey two rules: every object has an identity morphism to itself, and morphisms are transitive.

What are morphisms?

Morphisms are arrows between objects in a category. They represent a relationship chosen for that category. For example, in a category of types, a morphism from type X to type Y might mean that an expression of type X can be assigned to a variable of type Y. Morphisms are reflexive and transitive.

What does it mean for morphisms to be reflexive and transitive?

Reflexive means every object has an identity morphism pointing to itself. If X is an object, then X → X is always a morphism. Transitive means that if A → B and B → C are morphisms, then A → C must also be a morphism.
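
The two rules can be checked mechanically on a small example. The sketch below models objects as strings and morphisms as (From, To) pairs; the class and method names are assumptions made for illustration, and the arrow set is one plausible encoding of the four-type example.

```csharp
using System;
using System.Collections.Generic;
using System.Linq;

// A tiny "category" sketch: objects are strings and a morphism is an
// ordered pair (From, To). All names here are illustrative.
public static class SmallCategory
{
    // Reflexive: every object X has an identity arrow X -> X.
    public static bool IsReflexive(
        IEnumerable<string> objects,
        ISet<(string From, string To)> arrows) =>
        objects.All(x => arrows.Contains((x, x)));

    // Transitive: whenever A -> B and B -> C exist, A -> C exists too.
    public static bool IsTransitive(ISet<(string From, string To)> arrows) =>
        arrows.All(ab =>
            arrows.Where(bc => bc.From == ab.To)
                  .All(bc => arrows.Contains((ab.From, bc.To))));

    // Giraffe and Tiger point to Animal, Animal points to Object, and the
    // identity and composite arrows are listed explicitly.
    public static HashSet<(string From, string To)> TypeArrows() => new()
    {
        ("Giraffe", "Giraffe"), ("Tiger", "Tiger"),
        ("Animal", "Animal"), ("Object", "Object"),
        ("Giraffe", "Animal"), ("Tiger", "Animal"), ("Animal", "Object"),
        ("Giraffe", "Object"), ("Tiger", "Object"),
    };
}
```

Removing the composite arrow ("Giraffe", "Object") from the set makes IsTransitive return false, because Giraffe -> Animal and Animal -> Object would then have no corresponding Giraffe -> Object.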

What is a posetal category?

A posetal category is a category whose morphisms are based on a partial ordering relationship, such as “greater than or equal to,” “is an ancestor of,” or “is assignable to.” In the chapter’s type example, Giraffe and Tiger both point to Animal, and Animal points to Object, but Giraffe and Tiger do not point to each other.

What is a functor?

A functor is a function that maps objects from one category to objects in another category while preserving morphisms. If there is a morphism X → Y in the first category, a covariant functor F ensures there is also a morphism F(X) → F(Y) in the second category.
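
As a rough illustration of "preserving morphisms", the sketch below represents a mapping on objects as a function from strings to strings and tests whether each source arrow has a matching arrow in the target. The names FunctorCheck, PreservesArrows, and ReversesArrows are hypothetical; the second method covers the contravariant case, where the arrow exists but points the other way.

```csharp
using System;
using System.Collections.Generic;
using System.Linq;

public static class FunctorCheck
{
    // Covariant: every arrow X -> Y in the source has an arrow
    // F(X) -> F(Y) in the target.
    public static bool PreservesArrows(
        Func<string, string> f,
        IEnumerable<(string From, string To)> source,
        ISet<(string From, string To)> target) =>
        source.All(a => target.Contains((f(a.From), f(a.To))));

    // Contravariant: every arrow X -> Y in the source has an arrow
    // F(Y) -> F(X) in the target, so the direction is reversed.
    public static bool ReversesArrows(
        Func<string, string> f,
        IEnumerable<(string From, string To)> source,
        ISet<(string From, string To)> target) =>
        source.All(a => target.Contains((f(a.To), f(a.From))));
}
```

For instance, with the single source arrow Giraffe -> Animal and a target containing IE<Giraffe> -> IE<Animal>, the mapping t => $"IE<{t}>" passes PreservesArrows.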

What is an endofunctor?

An endofunctor is a functor that maps objects from a category back into the same category. In programming-language type systems, a generic type constructor, such as the mapping from T to IE<T>, can be understood as an endofunctor in the category of types, provided it applies to all relevant types and preserves morphisms.

Why is IEnumerable<T> or IE<T> called covariant?

IE<T> is covariant because if a Giraffe can be assigned to an Animal, then IE<Giraffe> can also be assigned to IE<Animal>. The assignment relationship moves in the same direction before and after applying the generic type constructor. In C#, this is expressed with the out modifier, as in interface IE<out T>.
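
A minimal sketch of this rule follows. IE<T> here is a stand-in, producer-only interface with a single hypothetical member First(); the Animal, Giraffe, and Herd<T> types are likewise invented for illustration.

```csharp
public class Animal { public virtual string Name => "animal"; }
public class Giraffe : Animal { public override string Name => "giraffe"; }

// A producer-only interface, declared covariant with "out": T appears
// only in output positions.
public interface IE<out T>
{
    T First();
}

// Hypothetical implementation that holds a single item.
public sealed class Herd<T> : IE<T>
{
    private readonly T first;
    public Herd(T first) => this.first = first;
    public T First() => first;
}

public static class CovarianceDemo
{
    public static string Describe(IE<Animal> animals) => animals.First().Name;

    public static string Run()
    {
        IE<Giraffe> giraffes = new Herd<Giraffe>(new Giraffe());
        // Giraffe -> Animal, so IE<Giraffe> -> IE<Animal>: same direction.
        IE<Animal> animals = giraffes;
        return Describe(animals);
    }
}
```

Without the out modifier on T, the assignment from IE<Giraffe> to IE<Animal> would not compile. The built-in IEnumerable<T> is declared with out in the same way.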

What does contravariance mean in the category-of-types example?

Contravariance means the existence of morphisms is preserved, but their direction is reversed. For example, if Giraffe can be assigned to Animal, then an animal-comparer IC<Animal> can be assigned to a giraffe-comparer IC<Giraffe>. In C#, this is expressed with the in modifier, as in interface IC<in T>.
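
The reversed direction can be seen in a short sketch. IC<T> is modeled as a consumer-only comparer interface; the Weight property, WeightComparer, and Heavier are hypothetical names chosen for the example.

```csharp
public class Animal { public int Weight { get; set; } }
public class Giraffe : Animal { }

// A consumer-only interface, declared contravariant with "in": T appears
// only in input positions.
public interface IC<in T>
{
    int Compare(T x, T y);
}

// Compares any two animals by weight.
public sealed class WeightComparer : IC<Animal>
{
    public int Compare(Animal x, Animal y) => x.Weight.CompareTo(y.Weight);
}

public static class ContravarianceDemo
{
    public static Giraffe Heavier(Giraffe a, Giraffe b, IC<Giraffe> comparer) =>
        comparer.Compare(a, b) >= 0 ? a : b;

    public static int Run()
    {
        IC<Animal> animalComparer = new WeightComparer();
        // Giraffe -> Animal, but IC<Animal> -> IC<Giraffe>: direction reversed.
        IC<Giraffe> giraffeComparer = animalComparer;
        var a = new Giraffe { Weight = 800 };
        var b = new Giraffe { Weight = 1200 };
        return Heavier(a, b, giraffeComparer).Weight;
    }
}
```

The assignment is safe because anything able to compare two arbitrary animals can certainly compare two giraffes; the converse assignment, from IC<Giraffe> to IC<Animal>, is rejected by the compiler.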
