Fabulous Adventures in Data Structures and Algorithms you own this product

Eric Lippert

MEAP began October 2025
Last updated May 2026
Publication in Fall 2026 (estimated)

ISBN 9781633435032
400 pages (estimated)

Included with a Manning Online subscription

printed in black & white

resources: Source code Book forum Source code on GitHub

table of content

1 Starting a fabulous adventure

1.1 Defining data structures, algorithms and complexity

1.2 An immutable linked list

1.2.1 Performance so far

1.2.2 Reversing an immutable linked list

1.3 The challenges ahead

1.4 Summary

PART 1: EXTENDING THE BASICS

2 Immutable stacks and queues

2.1 Why immutability?

2.1.1 Correctness

2.1.2 Historical preservation

2.1.3 Security: TOCTOU me about it

2.1.4 Safer multithreading

2.1.5 Memoization for time performance

2.1.6 Persistence for space performance

2.1.7 The functional programming attitude

2.2 An immutable stack

2.2.1 A covariant immutable stack

2.3 A queue, a queue, an immutable queue

2.4 Mutable wrappers

2.5 Undo and redo

2.5.1 Create an army of clones

2.5.2 Use a different data structure and the command pattern

2.5.3 Undo and redo with mutable-over-immutable data structures

2.6 The Hughes List: build it for cheap, pay for it later

2.6.1 Reverse redux

2.6.2 Currying and partial application

2.6.3 Implementing the Hughes list

2.6.4 Complexity of the Hughes list

2.6.5 Where is the data in this data structure?

2.6.6 A couple more performance considerations

2.6.7 Reverse a linked list

2.7 Summary

3 An immutable deque

3.1 An immutable deque ADT

3.2 A bad naïve implementation

3.3 A finger tree

3.3.1 Step one: the mini-deque

3.3.2 A new definition of a deque

3.4 Visualizing the data structure

3.5 Amortized performance of the deque

3.6 Are we abusing the type system?

3.7 Concatenation of deques

3.7.1 Performance after adding concatenation

3.8 Why is this called a “finger tree”?

3.9 Summary

4 Memoizing immutable quadtrees to make a better Life

4.1 The rules of Life

4.2 A typical first attempt

4.2.1 Performance of the naïve implementation

4.2.2 Same algorithm, better constant factor

4.2.3 Change tracking improves the algorithm

4.3 An immutable quadtree

4.3.1 The IQuad interface

4.3.2 Implementing the 0-quad leaf cells

4.3.3 A strategy for compressing space

4.3.4 A general-purpose memoizer

4.3.5 A memoized quadtree implementation

4.3.6 Indexing an immutable quadtree like an array

4.3.7 A few more helpful extension methods

4.4 Gosper’s algorithm

4.4.1 The base case: stepping a 2-quad produces a 1-quad

4.4.2 A first attempt at a recursive algorithm

4.4.3 The grid always shrinks

4.4.4 Is this algorithm inefficient?

4.4.5 The big insight of Gosper’s Algorithm

4.4.6 Putting it all together

4.5 Summing up

5 What’s up with you, Directed Acyclic Word Graph?

5.1 Two problems, two weak solutions

5.1.1 Hash sets are a non-solution

5.1.2 Sorted lists are… fine?

5.2 The prefix tree

5.2.1 Nodes and edges

5.2.2 The trie building algorithm

5.2.3 Using a trie as a word list

5.2.4 How much memory are we saving with a trie?

5.3 Tries are not-so-secretly finite state automata

5.4 Building a DAWG

5.4.1 What problems must we solve to build a DAWG?

5.4.2 Equivalence of nodes

5.4.3 The DAWG building algorithm.

5.4.4 How much expense does optimizing the graph as you go add?

5.5 DAWG vs trie for ENABLE

5.6 Summing Up

6 Combinatorial algorithms

6.1 The Cartesian product

6.1.1 The Cartesian product of a few sets or sequences

6.1.2 The Cartesian product of arbitrarily many sequences

6.1.3 The connection to the integers

6.2 Permutations

6.2.1 Lexicographic permutations with repetitions

6.2.2 The factorial base representation of permutation numbers

6.2.3 The Fischer-Yates shuffling algorithm

6.2.4 A recursive “change ringing” algorithm

6.2.5 Even’s “change ringing” algorithm

6.3 Combinations

6.3.1 Lexicographic combinations, with a twist

6.3.2 Counting combinations

6.3.3 The combinatorial base representation of combinations

6.4 Summary

7 Interlude one: basic category theory for programmers

7.1 Covariant endofunctors in the category of types

7.2 Contravariant endofunctors in the category of types

7.3 Summary

PART 2: SEARCHING, SOLVING, INFERRING

8 Coloring graphs with backtracking search

8.1 Coloring South America

8.2 An immutable multi-dictionary

8.3 An immutable undirected graph

8.4 Coloring simple graphs

8.5 Solving sudokus with backtracking search

8.5.1 Graph coloring is NP-complete

8.5.2 Implementing a general backtracker

8.5.3 How could we improve?

8.5.4 Backtracking and the Cartesian product

8.6 Scheduling problems are graph coloring problems

8.7 Summary

9 Greedy iterative pretty printing

9.1 The pretty printing problem

9.2 Greedy algorithms, and making change

9.3 A greedy pretty printing algorithm

9.3.1 The doc data structure

9.3.2 Visualizing a doc; how deep does it get?

9.3.3 Phase two: implementing Fits() without recursion

9.3.4 Phase two continued: implementing Pretty() without recursion

9.4 Phase one: transforming parse trees with the visitor pattern

9.4.1 Implementing the visitor pattern

9.4.2 From parse tree to doc

9.5 Summary

10 Unification and anti-unification

10.1 Unifying binary terms

10.1.1 Unifying binary terms, attempt one

10.1.2 Unifying binary terms, this time with an occurs check

10.2 The performance of binary term unification

10.3 Type inference and logic programming

10.4 Anti-unifying binary terms

10.5 The first-order binary term anti-unification algorithm

10.5.1 Performance of Reynolds’ anti-unification algorithm

10.6 Clone detection and fix deduction

10.7 Summary

11 Second abstract nonsense interlude: monads

11.1 What’s the value for OO programmers?

11.2 How hard can it be to divide by two?

11.3 Generalizing to arbitrary functions with Map and Bind

11.4 The sequence monad is an additive monad

11.5 Summary

PART 3: PROBABILITIES

12 A better abstraction for randomness

12.1 What are “probabilities”?

12.1.1 What are “discrete probability distributions”?

12.2 Generating uniform samples with Random

12.3 IDistribution<T> and IDiscreteDistribution<T>

12.4 Flipping an unfair coin with Bernoulli

12.5 Improving the ecosystem with extension methods

12.6 Representing unfair die rolls by adding a projection

12.7 Categorical algorithm one: make a big list

12.8 Categorical algorithm two: climb a ladder

12.9 Categorical algorithm three: rejecting rejection sampling

12.10 Categorical algorithm four: the alias algorithm

12.11 Filtering out a category

12.12 Summary

13 Conditional probability with Bayes’ Theorem

13.1 Bayes’ theorem

13.2 Likelihood functions and joint distributions

13.3 Updating priors by reasoning from effects to causes

13.4 Some applications of Bayesian reasoning

13.4.1 A few real-world examples

13.5 Unconditional likelihood functions are independent

13.6 Summary

14 Third abstract nonsense interlude: the probability monad

14.1 The requirements to be a monad

14.2 Probability distributions as additive monads

14.3 A critique

14.4 Summary

15 Sampling continuous distributions

15.1 What is a continuous probability distribution?

15.2 Sampling the continuous uniform distribution

15.3 Sampling the normal distribution

15.4 The inverse transform method

15.5 Rejection sampling

15.5.1 Clamping with rejection sampling

15.5.2 Problems with rejection sampling

15.6 Summary

16 Markov processes and the Metropolis algorithm

16.1 What is a Markov process?

16.2 Markov texts

16.3 Computing posteriors of continuous distributions

16.4 The Metropolis algorithm

16.5 Sampling posteriors with Metropolis

16.6 Summary

Appendix

Appendix A: Notes on C#

A.1 Value types and reference types

A.2 Generic types are fully instantiated at runtime

A.3 Nullable types

A.4 Type sugars and record types

A.5 Extension methods

A.6 Sequences, iterators, lambdas and LINQ

Appendix B: Further reading

Overview

5 What’s up with you, Directed Acyclic Word Graph?

This chapter introduces the problem of storing and searching large word lists efficiently, especially for operations such as checking whether a word exists and finding all words with a given prefix. It compares simple approaches first: hash sets are excellent for exact lookup but poor for prefix search, while sorted lists support both operations reasonably well through binary search but still require extra work to enumerate matches and can consume significant memory when stored as ordinary strings.

The chapter then develops tries, or prefix trees, as a better fit for prefix-based searching. A trie stores shared prefixes only once, allowing word lookup and prefix navigation in time proportional to the length of the input string rather than the size of the word list. However, tries can still waste substantial memory because many nodes, especially leaf nodes and common suffix patterns, are duplicated. This motivates viewing tries as finite state automata, where nodes are states, edges are character transitions, and accepted strings are the stored words.

The final major idea is the Directed Acyclic Word Graph, which improves on a trie by deduplicating suffixes as well as prefixes. The chapter explains an incremental algorithm for building a minimal DAWG from a sorted word list: each new word is added while the previously added word’s unique suffix is optimized by replacing equivalent nodes with canonical ones. This preserves efficient search operations while greatly reducing node count in realistic word lists. The chapter concludes that DAWGs offer an elegant and practical balance of fast lookup, prefix search, and compact storage, especially when combined with low-level byte-array representations for memory-constrained systems.

The trie for a nine-word list, laid out here left to right, and in alphabetical order from top to bottom. Node numbers are not data in the tree; they are just so we can refer to them in the discussion which follows. Double-circle nodes represent ends of words.

The trie for the word list “car”

The trie for the word list “car”, “care”.

A trie with nine words: car, care, cares, cars, fir, fire, firer, firers, firs

Deduplicating all the leaves of the trie produces an equivalent word graph.

We’ve added car, care, cares and cars. The graph is optimized for everything except “cars”. Node 6 is identical to node 5, but node 6 is in “cars” so we have not fixed it yet.

After adding “fir” the graph is optimized for every word except “fir”. State 9 could be replaced with state 5, so this graph is not optimized for the final word.

Do you notice something odd about this graph after adding “fire”?

After adding “firer” and “firers”, the nodes for “firers” are still not optimized.

We look for optimizations moving backwards from the end of the previous word added, and first discover that node 12 is redundant; node 11 should have an edge to node 5 instead.

We’ve now fully optimized the “ers” portion of “firers” and we’re ready to add “firs”.

“firs” is added, and everything is optimized except for node 13, which is redundant to node 4.

This is the smallest possible DAWG that represents those nine words.

Summing Up

Storing n words of average length m in a list such that we can generate all the words matching a particular prefix is a common problem. The number of words can be very large.
Hash sets are great at determining if a given word is contained in a set, but not at all good at generating completions given a prefix.
We can get O(m log n) search performance by using binary search on a sorted list, which is probably acceptable -- but this solution uses a surprisingly large amount of memory if we use off-the-shelf parts.
We can get O(m) search performance by using a trie to deduplicate prefixes, but the number of redundant nodes is potentially enormous.
By noticing that tries are a special case of finite state automata, we could use any one of many algorithms that minimize FSAs without changing the language they recognize. But starting with the huge, space-inefficient trie in memory and running an expensive, difficult minimization algorithm is not great; what we really want is to build an optimized DAWG one word at a time.
Efficiently detecting when two nodes are redundant and can be safely merged is a key sub-problem in graph minimization; if we are careful about not mutating a node after it is in the canonical set of optimized nodes, we can do so very efficiently.
We can make DAWGs very small in memory if we need to; this is exactly what game developers did back in the day.

FAQ

What problem motivates the use of tries and DAWGs in this chapter?

The motivating problem is storing a large word list so that common operations are fast, especially Contains—checking whether a word is legal—and StartsWith—listing all words beginning with a prefix, as in autocomplete or word-game move generation.

Why is a hash set not a good solution for prefix searching?

A HashSet<string> is excellent for Contains, because hashing a word takes O(m) time where m is the word length. However, hash sets deliberately distribute similar-looking words across different buckets, so they provide no efficient way to find all words that begin with a given prefix.

How does a sorted list support prefix search?

A sorted list can use binary search to find either the prefix itself or the first word that would come after the prefix. From that position, it scans forward and yields words while they continue to start with the prefix. This gives reasonable performance, but finding the starting point costs O(m log n), and checking each matching word’s prefix costs additional time.

What is a trie?

A trie, or prefix tree, is a tree-shaped data structure that deduplicates common prefixes. Each edge is labeled with a character, and each node represents a position between characters. A node is marked as an end-of-word node if the path from the root to that node spells a word in the list.

How are words added to a trie?

To add a word, start at the root and process each character in order. If an edge for the character already exists, follow it. Otherwise, create a new edge and node. After the final character, mark the resulting node as an end-of-word node.

What are the time complexities of trie operations?

Building a trie from n words of average length m takes O(mn), because every character must be processed. Checking whether a word exists takes O(m), since at most one edge is followed per character. Finding the node for a prefix also takes O(m), after which matching words can be enumerated from that node.

Why might a trie still use a lot of memory?

Although a trie deduplicates prefixes, it can contain many duplicate suffix structures. For example, many leaf nodes may be identical end-of-word nodes with no outgoing edges. In the ENABLE word list example, the trie had 387,881 nodes, including 111,507 identical leaves, so the object overhead can be substantial.

What is a DAWG?

A DAWG is a Directed Acyclic Word Graph. It represents a word list like a trie, but it deduplicates both common prefixes and common suffixes. Because suffixes may be shared by multiple prefixes, the structure is no longer a tree; it is an acyclic graph.

How is a DAWG related to finite state automata?

A trie or DAWG can be viewed as a finite state automaton. Nodes are states, edge labels are inputs, the root is the start state, and end-of-word nodes are accepting states. The word list is the language accepted by the automaton. Minimizing the automaton means reducing it to the fewest states while accepting the same words.

How effective is a DAWG compared with a trie for the ENABLE word list?

For the ENABLE benchmark, the trie had 387,881 nodes and 387,880 edges, while the DAWG had only 54,167 nodes and 122,975 edges. That means the DAWG used only about 14% as many nodes as the trie, mainly because it eliminated redundant leaves and shared common English suffixes such as -s, -ed, and -ing.

pro $24.99 per month

access to all Manning books, MEAPs, liveVideos, liveProjects, and audiobooks!
choose one free eBook per month to keep
exclusive 50% discount on all purchases
renews monthly, pause or cancel renewal anytime

lite $19.99 per month

access to all Manning books, including MEAPs!

team

5, 10 or 20 seats+ for your team - learn more

eBook

pdf, ePub, online

$55.99 $36.39

you save $19.60 (35%)

pro $24.99 per month

access to all Manning books, MEAPs, liveVideos, liveProjects, and audiobooks!
choose one free eBook per month to keep
exclusive 50% discount on all purchases
renews monthly, pause or cancel renewal anytime

lite $19.99 per month

access to all Manning books, including MEAPs!

team

5, 10 or 20 seats+ for your team - learn more

eBook

$55.99 $36.39

you save $19.60 (35%)

eBook

pdf, ePub, online

$55.99 $36.39

you save $19.60 (35%)

pro $24.99 per month

access to all Manning books, MEAPs, liveVideos, liveProjects, and audiobooks!
choose one free eBook per month to keep
exclusive 50% discount on all purchases
renews monthly, pause or cancel renewal anytime

lite $19.99 per month

access to all Manning books, including MEAPs!

team

5, 10 or 20 seats+ for your team - learn more