ManifoldCF in Action
Karl D. Wright

ISBN: 9781617290190

We regret that Manning Publications will not be publishing this title.

The last draft of the manuscript is available for free. See details below.

Download PDF Manuscript


As a manuscript release, this content is unfinished work, that has not benefited from numerous steps that Manning's published books get during the publication cycles. Some of these are:

Reviews. This manuscript was not put through all of the rigorous internal editorial and external peer reviews of a typical Manning book.
Copy edit. Manning has not ensured that the spelling, grammar and style are both correct and consistent.
Technical edit. Manning cannot ensure that the code listings, screen captures, and source code are both correct and consistent--so they too should be taken with caution.
Proofreading and typesetting. You should expect errors of all kinds.

Please note that this manuscript release of ManifoldCF in Action does not include an index, among other things.


Table of Contents     Resources
Part 1 Introducing ManifoldCF
  1 Meet ManifoldCF - AVAILABLE (PDF)

Part 2 Interacting with ManifoldCF
  2 Working with the crawler UI - AVAILABLE
  3 Integration using the API - AVAILABLE
  4 Integrating with the Authority Service - AVAILABLE

Part 3 Writing connectors
  5 Using the ManifoldCF infrastructure - AVAILABLE
  6 Ground rules for writing connectors - AVAILABLE
  7 Designing and writing repository connectors - AVAILABLE
  8 Designing and writing authority connectors - AVAILABLE
  9 Designing and writing output connectors - AVAILABLE

Part 4 ManifoldCF architecture
10 Organization and Architecture - AVAILABLE
11 Data structures and resource management - AVAILABLE
12 Thread architecture - AVAILABLE


No matter how exciting a search engine might be, it's worthless unless it has data to index. ManifoldCF is an open source framework for pulling content out of a repository and sending it on to targets such as Solr via a plug-in style, connector-based architecture. ManifoldCF includes connectors for numerous commercial and open source data sources, including Documentum, SharePoint, JDBC, and RSS.

ManifoldCF in Action is a comprehensive tutorial and reference that shows you how to integrate search with enterprise-level document repositories using ManifoldCF. The book begins with an architectural overview of ManifoldCF and how it fits into your application infrastructure. After covering the basics, it dives into examples showing typical integration tasks, such as setting up connections, using ManifoldCF as an engine under the control of another enterprise system, and integrating ManifoldCF's user-based security model with a search engine.

Although ManifoldCF provides connectors for a large number of repositories and search technologies, including Solr, FileNet, Windows shares, JDBC, Documentum, Meridio, and SharePoint, there are many for which no ManifoldCF connector yet exists. As you explore the ManifoldCF architecture, you'll learn how ManifoldCF interacts with individual connectors so that you can design your own custom connectors.


This book requires a working knowledge of Java, but no prior experience with search-based applications or ManifoldCF is needed.


Karl Wright has been developing ManifoldCF since 2006, from its roots at MetaCarta well before it became an Apache project. He has extensive experience in speech recognition and compiler development, and he is the author of Borland's Turbo Assembler. Karl holds Computer Science degrees from M.I.T. and Stanford.