Taming Text

How to Find, Organize, and Manipulate It
Grant S. Ingersoll, Thomas S. Morton, and Andrew L. Farris
Foreword by Liz Liddy
  • December 2012
  • ISBN 9781933988382
  • 320 pages
  • printed in black & white

Takes the mystery out of very complex processes.

From the Foreword by Liz Liddy, Dean, iSchool, Syracuse University

Taming Text is a hands-on, example-driven guide to working with unstructured text in the context of real-world applications. This book explores how to automatically organize text using approaches such as full-text search, proper name recognition, clustering, tagging, information extraction, and summarization. The book guides you through examples illustrating each of these topics, as well as the foundations upon which they are built.

about the book

There is so much text in our lives that we are practically drowning in it. Fortunately, there are innovative tools and techniques for managing unstructured information that can throw the smart developer a much-needed lifeline. You'll find them in this book.

Taming Text is a practical, example-driven guide to working with text in real applications. This book introduces you to useful techniques like full-text search, proper name recognition, clustering, tagging, information extraction, and summarization. You'll explore real use cases as you systematically absorb the foundations upon which they are built.

Written in a clear and concise style, this book avoids jargon, explaining the subject in terms you can understand without a background in statistics or natural language processing. Examples are in Java, but the concepts can be applied in any language.

what's inside

  • When to use text-taming techniques
  • Important open-source libraries like Solr and Mahout
  • How to build text-processing applications

about the authors

Grant Ingersoll is an engineer, speaker, and trainer, a Lucene committer, and a cofounder of the Mahout machine-learning project. Thomas Morton is the primary developer of OpenNLP and Maximum Entropy. Drew Farris is a technology consultant, software developer, and contributor to Mahout, Lucene, and Solr.

Text analysis and processing as it should be: clear, practical, and open source!

David Weiss, Carrot Search s.c.

Shows how to unlock and exploit information locked up in text documents.

Rick Wagner, Red Hat

Teaches text concepts with examples ... makes text search easy.

Doug Warren, Java Web Services

A great overview of tools and techniques for text processing.

Julien Nioche, DigitalPebble, Ltd.