Data Munging with Perl
David Cross
  • January 2001
  • ISBN 9781930110007
  • 304 pages

Data Munging with Perl book shows you how to process data productively with Perl. It discusses general munging techniques and how to think about data munging problems. You will learn how to decouple the various stages of munging programs, how to design data structures, how to emulate the Unix filter model, etc. If you need to work with complex data formats it will teach you how to do that and also how to build your own tools to process these formats. The book includes detailed techniques for processing HTML and XML. And, it shows you how to build your own parsers to process data of arbitrary complexity.

Table of Contents show full


about the cover illustration

Part I: Background

1. Introduction

2. Data

3. Dealing With Data

Part II: Data Munging

1. Regular Expressions

2. Unstructured Data

3. Record Oriented Data

4. Fixed Width and Binary Data

Part III: Basic Data Parsing

1. Introduction to Data Parsing

2. HTML and XML

3. Building Your Own Parsers

4. Conclusion




About the Technology

Your desktop dictionary may not include it, but 'munging' is a common term in the programmer's world. Many computing tasks require taking data from one computer system, manipulating it in some way, and passing it to another. Munging can mean manipulating raw data to achieve a final form. It can mean parsing or filtering data, or the many steps required for data recognition. Or it can be something as simple as converting hours worked plus pay rates into a salary cheque.

About the book

Translation rights for Data Munging with Perl have been granted for Japan, Brazil, and Germany. If you are interested in learning where to buy this book in a language other than English, please inquire at your local bookseller.

What's inside

  • Using CPAN modules like Number::Format, Date::Manip, Text::CSV_XS
  • Making your code concise using Perl's special variables (like $/, $? and $_)
  • Building data parsers using Parse::RecDescent
  • Processing of HTML and XML

About the reader

If you are a programmer who munges data, this book will save you time. It will teach you systematic and powerful techniques using Perl. If you are not a Perl programmer, this book may just convince you to add Perl to your repertoire.

About the author

Dave Cross is the owner and Managing Director of Magnum Solutions Ltd., an internet and database consultancy based in London. He has 12 years' experience working in the IT industry. He is an active member of the Perl community, the founder of the London Perl Mongers, and is also a regular columist for Perlmonth, the online Perl magazine.

eBook $23.50 PDF only

FREE domestic shipping on three or more pBooks