Welcome to Manning India!

We are pleased to be able to offer regional eBook pricing for Indian residents.
All eBook prices are discounted 40% or more!
Deep Learning with Structured Data
Mark Ryan
  • MEAP began August 2019
  • Publication in May 2020 (estimated)
  • ISBN 9781617296727
  • 325 pages (estimated)
  • printed in black & white

An absolute must for somebody wanting to do real work.

Joe Justesen
Deep learning offers the potential to identify complex patterns and relationships hidden in data of all sorts. Deep Learning with Structured Data shows you how to apply powerful deep learning analysis techniques to the kind of structured, tabular data you'll find in the relational databases that real-world businesses depend on. Filled with practical, relevant applications, this book teaches you how deep learning can augment your existing machine learning and business intelligence systems.
Table of Contents detailed table of contents

1 Why Deep Learning with Structured Data?

1.1 Overview of deep learning

1.2 Benefits and drawbacks of deep learning

1.3 Overview of the deep learning stack

1.4 Structured vs. unstructured data

1.5 Objections to deep learning with structured data

1.6 Why investigate deep learning with a structured data problem?

1.7 Code accompanying this book

1.8 What you need to know

1.9 Summary

2 Introduction to the example problem and Pandas dataframes

2.1 Development environment options for deep learning

2.2 Pandas dataframe in Python

2.3 Ingesting CSV files into Pandas dataframes

2.4 The major example: predicting streetcar delays

2.5 Why is a real-world dataset critical for learning about deep learning?

2.6 Format and scope of the input dataset

2.7 Summary

3 Preparing the Data Part 1: Exploring and cleansing the data

3.1 Ingesting XLS files into a Pandas dataframe

3.2 Using pickle to save your Pandas dataframe from one session to another

3.3 Exploring the data

3.4 Categorizing data into continuous, categorical and text categories

3.5 Problems in the dataset: gaps, errors, and guesses

3.6 How much data does deep learning need?

3.7 Summary

4 Preparing the Data Part 2: Transforming the Data

4.1 Dealing with incorrect values: Routes

4.2 Dealing with incorrect values: Vehicles

4.3 Dealing with inconsistent values: Location

4.4 Locations: going the distance

4.5 Fixing type mismatches

4.6 Dealing with rows that still contain bad data

4.7 Creating derived columns

4.8 Preparing non-numeric data to train a deep learning model

4.9 Summary

5 A Deeper Look at Deep Learning and the Deep Learning Stack

5.1 Overview of the end-to-end solution

5.2 Machine Learning vs. Deep Learning – a deeper look

5.3 What changed to make Deep Learning work?

5.4 Using Pandas to do what you would do with SQL

5.5 Introduction to Keras and TensorFlow

5.6 The power of embeddings

5.7 How the data structure defines the Keras model

5.8 Summary

6 Preparing and Building the Model

6.1 Data leakage and features that are fair game for training the model

6.2 Deriving the dataframe we will use to train the model

6.3 Transforming the dataframe into the format expected by the Keras model

6.4 Code to build a Keras model automatically based on the data structure

6.5 Examining the structure of the model

6.6 Model parameters

6.7 Summary

7 Training the Model

7.1 Reviewing the process of training a deep learning model

7.2 Selecting the train, validation and test datasets

7.3 Initial training run

7.4 Measuring the performance of your model

7.5 Early stopping – getting the best out of your training runs

7.6 Shortcuts to scoring

7.7 Saving trained models

7.8 Running a series of training experiments

7.9 Possible next steps for improving the model

7.10 Summary

8 Deploying and Maintaining the Model

8.1 Overview of model deployment

8.2 Deployment vs one-off scoring

8.3 More background on Rasa

8.4 If deployment is so important, why is it so hard?

8.5 Steps to deploy your model in Facebook Messenger with Rasa

8.6 Introduction to pipelines

8.7 Defining pipelines in the model training phase

8.8 Applying pipelines in the scoring phase

8.9 Maintaining a model after deployment

8.10 Summary

9 Recommended Next Steps

9.1 Reviewing what we have covered so far

9.2 What we could do next with the streetcar delay prediction project

9.3 Adding location details to the streetcar delay prediction project

9.4 Training our deep learning model with weather data

9.5 Adding season or time of day to the streetcar delay prediction project

9.6 Adapting the streetcar delay prediction model to an entirely new dataset: overview

9.7 Adapting the streetcar delay prediction model to an entirely new dataset: preparing the dataset and training the model

9.8 Adapting the streetcar delay prediction model to an entirely new dataset: deploying the model

9.9 Resources for additional learning

9.10 Summary


Appendix A: More on Deep Learning Environments

About the Technology

Most businesses are far more interested in accurate forecasting and fraud detection using their existing structured datasets than identifying cats in YouTube videos. Powerful deep learning techniques can efficiently extract insight from the kind of structured data collected by most businesses and organisations. Deep learning demands less feature tuning than other machine learning methods, takes less code to maintain, and can be automated to crawl your business’s databases in order to detect unanticipated patterns a human would never even notice. Thanks to the availability of cloud environments adapted to deep learning and to recent improvements in deep learning frameworks, deep learning is now a viable approach to solving problems with structured data.

About the book

Deep Learning with Structured Data shows you how to bring powerful deep learning techniques to your business’s structured data to predict trends and unlock hidden insights. In it, deep learning advocate Mark Ryan takes you through cleaning and preparing structured data for deep learning. You’ll learn the architecture of a Keras deep learning model, along with techniques for training, deploying, and maintaining your model. You’ll discover ways to get quick wins that can rapidly show whether your models are working, and techniques for monitoring your model’s ongoing functionality. Throughout, an end-to-end example using an open source transit delay dataset illustrates deep learning’s potential for unraveling problems and making predictions from large volumes of structured data.

What's inside

  • The benefits and drawbacks of deep learning
  • Organizing data for your deep learning model
  • The deep learning stack
  • Measuring performance of your models

About the reader

For readers with an intermediate knowledge of Python, Jupyter notebooks, and machine learning.

About the author

Mark Ryan has 20 years of experience leading teams delivering IBM’s premier relational database product. He holds a Master's degree in Computer Science from the University of Toronto.

Manning Early Access Program (MEAP) Read chapters as they are written, get the finished eBook as soon as it’s ready, and receive the pBook long before it's in bookstores.
MEAP combo $59.99 pBook + eBook + liveBook
MEAP eBook $47.99 pdf + ePub + kindle + liveBook
Prices displayed in rupees will be charged in USD when you check out.

placing your order...

Don't refresh or navigate away from the page.

FREE domestic shipping on three or more pBooks