Summarize News Articles with NLP, Deep Learning, and Python

Intermediate Python, Beginner TensorFlow/Keras, Basics of NLP, Basics of Deep Learning
skills learned
Convert an abstractive text summarization dataset to an extractive one, Train a deep learning model to perform extractive text summarization
Souradip Chakraborty and Sayak Paul
5 weeks · 7-10 hours per week · INTERMEDIATE
Look inside
News Media Corp needs to be quick if they want to get ahead of their competitors. Their current news frontpage is put together manually, in a time consuming process where human editors create flashcards that summarize articles. It’s too slow—so senior management wants to supercharge the process using natural language processing. To get this built, they’ve turned to you. Your challenge in this liveProject is to create an NLP model that can reduce turnaround time for news editors with an automatic text summarizer. To do this, you’ll need to prepare and process your dataset with tokenization and padding, extract meaningful statistics from it, and finally use your dataset to train a deep learning model that can speedily summarize a body text.
This project is designed for learning purposes and is not a complete, production-ready application or solution.

book resources

When you start your liveProject, you get full access to the following books for 90 days.

project authors

Souradip Chakraborty
Souradip Chakraborty is a Data Scientist at Walmart Labs, India in the field of Statistical Machine Learning. A Google Developers Expert in Machine Learning, he has published several US Patents and papers, and has been a regular speaker at various workshops and conferences.
Sayak Paul
Sayak works at PyImageSearch where he applies deep learning to solve problems in computer vision, and brings solutions to edge devices. He also provides Q&A support to PyImageSearch readers. Previously, Sayak developed projects and practice pools for DataCamp. Outside of work, Sayak enjoys writing technical articles and giving talks at developer meetups and conferences.


The liveProject is for intermediate Python programmers who know the basics of deep learning and NLP. To begin this liveProject, you should be familiar with the following topics:

  • Jupyter Notebooks
  • pandas
  • scikit-learn
  • Keras
  • Basic data manipulation and visualization
  • Rouge scoring
  • Tokenization
  • Word embeddings
  • Neural network architectures like convolutional neural networks and recurrent neural networks

you will learn

In this liveProject, you’ll master extractive text summarization, a well established field that intersects natural language processing and deep learning which is easily transferred to other NLP projects.

  • Preparing your data set with text-cleaning and text processing
  • Converting an abstractive text summarization dataset to an extractive one
  • Calculation of a Rouge score between a pair of sentences
  • Preprocessing a prepared extractive text summarization dataset
  • Preparing the train, test, and validation splits with the Python data ecosystem
  • Building deep learning models and evaluating them with TensorFlow and scikit-learn


You choose the schedule and decide how much time to invest as you build your project.
Project roadmap
Each project is divided into several achievable steps.
Get Help
While within the liveProject platform, get help from other participants and our expert mentors.
Compare with others
For each step, compare your deliverable to the solutions by the author and other participants.
book resources
Get full access to select books for 90 days. Permanent access to excerpts from Manning products are also included, as well as references to other resources.
Look inside

placing your order...

Don't refresh or navigate away from the page.
Manning Early Access Program (MEAP) In MEAP, you get immediate access to a liveProject under development, so you can participate while it is created, tested, and improved. Get started today, and pick up right where you've left off whenever we update the project!
liveProject $49.99 self-paced learning
Summarize News Articles with NLP, Deep Learning, and Python (liveProject) added to cart
continue shopping
go to cart

Prices displayed in rupees will be charged in USD when you check out.