In this liveProject, you will create an imbalanced dataset from the IMDb movie dataset. Your goal is to make a dataset where positive reviews are the minority. You’ll then test a theory that if you oversample positive reviews, you could rebalance the training data to build and train a text classification model. You finish up by examining the model performance with a confusion table, and basic metrics such as precision, accuracy and recall.
This project is designed for learning purposes and is not a complete, production-ready application or solution.