In this liveProject, you’ll step into the role of a data scientist working for an investment firm. Your company wants to make sure their investments meet European Union guidelines for environmental sustainability. That’s where you come in.
The EU taxonomy for sustainable finance is big, complex, and confusing. Your bosses need a program that saves them from searching through hundreds of pages whenever they have a query. You’ve been tasked with building a machine learning model that can take questions about the EU guidelines and return reliable answers.
Your challenges will include extracting text data from the EU taxonomy document and matching environmental questions with the corresponding paragraphs in the guidelines. You’ll then set up a pretrained transformer question-answering model, evaluate its performance, and combine it with your question-paragraph model for an end-to-end solution. When you’re done, you’ll have an interface into which you can type a sustainable finance question and receive the correct answer from the EU guidelines.
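To make the question-paragraph matching step more concrete, here is a minimal sketch of the general idea: score every paragraph against a question and return the closest one. It is not the project's actual method. The liveProject works with word and paragraph embeddings and pretrained transformers, while this sketch swaps in a plain TF-IDF bag-of-words score with cosine similarity so that it runs on the standard library alone. The function names (`tokenize`, `tfidf_vectors`, `cosine`, `best_paragraph`) are hypothetical, and the sample paragraphs are invented placeholders, not quotations from the EU taxonomy.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphabetic word tokens."""
    return re.findall(r"[a-z]+", text.lower())


def tfidf_vectors(tokenized_docs):
    """Build one sparse TF-IDF vector (a dict) per document and return the IDF table."""
    n_docs = len(tokenized_docs)
    doc_freq = Counter(term for doc in tokenized_docs for term in set(doc))
    # Smoothed IDF: rare terms get a higher weight, common terms a lower one.
    idf = {t: math.log((1 + n_docs) / (1 + df)) + 1 for t, df in doc_freq.items()}
    vectors = [{t: c * idf[t] for t, c in Counter(doc).items()} for doc in tokenized_docs]
    return vectors, idf


def cosine(a, b):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def best_paragraph(question, paragraphs):
    """Return (index, score) of the paragraph most similar to the question."""
    vectors, idf = tfidf_vectors([tokenize(p) for p in paragraphs])
    # Question terms never seen in the paragraphs carry no information, so drop them.
    q_counts = Counter(t for t in tokenize(question) if t in idf)
    q_vec = {t: c * idf[t] for t, c in q_counts.items()}
    scores = [cosine(q_vec, v) for v in vectors]
    best = max(range(len(paragraphs)), key=scores.__getitem__)
    return best, scores[best]


# Placeholder paragraphs written for this sketch (NOT text from the EU taxonomy).
paragraphs = [
    "Electricity generation from solar photovoltaic technology contributes to climate change mitigation.",
    "Forest management activities must follow a plan that preserves biodiversity.",
    "Water supply systems should limit leakage and energy consumption.",
]

index, score = best_paragraph("Does solar electricity generation support climate mitigation?", paragraphs)
print(index, round(score, 3), paragraphs[index])
```

In an end-to-end pipeline of the kind described above, the selected paragraph would then be passed, together with the question, to the question-answering model. Replacing the TF-IDF vectors with learned embeddings changes only how the vectors are built; the ranking by similarity stays the same.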
This project is designed for learning purposes and is not a complete, production-ready application or solution.
When you start your liveProject, you get full access to select Manning books for 90 days.
Matteus Tanha is the Head of Machine Learning at Alpha Quants, a consulting firm with a focus on Natural Language Processing solutions. He earned his Ph.D. specializing in machine learning applied to quantum chemistry from Carnegie Mellon University. For the past 6 years, Matteus has been developing various natural language processing solutions for companies across the sectors of finance, academia, and media.
This liveProject is for intermediate Python programmers who already know the basics of data science and machine learning. To begin this liveProject, you will need to be familiar with:
Basics of pandas
Basics of NumPy
Basics of scikit-learn
Basics of data science
Basics of machine learning
You will learn
In this liveProject, you’ll get to grips with the fundamentals of information retrieval and natural language processing that are the cornerstones of data and deep learning projects.
Deep learning with PyTorch and spaCy
Extracting text from PDFs
Evaluating machine learning models
Loading and working with pretrained models
Transformers and auto-encoders
Word and paragraph embeddings
You choose the schedule and decide how much time to invest as you build your project.
Each project is divided into several achievable steps.
While within the liveProject platform, get help from other participants and our expert mentors.
Compare with others
For each step, compare your deliverable to the solutions by the author and other participants.
Get full access to select books for 90 days. Permanent access to excerpts from Manning products is also included, as well as references to other resources.