NLP Demystified

Contents

Part One: Fundamentals and Classical Approaches

1. Introduction
published
We'll look at what makes NLP exciting, what makes it challenging, and what we'll learn in this course.
2. Tokenization
published
The usual first step in NLP is to chop our documents into smaller pieces in a process called tokenization. We'll look at the challenges involved and how to get it done.
3. Basic Preprocessing
published
Depending on our goal, we may preprocess text further. We'll cover case-folding, stop word removal, stemming, and lemmatization. We'll go over their use cases and their tradeoffs.
4. Advanced Preprocessing
published
We'll look at tagging our tokens with useful information including part-of-speech tags and named entity tags. We'll also explore different types of sentence parsing to help extract the meaning of a sentence.
5. Measuring Document Similarity With Basic Bag-of-Words
published
To perform calculations or use machine learning algorithms, we first need to turn our text into numbers. We'll take our first step here by looking at the simplest representation possible, then look at how to measure document similarity.
6. Simple Document Search With TF-IDF
published
We'll consider the shortcomings of the basic bag-of-words approach, then improve our vectors with TF-IDF and use them for document search.
7. Building Models: Finding Patterns for Fun and Profit
published
Through a high-level overview of modelling, we'll look at the different types of machine learning, how to evaluate model performance, and what to do when things go wrong.
8. Naive Bayes: Fast and Simple Text Classification
published
We'll learn how the Naive Bayes classifier works under the hood, see how accuracy can go wrong and how to use precision and recall instead, and then build a text classifier while working through problems along the way.
9. Topic Modelling: Automatically Discovering Topics in Documents
published
What do you do when you need to make sense of a pile of documents and have no other information? We'll learn one approach to this problem using Latent Dirichlet Allocation. We'll cover how it works, then build a model to discover topics present in a document and to search for similar documents.
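
As a small taste of the "turn our text into numbers" idea from module 5, here is a minimal sketch of a bag-of-words representation and cosine similarity using only the Python standard library. The function names `bag_of_words` and `cosine_similarity` are our own for illustration, and the whitespace tokenizer is a deliberate simplification; the course modules may well use different tools and a more careful pipeline.

```python
import math
from collections import Counter


def bag_of_words(text):
    """Count word occurrences using naive lowercase, whitespace tokenization."""
    return Counter(text.lower().split())


def cosine_similarity(a, b):
    """Cosine of the angle between two word-count vectors (0.0 if either is empty)."""
    # Only words present in both documents contribute to the dot product.
    dot = sum(a[word] * b[word] for word in a.keys() & b.keys())
    norm_a = math.sqrt(sum(count * count for count in a.values()))
    norm_b = math.sqrt(sum(count * count for count in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


doc_1 = bag_of_words("the cat sat")
doc_2 = bag_of_words("the cat ran")
doc_3 = bag_of_words("dogs bark loudly")

print(cosine_similarity(doc_1, doc_2))  # two of three words shared
print(cosine_similarity(doc_1, doc_3))  # no words shared
```

Documents that share more words score closer to 1, and documents with no words in common score 0. Plain counts like these treat every word as equally informative, which is one of the shortcomings that TF-IDF weighting is meant to address.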

Part Two: Deep Learning for NLP