A Practical Introduction to Natural Language Processing

Abstract

This lecture will focus on the practical aspects of manipulating natural language for computation. It will include methods for representing natural language in computers (features, embeddings), applying classic approaches to preprocessing text (stemming, lemmatization), and incorporating context into these representations (n-grams, transformers). The lecture follows practical examples using synthetic and real text. The format is intended to be interactive; some experience running a Google Colab notebook is recommended.

Date
Sep 28, 2021 3:00 PM — 4:30 PM
Event
2021 Biostatistics Seminar Series
Location
Virtual
Avatar
Scientist

Alistair is a scientist focusing on data analysis in healthcare. This includes retrospective observational studies to generate new knowledge, predictive modeling using machine learning to prognosticate outcome, and curation of data to support the research enterprise.