Machine Learning for Text Analysis Series
In this 4-session series, we will cover the basics of how to use machine learning for text-analysis tasks in Python. Participants will learn how to create word embedding models, basic supervised models such as Random Forest and Logistic Regression, and unsupervised approaches such as k-means clustering. In the later sessions, we will cover how to use pre-trained large language models like BERT and GPT for zero-shot analysis (using the model with no additional training), and how to fine-tune a pre-trained model for your own text-analysis needs. Participants must have confidence in the basics of Python syntax, as well as familiarity with basic text-analysis methods (such as tokenization and word frequency analysis)
This class will be fully in-person, and participants will use their own laptops. Enrollment is limited to 20.
Dates of Machine Learning for Text Analysis Series (all held in Olin Library, Instruction Room 3 from 11:30 am–1 pm):
- Monday, April 7
- Wednesday, April 9
- Monday, April 14
- Wednesday, April 16
DataLab Workshops
DataLab is a collaboration between Data Services and TRIADS, Bernard Becker Medical Library, TechDen, and DI2 to provide a breadth of workshops from the basics of understanding data to working with data tools. These workshops are open to all WashU affiliates and are held in the fall and spring semesters.