Winter Closure

Multiple Danforth campus libraries will be closed and inaccessible to patrons from December 21 until January 2. Read on for more details. 

Stock photo of professor assisting a student on a laptop
Back to All Events
John M. Olin Library, Instruction Room 3

Machine Learning for Text Analysis Series

In this 4-session series, we will cover the basics of how to use machine learning for text-analysis tasks in Python. Participants will learn how to create word embedding models, basic supervised models such as Random Forest and Logistic Regression, and unsupervised approaches such as k-means clustering. In the later sessions, we will cover how to use pre-trained large language models like BERT and GPT for zero-shot analysis (using the model with no additional training), and how to fine-tune a pre-trained model for your own text-analysis needs. Participants must have confidence in the basics of Python syntax, as well as familiarity with basic text-analysis methods (such as tokenization and word frequency analysis)

This class will be fully in-person, and participants will use their own laptops. Enrollment is limited to 20.

Dates of Machine Learning for Text Analysis Series (all held in Olin Library, Instruction Room 3 from 11:30 am–1 pm):
  • Monday, April 7
  • Wednesday, April 9
  • Monday, April 14
  • Wednesday, April 16

DataLab Workshops 

DataLab is a collaboration between Data Services and TRIADS, Bernard Becker Medical Library, TechDen, and DI2 to provide a breadth of workshops from the basics of understanding data to working with data tools. These workshops are open to all WashU affiliates and are held in the fall and spring semesters.