Webscraping in Python
In this concise 1-session workshop, we will learn about web scraping using the Requests and BeautifulSoup library in Python. Often, the data essential for research is not neatly presented as a CSV or JSON file and we need to go out and search for it ourselves. Web scraping is one way of using an automated process to collect data from websites (like, Wikipedia). This workshop will introduce you to the basic components of html, which serves as the backbone for structuring content on web pages, and will guide you in constructing your very own web scraper.
This course is intended for graduate students, faculty and staff from any field at WashU interested in collecting data from websites. Participants are expected to have a basic proficiency in Python and some experience working with the Pandas library.
This class will be fully in-person, and participants will use their own laptops. Enrollment is limited to 30.
DataLab Workshops
DataLab is a collaboration between Data Services and TRIADS, Bernard Becker Medical Library, TechDen, and DI2 to provide a breadth of workshops from the basics of understanding data to working with data tools. These workshops are open to all WashU affiliates and are held in the fall and spring semesters.