Cornell University
View map

Join this workshop to discover how to pull online data using advanced web scraping techniques through Python. Learn skills to fill out forms, traverse website pages, extract text reviews and store online numeric data into a dataset for further analysis. This workshop will provide an interactive demo for extracting hotel reviews data (e.g., numeric rating, text reviews, date of stay, etc.) from Tripadvisor and storing it into a Python Pandas DataFrame. Using the Selenium library in Python, participants will be able to interact with a webpage to pull the desired information.   

Register Here 

Learning Aims: 

  • Installation instructions for performing web scraping on personal computer (Python) 
  • Format of HTML data and how to pull desired fields 
  • Interactive demo: Tripadvisor Hotel reviews. Apply filters, traverse pages, extract text and store into a dataset for further analysis. 

Prerequisites: 

Requirements:  

Instructor: Jacob Grippin 

0 people are interested in this event

User Activity

No recent activity