Web Scraping with rvest

Dr. Colin Rundel shows how to gather data from the web with rvest. Web scraping presents unique challenges to the researcher: only rarely is the data we need available as a tidy rectangle that can be easily imported and directly analyzed. During this workshop we will discuss some of the common data formats (e.g. JSON, XML) and data sources (e.g. APIs, web scraping), as well as the tools, packages, and best practices for ingesting these data using the R programming language. This workshop includes case-specific introductory examples of purrr::map.

Prerequisite: Intro to R. All attendees are expected to have basic familiarity with R, RStudio, and the Tidyverse.

Workshop Materials

Learning resources and workshop materials are available and shareable so you can learn at your own pace.

Rfun is a DVS learning series

The 'R We Having Fun Yet?' learning series is part of the broader Data & Visualization Services workshop series. DVS offers workshops on R, Python, GIS and mapping, Research Data Management, and Visualization.

Rfun Blog

The blog features semester summaries of our workshop series, along with extra information that may help you in your practical data science work.