This workshop provides an introduction to working with census data in R using the tidycensus package.
Since 1790, the US Census has been THE source of data about American people, providing valuable insights to social scientists and humanists. Mapping these data by census geographies adds more value by allowing researchers to explore spatial trends and outliers. This workshop will introduce three key packages for streamlining census data workflows in R: tigris, tidycensus, and tmap. Participants will learn how to download census tabular data for one or more geographic aggregation units or years, download the associated census geographic data, and then join these data for analysis and mapping.
Specifically, we will:
- Describe the primary Census data products
- Introduce the R tidycensus package for working with Census data
- Use that package to fetch decennial and ACS census data
- Use those packages to fetch census geographic boundary files
- Make maps of census data, coloring the map features by their data values
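The steps above can be sketched roughly as follows. This is a minimal illustration, not the workshop's own code: the variable (B19013_001, ACS median household income), the state (California), and the year (2019) are assumptions chosen for the example, and it assumes a Census API key has already been set up.

```r
library(tidycensus)
library(tigris)
library(dplyr)
library(sf)
library(ggplot2)

# 1. Tabular data: ACS 5-year median household income by county.
#    Variable, state, and year are illustrative choices.
income <- get_acs(
  geography = "county",
  variables = c(median_income = "B19013_001"),
  state = "CA",
  year = 2019,
  survey = "acs5"
)

# 2. Geographic data: county boundaries (generalized cartographic version).
ca_counties <- counties(state = "CA", cb = TRUE, year = 2019)

# 3. Join the table to the boundaries on the shared GEOID column.
income_map <- left_join(ca_counties, income, by = "GEOID")

# 4. Map the result, coloring each county by its estimate.
ggplot(income_map) +
  geom_sf(aes(fill = estimate)) +
  scale_fill_viridis_c(name = "Median household income ($)")
```

Keeping the tabular and geographic downloads separate makes the join step explicit; tidycensus can also return geometry directly via `geometry = TRUE` in `get_acs()`.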
R experience equivalent to the D-Lab R Fundamentals workshop series is required to follow along with the tutorial. Basic knowledge of census data and geospatial data will be very helpful.
Bring a laptop with R, RStudio, and the R packages listed below installed.
Here are the R packages you will need for this workshop:
- tidyverse
- ggplot2
- sf
- tidycensus
- tigris
- mapview
- leaflet
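One possible way to install whichever of these packages are missing and then load them all is sketched below. The object names `pkgs` and `to_install` are only illustrative.

```r
# Packages named in the list above
pkgs <- c("tidyverse", "ggplot2", "sf", "tidycensus", "tigris", "mapview", "leaflet")

# Install only the packages that are not already present
to_install <- setdiff(pkgs, rownames(installed.packages()))
if (length(to_install) > 0) {
  install.packages(to_install)
}

# Load every package; character.only = TRUE lets library() accept strings
invisible(lapply(pkgs, library, character.only = TRUE))
```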
If you have a Berkeley CalNet ID, you can run these lessons on UC Berkeley's DataHub by clicking the DataHub link. By using this link, you can save your work and come back to it at any time. When you want to return to your saved work, just go straight to DataHub (https://datahub.berkeley.edu), sign in, and click on the Census-Data-in-R folder.
If you don't have a CalNet ID, you can still access the workshop through the link provided for users without one. This link takes a moment to load, so patience is required!
The tidycensus package, like any R package that accesses the Census APIs, requires you to first get a Census API key. If you don't have one yet, get one now (it just takes a minute): https://api.census.gov/data/key_signup.html
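Once you have a key, a typical way to register it with tidycensus looks like the sketch below. The key string is a placeholder, and storing it with `install = TRUE` is one option rather than a requirement.

```r
library(tidycensus)

# Replace the placeholder with your own key.
# install = TRUE saves the key to your .Renviron file for future sessions.
census_api_key("YOUR_API_KEY_HERE", install = TRUE)

# Reload .Renviron so the key is available in the current session
readRenviron("~/.Renviron")

# The key is stored in the CENSUS_API_KEY environment variable
Sys.getenv("CENSUS_API_KEY")
```

If you would rather not write to .Renviron, calling `census_api_key()` with only the key sets it for the current session.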
This workshop is mainly about using R to access US Census data in dynamic and efficient ways. We also use data-centric mapping techniques to explore our queries over space. If geospatial topics interest you and you'd like to study more, we recommend these resources as good starting points:
- Geospatial Fundamentals in R - D-Lab workshop focusing on mapping and geospatial analysis in R.
- Leaflet for R - Well-written introduction to mapping with the leaflet package in R.
- Geocomputation with R - This textbook (Lovelace, Nowosad, and Muenchow, 2019) is an excellent resource for getting up and running.
- R Shiny Tutorials - shiny is an R package that makes it easy to build interactive web apps straight from R, including maps!
- Patty Frontiera
- Irene Farah
- Avery Richards