Link Search Menu Expand Document

Outline

Statistics Canada releases Public Use Microdata Files (PUMFs) for the Census and many of its surveys. These de-identified files contain responses at the individual level and are a rich source of data for researchers.

This workshop demonstrates how to access, explore, and analyze Statistics Canada microdata using R. We will work with the Canadian Tobacco and Nicotine Survey but the content applies to any Statistics Canada PUMF, hundreds of which are freely available in Abacus, UBC Library’s data repository.

There are no prerequisites. Participants familiar with R and RStudio can follow along on their own workstations. For those new to R, the workshop showcases how R can be used for analysis and reproducibility.

If you are new to R and want to participate fully, please install R and RStudio before the workshop:

Learning objectives

At the end of this workshop, you will be able to:

  1. Describe how microdata differs from statistics, and how it can be used
  2. Find Statistics Canada Public Use Microdata Files (PUMFs) in Abacus, UBC Library’s data repository
  3. Interpret PUMF documentation, including user guides and data codebooks
  4. Appreciate how R can support your research process and improve reproducibility

Schedule

0:00 Welcome and using Zoom
0:05 Introduction to microdata
0:15 Finding Statistics Canada microdata in Abacus (PUMFs)
0:20 Orientation to the Canadian Tobacco and Nicotine Survey (CTNS)
0:30 Example of data analysis and visualisation in R
1:00 Post-workshop discussion (optional, up to 30mins)

Resources

For support finding or working with data: