Linking Public Records to Investigate Long-term Social Change
Project Description
This project uses the increasing availability of digitized public records to examine the long-term social, political, and economic outcomes of dramatic events. Though a variety of public records are increasingly available in digital format, linking individuals across different record sources presents complex methodological challenges. Some issues that we will investigate include the intergenerational impacts of political violence, the effects of mass shootings on the social and political attitudes of the victims, and the behavioral effects racial segregation and desegregation on economic well-being. Given that there are several streams of ongoing research, the specific topic can be matched to the interests of the applicant.
Supervisor
HENDRY, David James
Quota
1
Course type
UROP1000
UROP1100
UROP2100
UROP3100
UROP4100
Applicant's Roles
The applicant’s role may include conducting literature reviews, searching for, collecting, and cleaning quantitative data from the web, performing qualitative record matching between different data sources, developing and using machine learning models to perform automated record matching between data sources, data analysis, and assisting with the development of new research questions. No specialized prior knowledge is required, and the specific tasks can be matched with the applicant’s skill level and interests. Students with no prior knowledge of programming or software for statistical analysis are encouraged to learn the basics of at least one software platform for some aspect of the project (e.g., R and Python for web scraping, predictive modeling, and statistical analysis).
Applicant's Learning Objectives
Students will learn to formulate research questions, conduct literature reviews, perform qualitative and quantitative record matching, and perform basic web scraping routines. The project will give students hands-on experience with various aspects of large-scale record linking for social research.
Complexity of the project
Challenging