This repository provides updated data and code for IRS County-to-County Migration Data. This is based on the repository developed by Hauer & Byars (2019). Thanks for publishing this data!
This updated version resolves a few errors in the previous data, including:
- correct some messed FIPS codes in Year 1998 due to format discrepancies in raw data
- correct one coding error in raw data in Year 2003
- resolve error messages in processing excel files in Years 1992-
- resolve error messages in processing text files in Years 1990,1991
- correct underestimation of unidentified and total migrants in Years 1990,1991
- add clarification that the unidentified and total migrants include international migrants, as well as the possibility for future work to discern these two types
The updated data will generate different results compared to the previous version for migration in Year 1998, and total and unidentified migration in Years 1990, 1991, and 2003.
Detailed information can be found in changelog
Data is organized in identical format as the previous version:
LATEX
— Contains latex files to reproduce the main manuscript file. -R
- Contains the R scripts needed to reproduce the flat IRS migration datafile. -MigData
— Initial data resources, unprocessed. -DATA-PROCESSED
— Final, post-processed data.
- Feel free to create a new branch for further incorporation and analysis.
- Please properly cite all use of the underlying code or data. Thanks!
The final IRS migration data can be downloaded at DATA-PROCESSED/county_migration_data.txt
or reproduced by running the R/999-master_script.R
file.
ORIGIN
- Refers to the 5-digit FIPS code for the origin of the migrants.
DESTINATION
- Refers to the 5-digit FIPS code for the destination of the migrants.
1990:2010
- Refers to number of migrants who moved from ORIGIN to DESTINATION in a given year.
- NOTE: there is an additional 5-digit FIPS code in the both the ORIGIN and DESTINATION field. This is coded as 99999 and is in reference to all unidentified migration flows containing less than 10 tax filers as well as international migration flows.
For any issues with the functionality of these scripts please create an issue.
The data collected and presented is licensed under the Creative Commons Attribution 3.0 license, and the underlying code used to format, analyze and display that content is licensed under the MIT license.