Skip to content

yijiawen1203/dataset_summary

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

73 Commits
 
 

Repository files navigation

this file includes the website link to many useful commonly-used dataset in my involved papers

1 U.S. county-level population /population details https://www.census.gov/data/datasets/time-series/demo/popest/2010s-counties-total.html https://www.census.gov/data/tables/time-series/demo/popest/2010s-counties-detail.html

  1. US GDP and personal income https://apps.bea.gov/regional/docs/DataAvailability.cfm

3.Florida local unemployment rate https://www.floridajobs.org/workforce-statistics/data-center/statistical-programs/local-area-unemployment-statistics

  1. US CBSA shapefile boundary: https://data-usdot.opendata.arcgis.com/datasets/core-based-statistical-areas/explore?location=37.329731%2C-121.933150%2C4.31&showTable=true

4_2. US county shapefile boundary: https://www.census.gov/geographies/mapping-files/time-series/geo/carto-boundary-file.html

4_3. US State shapefile boundary: https://data-usdot.opendata.arcgis.com/datasets/4f7f0c7847164a1ba5e6cca926853ab6_0/explore

  1. census block distance: https://www.nber.org/research/data/block-distance-database

  2. SIPP data: Survey of Income and Program Participation https://www.nber.org/research/data/survey-income-and-program-participation-sipp

  3. geo spatial database: FGDL Metadata Explorer https://www.fgdl.org/metadataexplorer/explorer.jsp

  4. Geo spatial database: US DOT Transportation center: https://data-usdot.opendata.arcgis.com/search?collection=dataset

  5. IRS migration data https://www-irs-gov.proxy.lib.fsu.edu/statistics/soi-tax-stats-migration-data

101: material relating to NPL https://junjiehu.github.io/cs769-spring22/lectures/

102:useful website https://github.com/JunweiLiang/awesome_lists/blob/main/webpage.md

103: Data describing party political control of the legislature and governorship are from the Partisan Division of American State Governments database, published by the Inter-University Consortium for Political and Social Research (ICPSR).

deep learning course http://deeplearning.cs.cmu.edu/S21/index.html

computational social science course in stanford https://www.cc.gatech.edu/classes/AY2022/cs6471_spring/

#paper check repetition before submission 文章查重复 https://app.ithenticate.com/en_us/login

GDP in the US GDP archive data: GDP by msa, but the data is not available for public before 2007. https://www.bea.gov/news/archive?field_related_product_target_id=466&created_1=18&title= BEA GDP DATA website: https://www.bea.gov/data/gdp/gross-domestic-product#gdp the historical data is starting from 2002, earlier data is not available.

GDP by msa, from 2001-2012: i download from https://apps.bea.gov/regional/histdata/releases/0914gmp/index.cfm

before 2001: i cannot find the GDP data. in this case i get the GDP by state: https://apps.bea.gov/regional/histdata/releases/1204gsp/index.cfm (1998-2003) https://apps.bea.gov/regional/histdata/releases/0503gsp/index.cfm (1977-2001)

which msa belong to which states in the us https://apps.bea.gov/regional/docs/msalist.cfm, the downloaded data is saved at: /Users/yoga/Documents/Projects/5 explore patent/69_match_innovative_ind_1990_2022/metrolist.csv

patent

useful website of patent searching! https://guides.library.umass.edu/c.php?g=672608&p=4736575 https://ppubs.uspto.gov/pubwebapp/static/pages/ppubsbasic.html

patent data: https://patentsview.org/

patent data match with the industry https://www.uspto.gov/web/offices/ac/ido/oeip/taf/naics/doc/naics_info.htm https://www.uspto.gov/web/offices/ac/ido/oeip/taf/naics/naics_toc.htm

Match USPC2NAICS in 2002 classification: the data is from 2008 to 2014 https://www.uspto.gov/web/offices/ac/ido/oeip/taf/data/naics_conc/

match USPC2SIC: the data is from 2001 to 2008. https://www.uspto.gov/web/offices/ac/ido/oeip/taf/data/sic_conc/

CPC2IPC matching

https://www.uspto.gov/web/patents/classification/cpc/html/conB82Ytoipc.html

match chinese patent with chinese company https://www.nature.com/articles/sdata201842

employment data

classify by industry,Codes and Titles: 1990 to 2006, QCEW data were coded using the 2002 version of NAICS. 2007 to 2010, QCEW data were coded using the 2007 version of NAICS. 2011 to 2016, QCEW data were coded using the 2012 version of NAICS. https://www.bls.gov/cew/classifications/industry/industry-titles.htm

the detailed description of the NAICS https://data.bls.gov/cew/apps/bls_naics/v3/bls_naics_app.htm#tab=download&naics=2022

the employment data in the metropolitan areas in the U.S. I download the data here and use it in my dissertation https://www.bls.gov/cew/downloadable-data-files.htm

the meaning of the column names in the downloaded csv file:https://www.bls.gov/cew/additional-resources/open-data/csv-data-slices.htm the meaning of NAICS industry code explaination https://www.bls.gov/cew/classifications/industry/industry-titles.htm

county code fips crosswalk to county name:https://transition.fcc.gov/oet/info/maps/census/fips/fips.txt

CBSA metro area, micro area code fips crosswalk to CBSA name pdf:https://www2.census.gov/programs-surveys/cps/methodology/2015%20Geography%20Cover.pdf

CBSA crosswalk to county fips: https://www.nber.org/research/data/census-core-based-statistical-area-cbsa-federal-information-processing-series-fips-county-crosswalk

topic modeling tutorial

https://tm4ss.github.io/docs/Tutorial_6_Topic_Models.html

social network dataset

Stanford Large Network Dataset Collection: https://snap.stanford.edu/data/

spatial social network analysis https://friendlycities-gatech.github.io/SSN_tutorial/

social network workshop from duke univeristy

https://sites.duke.edu/dnac/2023-snh-workshop/

https://sites.duke.edu/dnac/training/ data of the training https://drive.google.com/drive/folders/1FiRzZFYhDFKRy77AJWcjpRk3PXFaO-wi

NLP tutorial

https://www.usna.edu/Users/cs/nchamber/courses/nlp/s23/resources.html

https://www.usna.edu/Users/cs/nchamber/courses/nlp/s23/cal.html

Berkley NLP course: https://cal-cs288.github.io/sp20/

standford NLP course: https://web.stanford.edu/class/cs224u/

embed images to vector space https://github.com/jina-ai/clip-as-service

potential grant

Charles Koch Foundation: Graduate research support https://charleskochfoundation.org/grants/general-proposals/

funding for summer school https://docs.google.com/spreadsheets/d/1-a1EhwY7dsac7CsvjMlwNWRq9o3_Do581FD3oevA6Ww/edit#gid=0

small grant in behavior economics https://www.russellsage.org/how-to-apply/small-grants

FSU Congress of Graduate Students conference funding https://fsu-sgacogs.smapply.us/prog/

Introduction to Data Mining course slides from university of minisoda. https://www-users.cse.umn.edu/~kumar001/dmbook/index.php#item4

funding for dissertation and graduate study Institute for Humane Studies at George Mason University https://www.theihs.org/academic-programs/

General CS learning material

Introduction to data mining: https://www-users.cse.umn.edu/~kumar001/dmbook/index.php#item4

urban planning course material/Syllabus

the university of Washington urban planning course syllabus. https://urbdp.be.uw.edu/programs/mup-graduate-degree/udp-course-listings-and-syllabi/

portland state university urban planning course syllabus

graduate courses https://www.pdx.edu/urban-studies-planning/graduate-courses#500

undergraduate courses: https://www.pdx.edu/urban-studies-planning/undergraduate-courses

UIUC course syllabus https://urban.illinois.edu/student-life/urban-planning-courses/

rugters course syllabus https://bloustein.rutgers.edu/graduate/doctoral-studies/

UCLA course syllabus https://luskin.ucla.edu/urban-planning/for-our-students#toggle-id-3

UNC course syllabus https://planning.unc.edu/courselisting/

FSU travel funding

https://sga.fsu.edu/cogs-individual-forms.shtml

FSU service submitting tickets, such as HPC

https://servicecenter.fsu.edu/s/my-service-center?isdtp=p1

analyzing literature

AI powered literature search https://researchrabbitapp.com/home

https://www.litmaps.com/

difference in difference learning meterial

https://www.jonathandroth.com/did-resources/

Causal Inference

https://www.mixtapesessions.io/sessions/

st. Louis regional data exchange https://rdx.stldata.org/

teaching statement

vanderbilt university center for teaching https://cft.vanderbilt.edu/guides-sub-pages/teaching-statements/

useful video for learn https://urbaneconomics.org/workshops/lectures2020/ urban economics.

washu course listing

https://acadinfo.wustl.edu/CourseListings/Semester/Listing.aspx

GIS Shapefile data

Florida's Geospatial Open Data: https://geodata.floridagio.gov/ US census data: https://www.census.gov/data.html Arcgis data: https://hub.arcgis.com/search ESRI training data: https://www.esri.com/training/catalog/search/ UF GIS Data: https://fgdl.org/ land boundary information system: https://www.labins.org/ US national geospatial program: https://www.usgs.gov/programs/national-geospatial-program/small-scale-data?openChapters=chpwater%23chpwater.

ESRI ArG GIS data download: https://livingatlas.arcgis.com/en/home/

other dataset havard dataverse: https://dataverse.harvard.edu/

US census geodatabase: https://www.census.gov/geographies/mapping-files/time-series/geo/tiger-geodatabase-file.2020.html

US census tiger: https://www.census.gov/cgi-bin/geo/shapefiles/index.php

https://www.census.gov/geographies/mapping-files/time-series/geo/tiger-geodatabase-file.2020.html

satellite imagery satellite imagery came from the National Agriculture Imagery Program (NAIP)

About

summarize the link to different dataset.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published