Default Demographic Data
By default, Synthea™ contains publicly available demographic data obtained from the US Census Bureau. The data was post-processed to create population input data for every place (town and city) in the United States. This post-processed data can be used with Synthea to generate representative populations.
The census data files used to assemble the default demographics file (`src/main/resources/geography/demographics.csv`) are as follows:
File Set | Description |
---|---|
sub-est2015* | Subcounty population estimates |
cc-est2015-alldata* | County population demographic distributions (gender, race, age groups) |
ACS_14_5YR_S1501* | Education level attainment distributions |
ACS_14_5YR_S1901* | Income level distributions |

Property | Detail |
---|---|
Files | sub-est2015* |
Source | US Census Bureau |
Year | 2010 Census |
URL | https://www2.census.gov/programs-surveys/popest/datasets/2010-2015/cities/totals/ |

Property | Detail |
---|---|
Files | cc-est2015-alldata* |
Source | US Census Bureau |
Year | 2010 Census |
URL | https://www2.census.gov/programs-surveys/popest/datasets/2010-2015/counties/asrh/ |

Property | Detail |
---|---|
Files | ACS_14_5YR_S1501* |
Source | US Census Bureau |
Year | 2010-2014 American Community Survey 5-Year Estimates |
URL | https://factfinder.census.gov/bkmk/table/1.0/en/ACS/14_5YR/S1501/0100000US.16000.004 |
URL2 | https://factfinder.census.gov/bkmk/table/1.0/en/ACS/14_5YR/S1501/0400000US01.06000 |

Property | Detail |
---|---|
Files | ACS_14_5YR_S1901* |
Source | US Census Bureau |
Year | 2010-2014 American Community Survey 5-Year Estimates |
URL | https://factfinder.census.gov/bkmk/table/1.0/en/ACS/14_5YR/S1901/0100000US.16000.004 |
URL2 | https://factfinder.census.gov/bkmk/table/1.0/en/ACS/14_5YR/S1901/0400000US01.06000 |

The SDoH (social determinants of health) file contains county-level data derived from the 2018 Social Vulnerability Index and the 2021 Community Health Rankings. The file has the following headers:
FIPS_CODE,
COUNTY_CODE,
COUNTY,
ST,
STATE,
FOOD_INSECURITY,
SEVERE_HOUSING_COST_BURDEN,
UNEMPLOYED,
NO_VEHICLE_ACCESS,
UNINSURED
The first five columns (`FIPS_CODE,COUNTY_CODE,COUNTY,ST,STATE`) are required and fixed.
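When preparing a custom SDoH file, it can help to check the header before running a simulation. The following is a minimal Python sketch, not part of Synthea itself. It assumes that "required and fixed" means the five columns must appear first and in the listed order; the function name `split_sdoh_header` and the shortened sample header are hypothetical.

```python
import csv
import io

# The five leading columns named in the text above.
REQUIRED_COLUMNS = ["FIPS_CODE", "COUNTY_CODE", "COUNTY", "ST", "STATE"]


def split_sdoh_header(header):
    """Check the fixed leading columns and return the remaining column names.

    Assumption: the required columns must come first, in this exact order.
    """
    leading = header[:len(REQUIRED_COLUMNS)]
    if leading != REQUIRED_COLUMNS:
        raise ValueError(
            f"first five columns must be {REQUIRED_COLUMNS}, got {leading}"
        )
    return header[len(REQUIRED_COLUMNS):]


# Hypothetical, shortened header line used only for illustration.
sample = "FIPS_CODE,COUNTY_CODE,COUNTY,ST,STATE,FOOD_INSECURITY,UNINSURED\n"
header = next(csv.reader(io.StringIO(sample)))
print(split_sdoh_header(header))  # ['FOOD_INSECURITY', 'UNINSURED']
```

For a real file, the header could be read the same way from `open(path, newline="")` instead of the in-memory sample.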
The remaining columns (`FOOD_INSECURITY,SEVERE_HOUSING_COST_BURDEN,UNEMPLOYED,NO_VEHICLE_ACCESS,UNINSURED`), plus any others that might be added in the future, are added to the patient attributes based on the probability listed in the table cell.
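The probability-based assignment described above can be illustrated with a short Python sketch. This is not Synthea's implementation. It assumes each cell holds a probability between 0 and 1 and that the resulting attribute is a simple true/false flag keyed by the column name. The function name `sample_sdoh_attributes` and every value in the example row are hypothetical.

```python
import random

# Columns that identify the county rather than describe a probability.
FIXED_COLUMNS = {"FIPS_CODE", "COUNTY_CODE", "COUNTY", "ST", "STATE"}


def sample_sdoh_attributes(county_row, rng):
    """Draw one boolean attribute per non-fixed column of a county row.

    Assumption: each cell is a probability in [0, 1]; the attribute is
    True with that probability and False otherwise.
    """
    attributes = {}
    for column, cell in county_row.items():
        if column in FIXED_COLUMNS:
            continue
        probability = float(cell)
        # rng.random() is uniform on [0, 1), so this is True with
        # the given probability.
        attributes[column] = rng.random() < probability
    return attributes


# Hypothetical county row; the values are made up for illustration only.
row = {
    "FIPS_CODE": "00000",
    "COUNTY_CODE": "000",
    "COUNTY": "Example County",
    "ST": "XX",
    "STATE": "Example State",
    "FOOD_INSECURITY": "0.12",
    "NO_VEHICLE_ACCESS": "0.0",
    "UNINSURED": "1.0",
}

patient_attributes = sample_sdoh_attributes(row, random.Random(42))
print(patient_attributes)
```

Because the loop skips only the five fixed columns, any extra column added to the file later would be sampled the same way without code changes, which matches the "plus any others" behavior described in the text.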