Using deep learning and Google Street View to estimate the demographic makeup of neighborhoods across the United States
- PMID: 29183967
- PMCID: PMC5740675
- DOI: 10.1073/pnas.1700035114
Using deep learning and Google Street View to estimate the demographic makeup of neighborhoods across the United States
Abstract
The United States spends more than $250 million each year on the American Community Survey (ACS), a labor-intensive door-to-door study that measures statistics relating to race, gender, education, occupation, unemployment, and other demographic factors. Although a comprehensive source of data, the lag between demographic changes and their appearance in the ACS can exceed several years. As digital imagery becomes ubiquitous and machine vision techniques improve, automated data analysis may become an increasingly practical supplement to the ACS. Here, we present a method that estimates socioeconomic characteristics of regions spanning 200 US cities by using 50 million images of street scenes gathered with Google Street View cars. Using deep learning-based computer vision techniques, we determined the make, model, and year of all motor vehicles encountered in particular neighborhoods. Data from this census of motor vehicles, which enumerated 22 million automobiles in total (8% of all automobiles in the United States), were used to accurately estimate income, race, education, and voting patterns at the zip code and precinct level. (The average US precinct contains ∼1,000 people.) The resulting associations are surprisingly simple and powerful. For instance, if the number of sedans encountered during a drive through a city is higher than the number of pickup trucks, the city is likely to vote for a Democrat during the next presidential election (88% chance); otherwise, it is likely to vote Republican (82%). Our results suggest that automated systems for monitoring demographics may effectively complement labor-intensive approaches, with the potential to measure demographics with fine spatial resolution, in close to real time.
Keywords: computer vision; deep learning; demography; social analysis.
Copyright © 2017 the Author(s). Published by PNAS.
Conflict of interest statement
The authors declare no conflict of interest.
Figures



Similar articles
-
Residential scene classification for gridded population sampling in developing countries using deep convolutional neural networks on satellite imagery.Int J Health Geogr. 2018 May 9;17(1):12. doi: 10.1186/s12942-018-0132-1. Int J Health Geogr. 2018. PMID: 29743081 Free PMC article.
-
Health and the built environment in United States cities: measuring associations using Google Street View-derived indicators of the built environment.BMC Public Health. 2020 Feb 12;20(1):215. doi: 10.1186/s12889-020-8300-1. BMC Public Health. 2020. PMID: 32050938 Free PMC article.
-
Extended follow-up and spatial analysis of the American Cancer Society study linking particulate air pollution and mortality.Res Rep Health Eff Inst. 2009 May;(140):5-114; discussion 115-36. Res Rep Health Eff Inst. 2009. PMID: 19627030
-
Using machine learning to examine street green space types at a high spatial resolution: Application in Los Angeles County on socioeconomic disparities in exposure.Sci Total Environ. 2021 Sep 15;787:147653. doi: 10.1016/j.scitotenv.2021.147653. Epub 2021 May 8. Sci Total Environ. 2021. PMID: 36118158 Free PMC article.
-
Predicting socioeconomic indicators using transfer learning on imagery data: an application in Brazil.GeoJournal. 2023;88(1):1081-1102. doi: 10.1007/s10708-022-10618-3. Epub 2022 Mar 24. GeoJournal. 2023. PMID: 35345631 Free PMC article. Review.
Cited by
-
Using geospatial social media data for infectious disease studies: a systematic review.Int J Digit Earth. 2023;16(1):130-157. doi: 10.1080/17538947.2022.2161652. Epub 2023 Jan 3. Int J Digit Earth. 2023. PMID: 37997607 Free PMC article.
-
Urban visual intelligence: Uncovering hidden city profiles with street view images.Proc Natl Acad Sci U S A. 2023 Jul 4;120(27):e2220417120. doi: 10.1073/pnas.2220417120. Epub 2023 Jun 26. Proc Natl Acad Sci U S A. 2023. PMID: 37364096 Free PMC article.
-
Linking repeated subjective judgments and ConvNets for multimodal assessment of the immediate living environment.MethodsX. 2024 Jan 5;12:102556. doi: 10.1016/j.mex.2024.102556. eCollection 2024 Jun. MethodsX. 2024. PMID: 38283760 Free PMC article.
-
Multimodal deep learning from satellite and street-level imagery for measuring income, overcrowding, and environmental deprivation in urban areas.Remote Sens Environ. 2021 May;257:112339. doi: 10.1016/j.rse.2021.112339. Remote Sens Environ. 2021. PMID: 33941991 Free PMC article.
-
Ethical implications of AI and robotics in healthcare: A review.Medicine (Baltimore). 2023 Dec 15;102(50):e36671. doi: 10.1097/MD.0000000000036671. Medicine (Baltimore). 2023. PMID: 38115340 Free PMC article. Review.
References
-
- Department of Commerce, US Census Bureau US census bureau’s budget estimates. 2013 Available at www.osec.doc.gov/bmi/budget/fy13cbj/Census_FY2013_CongressionalJustifica.... Accessed September 13, 2014.
-
- Department of Commerce, US Census Bureau (2012) American community survey 5 year data (2008-2012). Available at https://factfinder.census.gov/faces/tableservices/jsf/pages/productview..... Accessed September 13, 2014.
-
- Department of Commerce, US Census Bureau (2010) Decennial census. Available at https://www.census.gov/data/developers/data-sets/decennial-census.html. Accessed September 13, 2014.
-
- Antenucci D, Cafarella M, Levenstein M, Ré C, Shapiro MD. Using Social Media to Measure Labor Market Flows. Technical Report 20010 National Bureau of Economic Research; Cambridge, MA: 2014.
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources
Research Materials