High-Resolution Geospatial Database: National Criteria-Air-Pollutant Concentrations in the Contiguous U.S., 2016-2020


Geosci Data J. 2025 Apr;12(2):e70005. doi: 10.1002/gdj3.70005. Epub 2025 Apr 7.

ABSTRACT

Concentration estimates for ambient air pollution are used widely in fields such as environmental epidemiology, health impact assessment, urban planning, environmental equity and sustainability. This study builds on previous efforts by developing an updated high-resolution geospatial database of population-weighted annual-average concentrations for six criteria air pollutants (PM2.5, PM10, CO, NO2, SO2, O3) across the contiguous U.S. over a five-year period (2016-2020). We developed Land Use Regression (LUR) models within a partial-least-squares universal kriging framework, incorporating several land use, geospatial and satellite-based predictor variables. The LUR models were validated using conventional and clustered cross-validation, with the former consistently showing superior performance in capturing the variability of air quality. Most models demonstrated reliable performance (e.g., mean squared error-based R² > 0.8, standardised root mean squared error < 0.1). We used the best modelling approach to develop estimates by Census Block, which were then aggregated as population-weighted averages to the Census Block Group, Census Tract and County geographies. Our database provides valuable insights into the dynamics of air pollution, with utility for environmental risk assessment, public health, policy and urban planning.
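
The aggregation and validation steps described above can be illustrated with a minimal Python sketch. It is not the authors' code: the field names (`conc`, `pop`, `tract`) and both function names are hypothetical, and the MSE-based R² is assumed here to be 1 - MSE / Var(observed), which is a common definition but is not stated in the abstract.

```python
from collections import defaultdict


def population_weighted_mean(blocks, key):
    """Aggregate block-level concentrations to a coarser geography.

    `blocks` is an iterable of dicts with hypothetical fields:
    'conc' (block concentration), 'pop' (block population), and a
    geography identifier stored under `key` (e.g. 'tract').
    """
    weighted_sum = defaultdict(float)
    population = defaultdict(float)
    for block in blocks:
        weighted_sum[block[key]] += block["conc"] * block["pop"]
        population[block[key]] += block["pop"]
    # Geographies with zero total population have no defined weighted mean.
    return {g: weighted_sum[g] / population[g]
            for g in weighted_sum if population[g] > 0}


def mse_r2(observed, predicted):
    """MSE-based R^2, assumed here to be 1 - MSE / Var(observed)."""
    n = len(observed)
    mean_obs = sum(observed) / n
    mse = sum((o - p) ** 2 for o, p in zip(observed, predicted)) / n
    var = sum((o - mean_obs) ** 2 for o in observed) / n
    return 1.0 - mse / var


# Illustrative (made-up) block records, not values from the database.
example_blocks = [
    {"tract": "A", "conc": 10.0, "pop": 100},
    {"tract": "A", "conc": 20.0, "pop": 300},
    {"tract": "B", "conc": 8.0, "pop": 50},
]
tract_means = population_weighted_mean(example_blocks, "tract")
```

Unlike a 1:1 regression R², this form of R² penalises bias as well as scatter, so a model that predicts the observed mean everywhere scores 0 rather than an undefined value.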

PMID:40256251 | PMC:PMC12007897 | DOI:10.1002/gdj3.70005