GSTDTAP  > 资源环境科学
DOI10.1289/EHP9752
Deep Ensemble Machine Learning Framework for the Estimation of PM2.5 Concentrations
Wenhua Yu; Shanshan Li; Tingting Ye; Rongbin Xu; Jiangning Song; Yuming Guo
2022-03-07
发表期刊Environmental Health Perspectives
出版年2022
英文摘要

Abstract

Background:

Accurate estimation of historical PM2.5 (particle matter with an aerodynamic diameter of less than 2.5μm) is critical and essential for environmental health risk assessment.

Objectives:

The aim of this study was to develop a multiple-level stacked ensemble machine learning framework for improving the estimation of the daily ground-level PM2.5 concentrations.

Methods:

An innovative deep ensemble machine learning framework (DEML) was developed to estimate the daily PM2.5 concentrations. The framework has a three-stage structure: At the first stage, four base models [gradient boosting machine (GBM), support vector machine (SVM), random forest (RF), and eXtreme gradient boosting (XGBoost)] were used to generate a new data set of PM2.5 concentrations for training the next-stage learners. At the second stage, three meta-models [RF, XGBoost, and Generalized Linear Model (GLM)] were used to estimate PM2.5 concentrations using a combination of the original data set and the predictions from the first-stage models. At the third stage, a nonnegative least squares (NNLS) algorithm was employed to obtain the optimal weights for PM2.5 estimation. We took the data from 133 monitoring stations in Italy as an example to implement the DEML to predict daily PM2.5 at each 1km×1km grid cell from 2015 to 2019 across Italy. We evaluated the model performance by performing 10-fold cross-validation (CV) and compared it with five benchmark algorithms [GBM, SVM, RF, XGBoost, and Super Learner (SL)].

Results:

The results revealed that the PM2.5 prediction performance of DEML [coefficients of determination (R2)=0.87 and root mean square error (RMSE)=5.38μg/m3] was superior to any benchmark models (with R2 of 0.51, 0.76, 0.83, 0.70, and 0.83 for GBM, SVM, RF, XGBoost, and SL approach, respectively). DEML displayed reliable performance in capturing the spatiotemporal variations of PM2.5 in Italy.

Discussion:

The proposed DEML framework achieved an outstanding performance in PM2.5 estimation, which could be used as a tool for more accurate environmental exposure assessment. https://doi.org/10.1289/EHP9752

领域资源环境
URL查看原文
引用统计
文献类型期刊论文
条目标识符http://119.78.100.173/C666/handle/2XK7JSWQ/347793
专题资源环境科学
推荐引用方式
GB/T 7714
Wenhua Yu,Shanshan Li,Tingting Ye,et al. Deep Ensemble Machine Learning Framework for the Estimation of PM2.5 Concentrations[J]. Environmental Health Perspectives,2022.
APA Wenhua Yu,Shanshan Li,Tingting Ye,Rongbin Xu,Jiangning Song,&Yuming Guo.(2022).Deep Ensemble Machine Learning Framework for the Estimation of PM2.5 Concentrations.Environmental Health Perspectives.
MLA Wenhua Yu,et al."Deep Ensemble Machine Learning Framework for the Estimation of PM2.5 Concentrations".Environmental Health Perspectives (2022).
条目包含的文件
条目无相关文件。
个性服务
推荐该条目
保存到收藏夹
查看访问统计
导出为Endnote文件
谷歌学术
PM2.5+Concentrations" target="_blank">谷歌学术中相似的文章
[Wenhua Yu]的文章
[Shanshan Li]的文章
[Tingting Ye]的文章
百度学术
PM2.5+Concentrations" target="_blank">百度学术中相似的文章
[Wenhua Yu]的文章
[Shanshan Li]的文章
[Tingting Ye]的文章
必应学术
PM2.5+Concentrations" target="_blank">必应学术中相似的文章
[Wenhua Yu]的文章
[Shanshan Li]的文章
[Tingting Ye]的文章
相关权益政策
暂无数据
收藏/分享
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。