Air quality prediction by machine learning methods

UBC Theses and Dissertations

Featured Collection

UBC Theses and Dissertations

Air quality prediction by machine learning methods Peng, Huiping

Abstract

As air pollution is a complex mixture of toxic components with considerable impact on humans, forecasting air pollution concentration emerges as a priority for improving life quality. In this study, air quality data (observational and numerical) were used to produce hourly spot concentration forecasts of ozone (O₃), particulate matter 2.5μm (PM₂.₅) and nitrogen dioxide (NO₂), up to 48 hours for six stations across Canada -- Vancouver, Edmonton, Winnipeg, Toronto, Montreal and Halifax. Using numerical data from an air quality model (GEM-MACH15) as predictors, forecast models for pollutant concentrations were built using multiple linear regression (MLR) and multi-layer perceptron neural networks (MLP NN). A relatively new method, the extreme learning machine (ELM), was also used to overcome the limitation of linear methods as well as the large computational demand of MLP NN. In operational forecasting, the continuous arrival of new data means frequent updating of the models is needed. This type of learning, called online sequential learning, is straightforward for MLR and ELM but not for MLP NN. Forecast performance of the online sequential MLR (OSMLR) and online sequential ELM (OSELM), together with stepwise MLR, all updated daily were compared with MLP NN updated seasonally, and the benchmark, updatable model output statistics (UMOS) from Environmental Canada. Overall OSELM tended to slightly outperform the other models including UMOS, being most successful with ozone forecasts and least with PM₂.₅ forecasts. MLP NN updated seasonally was generally underperforming the linear models MLR and OSMLR, indicating the need to update a nonlinear model frequently.

Item Metadata

Title	Air quality prediction by machine learning methods
Creator	Peng, Huiping
Publisher	University of British Columbia
Date Issued	2015
Description	As air pollution is a complex mixture of toxic components with considerable impact on humans, forecasting air pollution concentration emerges as a priority for improving life quality. In this study, air quality data (observational and numerical) were used to produce hourly spot concentration forecasts of ozone (O₃), particulate matter 2.5μm (PM₂.₅) and nitrogen dioxide (NO₂), up to 48 hours for six stations across Canada -- Vancouver, Edmonton, Winnipeg, Toronto, Montreal and Halifax. Using numerical data from an air quality model (GEM-MACH15) as predictors, forecast models for pollutant concentrations were built using multiple linear regression (MLR) and multi-layer perceptron neural networks (MLP NN). A relatively new method, the extreme learning machine (ELM), was also used to overcome the limitation of linear methods as well as the large computational demand of MLP NN. In operational forecasting, the continuous arrival of new data means frequent updating of the models is needed. This type of learning, called online sequential learning, is straightforward for MLR and ELM but not for MLP NN. Forecast performance of the online sequential MLR (OSMLR) and online sequential ELM (OSELM), together with stepwise MLR, all updated daily were compared with MLP NN updated seasonally, and the benchmark, updatable model output statistics (UMOS) from Environmental Canada. Overall OSELM tended to slightly outperform the other models including UMOS, being most successful with ozone forecasts and least with PM₂.₅ forecasts. MLP NN updated seasonally was generally underperforming the linear models MLR and OSMLR, indicating the need to update a nonlinear model frequently.
Genre	Thesis/Dissertation
Type	Text
Language	eng
Date Available	2015-10-24
Provider	Vancouver : University of British Columbia Library
Rights	Attribution-NonCommercial-NoDerivs 2.5 Canada
DOI	10.14288/1.0166787
URI	http://hdl.handle.net/2429/55069
Degree	Master of Science - MSc
Program	Atmospheric Science
Affiliation	Science, Faculty of; Earth, Ocean and Atmospheric Sciences, Department of
Degree Grantor	University of British Columbia
Graduation Date	2015-11
Campus	UBCV
Scholarly Level	Graduate
Rights URI	http://creativecommons.org/licenses/by-nc-nd/2.5/ca/
Aggregated Source Repository	DSpace

Open Collections

UBC Theses and Dissertations

UBC Theses and Dissertations

Air quality prediction by machine learning methods Peng, Huiping

Abstract

Item Metadata

Item Media

Item Citations and Data

Rights