Comparison of classical and machine-learning methods on spatio-temporal modeling of daily Ozone concentrations

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Effective actions to mitigate air pollution require of availability of high-resolution observations. Low-cost sensor technologies have emerged as an affordable solution to cope with this deficiency. However, since low-cost sensors are built with low-cost materials, they are prone to errors, gaps, bias, and noise. These problems need to be solved before data can be used to support research or decision making. Addressing lack of reliability in low-cost sensor data is a complex challenge that is still under research over several lines (e.g. accuracy estimation of low-cost sensor data). Current approaches in this line involve modeling, bias-correction, and more recently, data fusion methods relying on high-resolution air quality computational models. Overall, accuracy estimation can be reduced to a modeling problem. The focus of this work is studying, testing, and comparing suitable approaches for handling point-referenced spatio-temporal sensor data, particularly classical spatial models, spatio-temporal models, and popular machine learning methods. Among these approaches, Bayesian hierarchical models have a special consideration given the attention they have drawn during the last fifteen years. The benchmark supporting this comparison study is a real-life dataset made up of daily ozone observations taken from the USA Environmental Protection Agency (EPA) and meteorological variables extracted from the NCEP/NCAR Reanalysis Project (NNRP). The main contributions of this work are: (1) a systematic comparison of three kinds of models, using a 10-fold cross-validation exercise; and (2) a feature engineering method to create covariates meant to harness spatially correlated observations of point-referenced sensor data.

Original languageEnglish
Title of host publicationProceedings - 2020 46th Latin American Computing Conference, CLEI 2020
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages56-65
Number of pages10
ISBN (Electronic)9780738130644
DOIs
StatePublished - Oct 2020
Event46th Latin American Computing Conference, CLEI 2020 - Virtual, Loja, Ecuador
Duration: 19 Oct 202023 Oct 2020

Publication series

NameProceedings - 2020 46th Latin American Computing Conference, CLEI 2020

Conference

Conference46th Latin American Computing Conference, CLEI 2020
Country/TerritoryEcuador
CityVirtual, Loja
Period19/10/2023/10/20

Keywords

  • Air pollution
  • Gaussian process
  • machine-learning
  • sensor data
  • spatio-temporal models

Fingerprint

Dive into the research topics of 'Comparison of classical and machine-learning methods on spatio-temporal modeling of daily Ozone concentrations'. Together they form a unique fingerprint.

Cite this