Skip to main navigation Skip to search Skip to main content

Optimisation in machine learning: An application to topsoil organic stocks prediction in a dry forest ecosystem

  • Anika Gebauer
  • , Victor M. Brito Gómez
  • , Mareike Ließ

Research output: Contribution to journalArticlepeer-review

11 Scopus citations

Abstract

Soil organic carbon (SOC) sequestration plays a key role in reducing the atmospheric greenhouse gas concentration. However, dry forest ecosystems in Ecuador are endangered to become a source of carbon emissions because of deforestation. Often spatial information, necessary to quantify potential carbon loss to the atmosphere, is missing. This particularly applies to remote areas of limited accessibility. This study aims to regionalise the SOC stocks of a small and poorly accessible dry forest ecosystem in southwestern Ecuador by using boosted regression tree (BRT) models. Resampling in a nested repeated k-fold cross validation approach was applied to develop robust models for a dataset of 118 samples with limited predictor information. To select an optimal set of model parameters, optimisation by differential evolution (DE) was applied for parameter tuning. Predictor selection was implemented using the same optimisation algorithm. This study demonstrates how the predictive performance of BRT models can be improved by applying an optimisation approach for parameter tuning and predictor selection. Model performance was improved by approximately 40% concerning the R2. Still, the results also demonstrated the difficulties of machine learning applications in small and highly heterogeneous natural areas. Very variable or even random factors were assumed to distort the relationship between predictor and response variables. We assume that the presented approach is particularly successful in the case of a real-valued multivariate space of tuning parameters. However, this requires testing in further machine learning applications and algorithms.

Original languageEnglish
Article number113846
JournalGeoderma
Volume354
DOIs
StatePublished - 15 Nov 2019
Externally publishedYes

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 13 - Climate Action
    SDG 13 Climate Action

Keywords

  • Cross validation
  • Differential evolution
  • Dry forest
  • Machine learning
  • Model fitting
  • Soil organic carbon

Fingerprint

Dive into the research topics of 'Optimisation in machine learning: An application to topsoil organic stocks prediction in a dry forest ecosystem'. Together they form a unique fingerprint.

Cite this