Exploiting the full potential of Bayesian networks in predictive ecology
Date
2021
Authors
Ramazi, Pouria
Kunegel-Lion, Mélodie
Greiner, Russell
Lewis, Mark A.
Journal Title
Journal ISSN
Volume Title
Publisher
Methods in Ecology and Evolution
Abstract
1. Although ecological models used to make predictions from underlying covariates have a record of success, they also suffer from limitations. They are typically unable to make predictions when the value of one or more covariates is missing during the testing. Missing values can be estimated but methods are often unreliable and can result in poor accuracy. Similarly, missing values during the training can hinder parameter estimation of many ecological models. Bayesian networks can handle these and other limiting issues, such as having highly correlated covariates. However, they are rarely used to their full potential.
2. Indeed, Bayesian networks are commonly used to evaluate the knowledge of experts by constructing the network manually and often (incorrectly) interpreting the resulting network causally. We provide an approach to learn a Bayesian network fully from observed data, without relying on experts and show how to appropriately interpret the resulting network, both to identify how the variables (covariates and target) are interrelated and to answer probabilistic queries.
3. We apply this method to the case study of a mountain pine beetle infestation and find that the trained Bayesian network has a predictive accuracy of 0.88 AUC. We classify the covariates as primary and secondary in terms of contributing to the prediction and show that the predictive accuracy does not deteriorate when the secondary covariates are missing and degrades to only 0.76 when one of the primary covariates is missing.
4. As a complement to the previous work on constructing Bayesian networks by hand, we show that if instead, both the structure and parameters are learned only from data, we can achieve more accurate predictions as well as generate new insights about the underlying processes.
Description
Keywords
automatic learning, Bayesian network, invasive species, machine-learning, mountain pine beetle, pest, risk modelling, structure learning
Citation
Ramazi, P., Kunegel-Lion, M., Greiner, R., & Lewis, M. A. (2020). Exploiting the full potential of Bayesian networks in predictive ecology. Methods in Ecology and Evolution, 12(1), 135-149. https://doi.org/10.1111/2041-210x.13509