TY - JOUR
T1 - Assessing the performance of 38 machine learning models: the case of land consumption rates in Bavaria, Germany
AU - Hagenauer, Julian
AU - Omrani, Hichem
AU - Helbich, Marco
N1 - doi: 10.1080/13658816.2019.1579333
PY - 2019/2/22
Y1 - 2019/2/22
N2 - Machine learning (ML) is at the forefront of land-use change modeling. Due to numerous available ML approaches, the model choice is complex and usually based on ad hoc decisions, though informed through a few comparative studies that considered a limited number of models. This study contributes a comprehensive comparison of 38 ML models to examine land consumption rates (LCR) (i.e. the transition of landscapes to built-up areas). We modeled LCR for 2009?2015 in Bavaria, Germany, and predicted rates for 2015?2021 at a municipality level. To assess the performance of each approach, we measured the mean absolute error (MAE), the root-mean-square error (RMSE), and the coefficient of determination (R2) using cross-validation. All algorithms consistently predicted that the land consumption rate for Bavaria will increase. eXtreme gradient boosting decision trees performed best with respect to the RMSE (0.500) and R2 (0.183), while the support vector machine with polynomial kernel has the lowest MAE (0.288). The generalized additive model and the random forest models also performed well. We recommend these ML approaches for future land consumption and land-use change studies. A poor performance was found for recursive partitioning by decision trees, self-organizing maps, and partitioning using deletion, substitution, and addition moves.
AB - Machine learning (ML) is at the forefront of land-use change modeling. Due to numerous available ML approaches, the model choice is complex and usually based on ad hoc decisions, though informed through a few comparative studies that considered a limited number of models. This study contributes a comprehensive comparison of 38 ML models to examine land consumption rates (LCR) (i.e. the transition of landscapes to built-up areas). We modeled LCR for 2009?2015 in Bavaria, Germany, and predicted rates for 2015?2021 at a municipality level. To assess the performance of each approach, we measured the mean absolute error (MAE), the root-mean-square error (RMSE), and the coefficient of determination (R2) using cross-validation. All algorithms consistently predicted that the land consumption rate for Bavaria will increase. eXtreme gradient boosting decision trees performed best with respect to the RMSE (0.500) and R2 (0.183), while the support vector machine with polynomial kernel has the lowest MAE (0.288). The generalized additive model and the random forest models also performed well. We recommend these ML approaches for future land consumption and land-use change studies. A poor performance was found for recursive partitioning by decision trees, self-organizing maps, and partitioning using deletion, substitution, and addition moves.
KW - land consumption
KW - land-use
KW - machine learning
KW - model comparison
KW - germany
UR - https://www.tandfonline.com/doi/full/10.1080/13658816.2019.1579333
UR - http://www.mendeley.com/research/assessing-performance-38-machine-learning-models-case-land-consumption-rates-bavaria-germany
U2 - 10.1080/13658816.2019.1579333
DO - 10.1080/13658816.2019.1579333
M3 - Article
SN - 1365-8816
SP - 1
EP - 21
JO - International Journal of Geographical Information Science
JF - International Journal of Geographical Information Science
ER -