Comprehensively evaluating the performance of species distribution models across clades and resolutions: choosing the right tool for the job
Context Species distribution modeling (SDM) is an integral tool for conservation, biogeography, and climate change biology. However, practioners have to choose from increasingly numerous SDM algorithms performing well under different conditions, including clade and resolution. Objectives To identify the most suitable SDM algorithms for trees, birds, mammals, and insects, uncover the driving factors of predictive performance, and examine how resolution affects performance and variable importance. Methods We use 27 SDM implementations, including random forests (RF), boosted regression trees (BRT), and Mahalanobis distance (MAH), and a comprehensive dataset of 49 species in Europe (trees, birds, mammals, and insects) to fit a total of 19,845 models, at 20 km, 10 km, and 5 km resolution. For selected species, we also compare the mapped predictions of 3 algorithms, and assess how variable importance changes with resolution for BRT. Results RF and BRT outperformed in terms of model performance (AUC = 0.938) for all clades (but not species), whereas decision trees, MaxLike, and Lasso overall underperformed (AUC = 0.848). The performance majorly depended on both clade (F = 101.4) and its interaction with resolution (F = 133.2), and displayed a general decline with resolution, while variable importance exhibited complex shifts in response to resolution. Conclusions RF and BRT are highly recommended but may require bias correction methods, whereas decision trees appeared unfavorable—particularly at higher resolutions. Given the complicated picture at the species level, varying tendencies to overfit, and resolution effects on both model performance and variable importance, we urge to routinely explore a range of algorithms, parametrizations, and resolutions. Graphical abstract.
Medienart: |
Artikel |
---|
Erscheinungsjahr: |
2022 |
---|---|
Erschienen: |
2022 |
Enthalten in: |
Zur Gesamtaufnahme - volume:37 |
---|---|
Enthalten in: |
Landscape ecology - 37(2022), 8 vom: 17. Juni, Seite 2045-2063 |
Sprache: |
Englisch |
---|
Beteiligte Personen: |
Wunderlich, Rainer Ferdinand [VerfasserIn] |
---|
Links: |
Volltext [lizenzpflichtig] |
---|
Themen: |
Algorithm |
---|
Anmerkungen: |
© The Author(s), under exclusive licence to Springer Nature B.V. 2022 |
---|
doi: |
10.1007/s10980-022-01465-1 |
---|
funding: |
|
---|---|
Förderinstitution / Projekttitel: |
|
PPN (Katalog-ID): |
OLC2079252089 |
---|
LEADER | 01000caa a22002652 4500 | ||
---|---|---|---|
001 | OLC2079252089 | ||
003 | DE-627 | ||
005 | 20230509132723.0 | ||
007 | tu | ||
008 | 221220s2022 xx ||||| 00| ||eng c | ||
024 | 7 | |a 10.1007/s10980-022-01465-1 |2 doi | |
035 | |a (DE-627)OLC2079252089 | ||
035 | |a (DE-He213)s10980-022-01465-1-p | ||
040 | |a DE-627 |b ger |c DE-627 |e rakwb | ||
041 | |a eng | ||
082 | 0 | 4 | |a 570 |a 910 |a 630 |q VZ |
084 | |a 12 |2 ssgn | ||
084 | |a BIODIV |q DE-30 |2 fid | ||
100 | 1 | |a Wunderlich, Rainer Ferdinand |e verfasserin |4 aut | |
245 | 1 | 0 | |a Comprehensively evaluating the performance of species distribution models across clades and resolutions: choosing the right tool for the job |
264 | 1 | |c 2022 | |
336 | |a Text |b txt |2 rdacontent | ||
337 | |a ohne Hilfsmittel zu benutzen |b n |2 rdamedia | ||
338 | |a Band |b nc |2 rdacarrier | ||
500 | |a © The Author(s), under exclusive licence to Springer Nature B.V. 2022 | ||
520 | |a Context Species distribution modeling (SDM) is an integral tool for conservation, biogeography, and climate change biology. However, practioners have to choose from increasingly numerous SDM algorithms performing well under different conditions, including clade and resolution. Objectives To identify the most suitable SDM algorithms for trees, birds, mammals, and insects, uncover the driving factors of predictive performance, and examine how resolution affects performance and variable importance. Methods We use 27 SDM implementations, including random forests (RF), boosted regression trees (BRT), and Mahalanobis distance (MAH), and a comprehensive dataset of 49 species in Europe (trees, birds, mammals, and insects) to fit a total of 19,845 models, at 20 km, 10 km, and 5 km resolution. For selected species, we also compare the mapped predictions of 3 algorithms, and assess how variable importance changes with resolution for BRT. Results RF and BRT outperformed in terms of model performance (AUC = 0.938) for all clades (but not species), whereas decision trees, MaxLike, and Lasso overall underperformed (AUC = 0.848). The performance majorly depended on both clade (F = 101.4) and its interaction with resolution (F = 133.2), and displayed a general decline with resolution, while variable importance exhibited complex shifts in response to resolution. Conclusions RF and BRT are highly recommended but may require bias correction methods, whereas decision trees appeared unfavorable—particularly at higher resolutions. Given the complicated picture at the species level, varying tendencies to overfit, and resolution effects on both model performance and variable importance, we urge to routinely explore a range of algorithms, parametrizations, and resolutions. Graphical abstract | ||
650 | 4 | |a Algorithm | |
650 | 4 | |a Clades | |
650 | 4 | |a Species distribution model | |
650 | 4 | |a Model performance | |
650 | 4 | |a Resolution | |
700 | 1 | |a Mukhtar, Hussnain |4 aut | |
700 | 1 | |a Lin, Yu-Pin |0 (orcid)0000-0003-1954-334X |4 aut | |
773 | 0 | 8 | |i Enthalten in |t Landscape ecology |d Springer Netherlands, 1987 |g 37(2022), 8 vom: 17. Juni, Seite 2045-2063 |w (DE-627)130857424 |w (DE-600)1027798-5 |w (DE-576)052841901 |x 0921-2973 |7 nnns |
773 | 1 | 8 | |g volume:37 |g year:2022 |g number:8 |g day:17 |g month:06 |g pages:2045-2063 |
856 | 4 | 1 | |u https://doi.org/10.1007/s10980-022-01465-1 |z lizenzpflichtig |3 Volltext |
912 | |a GBV_USEFLAG_A | ||
912 | |a SYSFLAG_A | ||
912 | |a GBV_OLC | ||
912 | |a FID-BIODIV | ||
912 | |a SSG-OLC-ARC | ||
912 | |a SSG-OLC-FOR | ||
912 | |a SSG-OLC-PHA | ||
912 | |a SSG-OLC-DE-84 | ||
951 | |a AR | ||
952 | |d 37 |j 2022 |e 8 |b 17 |c 06 |h 2045-2063 |