TY - UNPB
T1 - Identifying summary statistics for approximate Bayesian computation in a phylogenetic island biogeography model
AU - Xie, Shu
AU - Valente, Luis
AU - Etienne, Rampal S.
PY - 2023/10/17
Y1 - 2023/10/17
N2 - Estimation of parameters of evolutionary island biogeography models, such as colonization and diversification rates, is important for a better understanding of island systems. A popular statistical inference framework is likelihood-based estimation of parameters using island species richness and phylogenetic data. Likelihood approaches require that the likelihood can be computed analytically or numerically, but with the increasing complexity of island biogeography models, this is often unfeasible. Simulation-based estimation methods may then be a promising alternative. One such method is approximate Bayesian computation (ABC), which compares summary statistics of the empirical data with the output of model simulations. However, ABC demands the definition of summary statistics that sufficiently describe the data, which is yet to be explored in island biogeography. Here, we propose a set of summary statistics and use it in an ABC framework for the estimation of parameters of an island biogeography model, DAISIE (Dynamic Assembly of Island biota through Speciation, Immigration and Extinction). For this model, likelihood-based inference is possible, which gives us the opportunity to assess the performance of the summary statistics. DAISIE currently only allows maximum likelihood estimation (MLE), so we additionally develop a likelihood-based Bayesian inference framework using Markov Chain Monte Carlo (MCMC) to enable comparison with the ABC results (i.e., making the same assumptions on prior distributions). We simulated phylogenies of island communities subject to colonization, speciation, and extinction using the DAISIE simulation model and compared the estimated parameters using the three inference approaches (MLE, MCMC and ABC). Our results show that the ABC algorithm performs well in estimating colonization and diversification rates, except when the species richness or amount of phylogenetic information from an island are low. We find that compared to island species diversity statistics, summary statistics that make use of phylogenetic and temporal patterns (e.g., the number of species through time) significantly improve ABC inference accuracy, especially in estimating colonization and anagenesis rates, as well as making inference converge considerably faster and perform better under the same number of iterations. Island biogeography is rapidly developing new simulation models that can explain the complexity of island biodiversity, and our study provides a set of informative summary statistics that can be used in island biogeography studies for which likelihood-based inference methods are not an option.Competing Interest StatementThe authors have declared no competing interest.
AB - Estimation of parameters of evolutionary island biogeography models, such as colonization and diversification rates, is important for a better understanding of island systems. A popular statistical inference framework is likelihood-based estimation of parameters using island species richness and phylogenetic data. Likelihood approaches require that the likelihood can be computed analytically or numerically, but with the increasing complexity of island biogeography models, this is often unfeasible. Simulation-based estimation methods may then be a promising alternative. One such method is approximate Bayesian computation (ABC), which compares summary statistics of the empirical data with the output of model simulations. However, ABC demands the definition of summary statistics that sufficiently describe the data, which is yet to be explored in island biogeography. Here, we propose a set of summary statistics and use it in an ABC framework for the estimation of parameters of an island biogeography model, DAISIE (Dynamic Assembly of Island biota through Speciation, Immigration and Extinction). For this model, likelihood-based inference is possible, which gives us the opportunity to assess the performance of the summary statistics. DAISIE currently only allows maximum likelihood estimation (MLE), so we additionally develop a likelihood-based Bayesian inference framework using Markov Chain Monte Carlo (MCMC) to enable comparison with the ABC results (i.e., making the same assumptions on prior distributions). We simulated phylogenies of island communities subject to colonization, speciation, and extinction using the DAISIE simulation model and compared the estimated parameters using the three inference approaches (MLE, MCMC and ABC). Our results show that the ABC algorithm performs well in estimating colonization and diversification rates, except when the species richness or amount of phylogenetic information from an island are low. We find that compared to island species diversity statistics, summary statistics that make use of phylogenetic and temporal patterns (e.g., the number of species through time) significantly improve ABC inference accuracy, especially in estimating colonization and anagenesis rates, as well as making inference converge considerably faster and perform better under the same number of iterations. Island biogeography is rapidly developing new simulation models that can explain the complexity of island biodiversity, and our study provides a set of informative summary statistics that can be used in island biogeography studies for which likelihood-based inference methods are not an option.Competing Interest StatementThe authors have declared no competing interest.
U2 - 10.1101/2023.10.13.562305
DO - 10.1101/2023.10.13.562305
M3 - Preprint
T3 - BioRxiv
BT - Identifying summary statistics for approximate Bayesian computation in a phylogenetic island biogeography model
PB - BioRxiv
ER -