Saved in:
Bibliographic Details
Main Authors: Xie, Tianyu, Yuan, Musu, Deng, Minghua, Zhang, Cheng
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2409.05282
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866917771530469376
author Xie, Tianyu
Yuan, Musu
Deng, Minghua
Zhang, Cheng
author_facet Xie, Tianyu
Yuan, Musu
Deng, Minghua
Zhang, Cheng
contents Probability estimation of tree topologies is one of the fundamental tasks in phylogenetic inference. The recently proposed subsplit Bayesian networks (SBNs) provide a powerful probabilistic graphical model for tree topology probability estimation by properly leveraging the hierarchical structure of phylogenetic trees. However, the expectation maximization (EM) method currently used for learning SBN parameters does not scale up to large data sets. In this paper, we introduce several computationally efficient methods for training SBNs and show that variance reduction could be the key for better performance. Furthermore, we also introduce the variance reduction technique to improve the optimization of SBN parameters for variational Bayesian phylogenetic inference (VBPI). Extensive synthetic and real data experiments demonstrate that our methods outperform previous baseline methods on the tasks of tree topology probability estimation as well as Bayesian phylogenetic inference using SBNs.
format Preprint
id arxiv_https___arxiv_org_abs_2409_05282
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Improving Tree Probability Estimation with Stochastic Optimization and Variance Reduction
Xie, Tianyu
Yuan, Musu
Deng, Minghua
Zhang, Cheng
Populations and Evolution
Computation
Machine Learning
Probability estimation of tree topologies is one of the fundamental tasks in phylogenetic inference. The recently proposed subsplit Bayesian networks (SBNs) provide a powerful probabilistic graphical model for tree topology probability estimation by properly leveraging the hierarchical structure of phylogenetic trees. However, the expectation maximization (EM) method currently used for learning SBN parameters does not scale up to large data sets. In this paper, we introduce several computationally efficient methods for training SBNs and show that variance reduction could be the key for better performance. Furthermore, we also introduce the variance reduction technique to improve the optimization of SBN parameters for variational Bayesian phylogenetic inference (VBPI). Extensive synthetic and real data experiments demonstrate that our methods outperform previous baseline methods on the tasks of tree topology probability estimation as well as Bayesian phylogenetic inference using SBNs.
title Improving Tree Probability Estimation with Stochastic Optimization and Variance Reduction
topic Populations and Evolution
Computation
Machine Learning
url https://arxiv.org/abs/2409.05282