Saved in:
Bibliographic Details
Main Authors: Cooper, Connor, Webb, Geoffrey I., Schmidt, Daniel F.
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2505.23320
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866918038045982720
author Cooper, Connor
Webb, Geoffrey I.
Schmidt, Daniel F.
author_facet Cooper, Connor
Webb, Geoffrey I.
Schmidt, Daniel F.
contents Bayesian network classifiers (BNCs) possess a number of properties desirable for a modern classifier: They are easily interpretable, highly scalable, and offer adaptable complexity. However, traditional methods for learning BNCs have historically underperformed when compared to leading classification methods such as random forests. Recent parameter smoothing techniques using hierarchical Dirichlet processes (HDPs) have enabled BNCs to achieve performance competitive with random forests on categorical data, but these techniques are relatively inflexible, and require a complicated, specialized sampling process. In this paper, we introduce a novel method for parameter estimation that uses a log-linear regression to approximate the behaviour of HDPs. As a linear model, our method is remarkably flexible and simple to interpret, and can leverage the vast literature on learning linear models. Our experiments show that our method can outperform HDP smoothing while being orders of magnitude faster, remaining competitive with random forests on categorical data.
format Preprint
id arxiv_https___arxiv_org_abs_2505_23320
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle Efficient Parameter Estimation for Bayesian Network Classifiers using Hierarchical Linear Smoothing
Cooper, Connor
Webb, Geoffrey I.
Schmidt, Daniel F.
Machine Learning
Bayesian network classifiers (BNCs) possess a number of properties desirable for a modern classifier: They are easily interpretable, highly scalable, and offer adaptable complexity. However, traditional methods for learning BNCs have historically underperformed when compared to leading classification methods such as random forests. Recent parameter smoothing techniques using hierarchical Dirichlet processes (HDPs) have enabled BNCs to achieve performance competitive with random forests on categorical data, but these techniques are relatively inflexible, and require a complicated, specialized sampling process. In this paper, we introduce a novel method for parameter estimation that uses a log-linear regression to approximate the behaviour of HDPs. As a linear model, our method is remarkably flexible and simple to interpret, and can leverage the vast literature on learning linear models. Our experiments show that our method can outperform HDP smoothing while being orders of magnitude faster, remaining competitive with random forests on categorical data.
title Efficient Parameter Estimation for Bayesian Network Classifiers using Hierarchical Linear Smoothing
topic Machine Learning
url https://arxiv.org/abs/2505.23320