Saved in:
Bibliographic Details
Main Authors: Sato, Haru-Tada, Matsuzaki, Fuka, Takahashi, Jun-ichiro
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2504.17685
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866912778049028096
author Sato, Haru-Tada
Matsuzaki, Fuka
Takahashi, Jun-ichiro
author_facet Sato, Haru-Tada
Matsuzaki, Fuka
Takahashi, Jun-ichiro
contents This study explores the potential of small language model(SLM) ensembles to achieve accuracy comparable to proprietary large language models (LLMs). We propose Ensemble Bayesian Inference (EBI), a novel approach that applies Bayesian estimation to combine judgments from multiple SLMs, allowing them to exceed the performance limitations of individual models. Our experiments on diverse tasks(aptitude assessments and consumer profile analysis in both Japanese and English) demonstrate EBI's effectiveness. Notably, we analyze cases where incorporating models with negative Lift values into ensembles improves overall performance, and we examine the method's efficacy across different languages. These findings suggest new possibilities for constructing high-performance AI systems with limited computational resources and for effectively utilizing models with individually lower performance. Building on existing research on LLM performance evaluation, ensemble methods, and open-source LLM utilization, we discuss the novelty and significance of our approach.
format Preprint
id arxiv_https___arxiv_org_abs_2504_17685
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle Ensemble Bayesian Inference: Leveraging Small Language Models to Achieve LLM-level Accuracy in Profile Matching Tasks
Sato, Haru-Tada
Matsuzaki, Fuka
Takahashi, Jun-ichiro
Computation and Language
Artificial Intelligence
This study explores the potential of small language model(SLM) ensembles to achieve accuracy comparable to proprietary large language models (LLMs). We propose Ensemble Bayesian Inference (EBI), a novel approach that applies Bayesian estimation to combine judgments from multiple SLMs, allowing them to exceed the performance limitations of individual models. Our experiments on diverse tasks(aptitude assessments and consumer profile analysis in both Japanese and English) demonstrate EBI's effectiveness. Notably, we analyze cases where incorporating models with negative Lift values into ensembles improves overall performance, and we examine the method's efficacy across different languages. These findings suggest new possibilities for constructing high-performance AI systems with limited computational resources and for effectively utilizing models with individually lower performance. Building on existing research on LLM performance evaluation, ensemble methods, and open-source LLM utilization, we discuss the novelty and significance of our approach.
title Ensemble Bayesian Inference: Leveraging Small Language Models to Achieve LLM-level Accuracy in Profile Matching Tasks
topic Computation and Language
Artificial Intelligence
url https://arxiv.org/abs/2504.17685