Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Hui-Mean, Foo, Chang, Yuan-chin Ivan
Format:	Preprint
Published:	2026
Subjects:	Computation Machine Learning 62
Online Access:	https://arxiv.org/abs/2601.00615
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866918269385965568
author	Hui-Mean, Foo Chang, Yuan-chin Ivan
author_facet	Hui-Mean, Foo Chang, Yuan-chin Ivan
contents	Modern optimization problems in scientific and engineering domains often rely on expensive black-box evaluations, such as those arising in physical simulations or deep learning pipelines, where gradient information is unavailable or unreliable. In these settings, conventional optimization methods quickly become impractical due to prohibitive computational costs and poor scalability. We propose ALMAB-DC, a unified and modular framework for scalable black-box optimization that integrates active learning, multi-armed bandits, and distributed computing, with optional GPU acceleration. The framework leverages surrogate modeling and information-theoretic acquisition functions to guide informative sample selection, while bandit-based controllers dynamically allocate computational resources across candidate evaluations in a statistically principled manner. These decisions are executed asynchronously within a distributed multi-agent system, enabling high-throughput parallel evaluation. We establish theoretical regret bounds for both UCB-based and Thompson-sampling-based variants and develop a scalability analysis grounded in Amdahl's and Gustafson's laws. Empirical results across synthetic benchmarks, reinforcement learning tasks, and scientific simulation problems demonstrate that ALMAB-DC consistently outperforms state-of-the-art black-box optimizers. By design, ALMAB-DC is modular, uncertainty-aware, and extensible, making it particularly well suited for high-dimensional, resource-intensive optimization challenges.
format	Preprint
id	arxiv_https___arxiv_org_abs_2601_00615
institution	arXiv
publishDate	2026
record_format	arxiv
spellingShingle	Integrating Multi-Armed Bandit, Active Learning, and Distributed Computing for Scalable Optimization Hui-Mean, Foo Chang, Yuan-chin Ivan Computation Machine Learning 62 Modern optimization problems in scientific and engineering domains often rely on expensive black-box evaluations, such as those arising in physical simulations or deep learning pipelines, where gradient information is unavailable or unreliable. In these settings, conventional optimization methods quickly become impractical due to prohibitive computational costs and poor scalability. We propose ALMAB-DC, a unified and modular framework for scalable black-box optimization that integrates active learning, multi-armed bandits, and distributed computing, with optional GPU acceleration. The framework leverages surrogate modeling and information-theoretic acquisition functions to guide informative sample selection, while bandit-based controllers dynamically allocate computational resources across candidate evaluations in a statistically principled manner. These decisions are executed asynchronously within a distributed multi-agent system, enabling high-throughput parallel evaluation. We establish theoretical regret bounds for both UCB-based and Thompson-sampling-based variants and develop a scalability analysis grounded in Amdahl's and Gustafson's laws. Empirical results across synthetic benchmarks, reinforcement learning tasks, and scientific simulation problems demonstrate that ALMAB-DC consistently outperforms state-of-the-art black-box optimizers. By design, ALMAB-DC is modular, uncertainty-aware, and extensible, making it particularly well suited for high-dimensional, resource-intensive optimization challenges.
title	Integrating Multi-Armed Bandit, Active Learning, and Distributed Computing for Scalable Optimization
topic	Computation Machine Learning 62
url	https://arxiv.org/abs/2601.00615

Similar Items