Saved in:
Bibliographic Details
Main Authors: Hui-Mean, Foo, Chang, Yuan-chin Ivan
Format: Preprint
Published: 2026
Subjects:
Online Access:https://arxiv.org/abs/2601.00615
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866918269385965568
author Hui-Mean, Foo
Chang, Yuan-chin Ivan
author_facet Hui-Mean, Foo
Chang, Yuan-chin Ivan
contents Modern optimization problems in scientific and engineering domains often rely on expensive black-box evaluations, such as those arising in physical simulations or deep learning pipelines, where gradient information is unavailable or unreliable. In these settings, conventional optimization methods quickly become impractical due to prohibitive computational costs and poor scalability. We propose ALMAB-DC, a unified and modular framework for scalable black-box optimization that integrates active learning, multi-armed bandits, and distributed computing, with optional GPU acceleration. The framework leverages surrogate modeling and information-theoretic acquisition functions to guide informative sample selection, while bandit-based controllers dynamically allocate computational resources across candidate evaluations in a statistically principled manner. These decisions are executed asynchronously within a distributed multi-agent system, enabling high-throughput parallel evaluation. We establish theoretical regret bounds for both UCB-based and Thompson-sampling-based variants and develop a scalability analysis grounded in Amdahl's and Gustafson's laws. Empirical results across synthetic benchmarks, reinforcement learning tasks, and scientific simulation problems demonstrate that ALMAB-DC consistently outperforms state-of-the-art black-box optimizers. By design, ALMAB-DC is modular, uncertainty-aware, and extensible, making it particularly well suited for high-dimensional, resource-intensive optimization challenges.
format Preprint
id arxiv_https___arxiv_org_abs_2601_00615
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle Integrating Multi-Armed Bandit, Active Learning, and Distributed Computing for Scalable Optimization
Hui-Mean, Foo
Chang, Yuan-chin Ivan
Computation
Machine Learning
62
Modern optimization problems in scientific and engineering domains often rely on expensive black-box evaluations, such as those arising in physical simulations or deep learning pipelines, where gradient information is unavailable or unreliable. In these settings, conventional optimization methods quickly become impractical due to prohibitive computational costs and poor scalability. We propose ALMAB-DC, a unified and modular framework for scalable black-box optimization that integrates active learning, multi-armed bandits, and distributed computing, with optional GPU acceleration. The framework leverages surrogate modeling and information-theoretic acquisition functions to guide informative sample selection, while bandit-based controllers dynamically allocate computational resources across candidate evaluations in a statistically principled manner. These decisions are executed asynchronously within a distributed multi-agent system, enabling high-throughput parallel evaluation. We establish theoretical regret bounds for both UCB-based and Thompson-sampling-based variants and develop a scalability analysis grounded in Amdahl's and Gustafson's laws. Empirical results across synthetic benchmarks, reinforcement learning tasks, and scientific simulation problems demonstrate that ALMAB-DC consistently outperforms state-of-the-art black-box optimizers. By design, ALMAB-DC is modular, uncertainty-aware, and extensible, making it particularly well suited for high-dimensional, resource-intensive optimization challenges.
title Integrating Multi-Armed Bandit, Active Learning, and Distributed Computing for Scalable Optimization
topic Computation
Machine Learning
62
url https://arxiv.org/abs/2601.00615