Guardado en:
| Autores principales: | Han, Chuying, Feng, Yasong, Wang, Tianyu |
|---|---|
| Formato: | Preprint |
| Publicado: |
2023
|
| Materias: | |
| Acceso en línea: | https://arxiv.org/abs/2305.11509 |
| Etiquetas: |
Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
|
Ejemplares similares
Breaking the KV Cache Bottleneck: Fan Duality Model Achieves O(1) Decode Memory with Superior Associative Recall
por: Fan, Yasong
Publicado: (2026)
por: Fan, Yasong
Publicado: (2026)
Multiplayer Information Asymmetric Bandits in Metric Spaces
por: Chang, William, et al.
Publicado: (2025)
por: Chang, William, et al.
Publicado: (2025)
The Minimal Search Space for Conditional Causal Bandits
por: Simoes, Francisco N. F. Q., et al.
Publicado: (2025)
por: Simoes, Francisco N. F. Q., et al.
Publicado: (2025)
Robust Batched Bandits
por: Guo, Yunwen, et al.
Publicado: (2025)
por: Guo, Yunwen, et al.
Publicado: (2025)
Batched Stochastic Bandit for Nondegenerate Functions
por: Liu, Yu, et al.
Publicado: (2024)
por: Liu, Yu, et al.
Publicado: (2024)
Random-Effects Algorithm for Random Objects in Metric Spaces
por: Matabuena, Marcos, et al.
Publicado: (2026)
por: Matabuena, Marcos, et al.
Publicado: (2026)
Transformers in the Dark: Navigating Unknown Search Spaces via Bandit Feedback
por: Kim, Jungtaek, et al.
Publicado: (2026)
por: Kim, Jungtaek, et al.
Publicado: (2026)
Transfer Learning for Bayesian Optimization on Heterogeneous Search Spaces
por: Fan, Zhou, et al.
Publicado: (2023)
por: Fan, Zhou, et al.
Publicado: (2023)
Offline Local Search for Online Stochastic Bandits
por: Benadè, Gerdus, et al.
Publicado: (2026)
por: Benadè, Gerdus, et al.
Publicado: (2026)
Linear Bandits beyond Inner Product Spaces, the case of Bandit Optimal Transport
por: Croissant, Lorenzo
Publicado: (2025)
por: Croissant, Lorenzo
Publicado: (2025)
Tree Search-Based Evolutionary Bandits for Protein Sequence Optimization
por: Qiu, Jiahao, et al.
Publicado: (2024)
por: Qiu, Jiahao, et al.
Publicado: (2024)
Multi-Objective Multi-Agent Bandits: From Learning Efficiency to Fairness Optimization
por: Wang, John, et al.
Publicado: (2026)
por: Wang, John, et al.
Publicado: (2026)
The Bandit Whisperer: Communication Learning for Restless Bandits
por: Zhao, Yunfan, et al.
Publicado: (2024)
por: Zhao, Yunfan, et al.
Publicado: (2024)
Infinity Search: Approximate Vector Search with Projections on q-Metric Spaces
por: Pariente, Antonio, et al.
Publicado: (2025)
por: Pariente, Antonio, et al.
Publicado: (2025)
BanditSpec: Adaptive Speculative Decoding via Bandit Algorithms
por: Hou, Yunlong, et al.
Publicado: (2025)
por: Hou, Yunlong, et al.
Publicado: (2025)
Representative Action Selection for Large Action Space: From Bandits to MDPs
por: Zhou, Quan, et al.
Publicado: (2025)
por: Zhou, Quan, et al.
Publicado: (2025)
On Generation in Metric Spaces
por: Li, Jiaxun, et al.
Publicado: (2026)
por: Li, Jiaxun, et al.
Publicado: (2026)
An Entropic Metric for Measuring Calibration of Machine Learning Models
por: Sumler, Daniel James, et al.
Publicado: (2025)
por: Sumler, Daniel James, et al.
Publicado: (2025)
Direction-Aware Offline-to-Online Learning in Linear Contextual Bandits
por: Han, Zean, et al.
Publicado: (2026)
por: Han, Zean, et al.
Publicado: (2026)
Random Coordinate Descent on the Wasserstein Space of Probability Measures
por: Xu, Yewei, et al.
Publicado: (2026)
por: Xu, Yewei, et al.
Publicado: (2026)
On the Hardness of Bandit Learning
por: Brukhim, Nataly, et al.
Publicado: (2025)
por: Brukhim, Nataly, et al.
Publicado: (2025)
Multi-Objective Neural Architecture Search by Learning Search Space Partitions
por: Zhao, Yiyang, et al.
Publicado: (2024)
por: Zhao, Yiyang, et al.
Publicado: (2024)
Learning Weakly Convex Sets in Metric Spaces
por: Stadtländer, Eike, et al.
Publicado: (2021)
por: Stadtländer, Eike, et al.
Publicado: (2021)
Random Normed k-Means: A Paradigm-Shift in Clustering within Probabilistic Metric Spaces
por: Hemdanou, Abderrafik Laakel, et al.
Publicado: (2025)
por: Hemdanou, Abderrafik Laakel, et al.
Publicado: (2025)
Optimal Kernel Quantile Learning with Random Features
por: Wang, Caixing, et al.
Publicado: (2024)
por: Wang, Caixing, et al.
Publicado: (2024)
Learned Random Label Predictions as a Neural Network Complexity Metric
por: Becker, Marlon, et al.
Publicado: (2024)
por: Becker, Marlon, et al.
Publicado: (2024)
Combinatorial Multivariant Multi-Armed Bandits with Applications to Episodic Reinforcement Learning and Beyond
por: Liu, Xutong, et al.
Publicado: (2024)
por: Liu, Xutong, et al.
Publicado: (2024)
A General Recipe for the Analysis of Randomized Multi-Armed Bandit Algorithms
por: Baudry, Dorian, et al.
Publicado: (2023)
por: Baudry, Dorian, et al.
Publicado: (2023)
Distributed Multi-Agent Bandits Over Erdős-Rényi Random Networks
por: Liu, Jingyuan, et al.
Publicado: (2025)
por: Liu, Jingyuan, et al.
Publicado: (2025)
Graph Feedback Bandits with Similar Arms
por: Qi, Han, et al.
Publicado: (2024)
por: Qi, Han, et al.
Publicado: (2024)
Geometry of Uncertainty: Learning Metric Spaces for Multimodal State Estimation in RL
por: Reichlin, Alfredo, et al.
Publicado: (2026)
por: Reichlin, Alfredo, et al.
Publicado: (2026)
Offline Learning for Combinatorial Multi-armed Bandits
por: Liu, Xutong, et al.
Publicado: (2025)
por: Liu, Xutong, et al.
Publicado: (2025)
Compute Allocation in Evolutionary Search: From Depth-Breadth to Multi-Armed Bandits
por: Xing, Sixue, et al.
Publicado: (2026)
por: Xing, Sixue, et al.
Publicado: (2026)
Observation-Augmented Contextual Multi-Armed Bandits for Robotic Search and Exploration
por: Wakayama, Shohei, et al.
Publicado: (2023)
por: Wakayama, Shohei, et al.
Publicado: (2023)
Differential Privacy in Kernelized Contextual Bandits via Random Projections
por: Pavlovic, Nikola, et al.
Publicado: (2025)
por: Pavlovic, Nikola, et al.
Publicado: (2025)
Learning in Position-Aware Multinomial Logit Bandits: From Multiplicative to General Position Effects
por: Chen, Xi, et al.
Publicado: (2026)
por: Chen, Xi, et al.
Publicado: (2026)
Uniform Last-Iterate Guarantee for Bandits and Reinforcement Learning
por: Liu, Junyan, et al.
Publicado: (2024)
por: Liu, Junyan, et al.
Publicado: (2024)
Random Search Neural Networks for Efficient and Expressive Graph Learning
por: Ito, Michael, et al.
Publicado: (2025)
por: Ito, Michael, et al.
Publicado: (2025)
Improving Diffusion Language Model Decoding through Joint Search in Generation Order and Token Space
por: Shen, Yangyi, et al.
Publicado: (2026)
por: Shen, Yangyi, et al.
Publicado: (2026)
Robust Decentralized Multi-armed Bandits: From Corruption-Resilience to Byzantine-Resilience
por: Hu, Zicheng, et al.
Publicado: (2025)
por: Hu, Zicheng, et al.
Publicado: (2025)
Ejemplares similares
-
Breaking the KV Cache Bottleneck: Fan Duality Model Achieves O(1) Decode Memory with Superior Associative Recall
por: Fan, Yasong
Publicado: (2026) -
Multiplayer Information Asymmetric Bandits in Metric Spaces
por: Chang, William, et al.
Publicado: (2025) -
The Minimal Search Space for Conditional Causal Bandits
por: Simoes, Francisco N. F. Q., et al.
Publicado: (2025) -
Robust Batched Bandits
por: Guo, Yunwen, et al.
Publicado: (2025) -
Batched Stochastic Bandit for Nondegenerate Functions
por: Liu, Yu, et al.
Publicado: (2024)