| Main Authors: | Cheng, Zhuoyu, Hatano, Kohei, Takimoto, Eiji |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2505.20734 |
Similar Items
Adversarial Bandit Optimization with Globally Bounded Perturbations to Linear Losses
by: Cheng, Zhuoyu, et al.
Published: (2026)
Multi-thresholding Good Arm Identification with Bandit Feedback
by: Jiang, Xuanke, et al.
Published: (2025)
Spectral bandits
by: Kocák, Tomáš, et al.
Published: (2026)
Functional multi-armed bandit and the best function identification problems
by: Dorn, Yuriy, et al.
Published: (2025)
Linear bandits with polylogarithmic minimax regret
by: Lumbreras, Josep, et al.
Published: (2024)
Prior-informed optimization of treatment recommendation via bandit algorithms trained on large language model-processed historical records
by: Nessari, Saman, et al.
Published: (2025)
Reinforcement learning with combinatorial actions for coupled restless bandits
by: Xu, Lily, et al.
Published: (2025)
Use Your INSTINCT: INSTruction optimization for LLMs usIng Neural bandits Coupled with Transformers
by: Lin, Xiaoqiang, et al.
Published: (2023)
Finite time analysis of temporal difference learning with linear function approximation: Tail averaging and regularisation
by: Patil, Gandharv, et al.
Published: (2022)
Softmax gradient policy for variance minimization and risk-averse multi armed bandits
by: Turinici, Gabriel
Published: (2026)
On the optimal regret of collaborative personalized linear bandits
by: Huang, Bruce, et al.
Published: (2025)
Reward-Punishment Reinforcement Learning with Maximum Entropy
by: Wang, Jiexin, et al.
Published: (2024)
On the Rate of Convergence of GD in Non-linear Neural Networks: An Adversarial Robustness Perspective
by: Smorodinsky, Guy, et al.
Published: (2026)
Understanding the theoretical properties of projected Bellman equation, linear Q-learning, and approximate value iteration
by: Lim, Han-Dong, et al.
Published: (2025)
Adversarial Graph Disentanglement
by: Zheng, Shuai, et al.
Published: (2021)
LLM-ABBA: Understanding time series via symbolic approximation
by: Chen, Xinye, et al.
Published: (2024)
Adversarial Robustness Overestimation and Instability in TRADES
by: Li, Jonathan Weiping, et al.
Published: (2024)
V2X-VLM: End-to-End V2X Cooperative Autonomous Driving Through Large Vision-Language Models
by: You, Junwei, et al.
Published: (2024)
Frugality in second-order optimization: floating-point approximations for Newton's method
by: Carrino, Giuseppe, et al.
Published: (2025)
Generative Adversarial Reasoner: Enhancing LLM Reasoning with Adversarial Reinforcement Learning
by: Liu, Qihao, et al.
Published: (2025)
On Minimizing Adversarial Counterfactual Error in Adversarial RL
by: Belaire, Roman, et al.
Published: (2024)
MemLoss: Enhancing Adversarial Training with Recycling Adversarial Examples
by: Mahdi, Soroush, et al.
Published: (2025)
Towards Interpretable Adversarial Examples via Sparse Adversarial Attack
by: Lin, Fudong, et al.
Published: (2025)
EgoSurgery-Phase: A Dataset of Surgical Phase Recognition from Egocentric Open Surgery Videos
by: Fujii, Ryo, et al.
Published: (2024)
Adversarial Examples Might be Avoidable: The Role of Data Concentration in Adversarial Robustness
by: Pal, Ambar, et al.
Published: (2023)
How Worst-Case Are Adversarial Attacks? Linking Adversarial and Perturbation Robustness
by: Rossolini, Giulio
Published: (2026)
Discriminative Adversarial Unlearning
by: Sharma, Rohan, et al.
Published: (2024)
Planning to avoid ambiguous states through Gaussian approximations to non-linear sensors in active inference agents
by: Kouw, Wouter M.
Published: (2024)
FairSTG: Countering performance heterogeneity via collaborative sample-level optimization
by: Lin, Gengyu, et al.
Published: (2024)
Adversarial Reasoning at Jailbreaking Time
by: Sabbaghi, Mahdi, et al.
Published: (2025)
Conflict-Aware Adversarial Training
by: Xue, Zhiyu, et al.
Published: (2024)
Adversarially Robust Decision Transformer
by: Tang, Xiaohang, et al.
Published: (2024)
Adversarial Training: A Survey
by: Zhao, Mengnan, et al.
Published: (2024)
Adversarial Attacks on Hyperbolic Networks
by: van Spengler, Max, et al.
Published: (2024)
A Closer Look at Adversarial Suffix Learning for Jailbreaking LLMs: Augmented Adversarial Trigger Learning
by: Wang, Zhe, et al.
Published: (2025)
Multi-Worker Selection based Distributed Swarm Learning for Edge IoT with Non-i.i.d. Data
by: Yao, Zhuoyu, et al.
Published: (2025)
Memory Determines Learning Direction: A Theory of Gradient-Based Optimization in State Space Models
by: Guan, JingChuan, et al.
Published: (2025)
Adversarial Diffusion for Robust Reinforcement Learning
by: Foffano, Daniele, et al.
Published: (2025)
Adversarial Training for Process Reward Models
by: Juneja, Gurusha, et al.
Published: (2025)
Algorithms for Adversarially Robust Deep Learning
by: Robey, Alexander
Published: (2025)