:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Cheng, Zhuoyu, Hatano, Kohei, Takimoto, Eiji
Format:	Preprint
Published:	2025
Subjects:	Machine Learning Artificial Intelligence
Online Access:	https://arxiv.org/abs/2505.20734
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Adversarial Bandit Optimization with Globally Bounded Perturbations to Linear Losses
by: Cheng, Zhuoyu, et al.
Published: (2026)

Multi-thresholding Good Arm Identification with Bandit Feedback
by: Jiang, Xuanke, et al.
Published: (2025)

Spectral bandits
by: Kocák, Tomáš, et al.
Published: (2026)

Functional multi-armed bandit and the best function identification problems
by: Dorn, Yuriy, et al.
Published: (2025)

Linear bandits with polylogarithmic minimax regret
by: Lumbreras, Josep, et al.
Published: (2024)

Prior-informed optimization of treatment recommendation via bandit algorithms trained on large language model-processed historical records
by: Nessari, Saman, et al.
Published: (2025)

Reinforcement learning with combinatorial actions for coupled restless bandits
by: Xu, Lily, et al.
Published: (2025)

Use Your INSTINCT: INSTruction optimization for LLMs usIng Neural bandits Coupled with Transformers
by: Lin, Xiaoqiang, et al.
Published: (2023)

Finite time analysis of temporal difference learning with linear function approximation: Tail averaging and regularisation
by: Patil, Gandharv, et al.
Published: (2022)

Softmax gradient policy for variance minimization and risk-averse multi armed bandits
by: Turinici, Gabriel
Published: (2026)

On the optimal regret of collaborative personalized linear bandits
by: Huang, Bruce, et al.
Published: (2025)

Reward-Punishment Reinforcement Learning with Maximum Entropy
by: Wang, Jiexin, et al.
Published: (2024)

On the Rate of Convergence of GD in Non-linear Neural Networks: An Adversarial Robustness Perspective
by: Smorodinsky, Guy, et al.
Published: (2026)

Understanding the theoretical properties of projected Bellman equation, linear Q-learning, and approximate value iteration
by: Lim, Han-Dong, et al.
Published: (2025)

Adversarial Graph Disentanglement
by: Zheng, Shuai, et al.
Published: (2021)

LLM-ABBA: Understanding time series via symbolic approximation
by: Chen, Xinye, et al.
Published: (2024)

Adversarial Robustness Overestimation and Instability in TRADES
by: Li, Jonathan Weiping, et al.
Published: (2024)

V2X-VLM: End-to-End V2X Cooperative Autonomous Driving Through Large Vision-Language Models
by: You, Junwei, et al.
Published: (2024)

Frugality in second-order optimization: floating-point approximations for Newton's method
by: Carrino, Giuseppe, et al.
Published: (2025)

Generative Adversarial Reasoner: Enhancing LLM Reasoning with Adversarial Reinforcement Learning
by: Liu, Qihao, et al.
Published: (2025)

On Minimizing Adversarial Counterfactual Error in Adversarial RL
by: Belaire, Roman, et al.
Published: (2024)

MemLoss: Enhancing Adversarial Training with Recycling Adversarial Examples
by: Mahdi, Soroush, et al.
Published: (2025)

Towards Interpretable Adversarial Examples via Sparse Adversarial Attack
by: Lin, Fudong, et al.
Published: (2025)

EgoSurgery-Phase: A Dataset of Surgical Phase Recognition from Egocentric Open Surgery Videos
by: Fujii, Ryo, et al.
Published: (2024)

Adversarial Examples Might be Avoidable: The Role of Data Concentration in Adversarial Robustness
by: Pal, Ambar, et al.
Published: (2023)

How Worst-Case Are Adversarial Attacks? Linking Adversarial and Perturbation Robustness
by: Rossolini, Giulio
Published: (2026)

Discriminative Adversarial Unlearning
by: Sharma, Rohan, et al.
Published: (2024)

Planning to avoid ambiguous states through Gaussian approximations to non-linear sensors in active inference agents
by: Kouw, Wouter M.
Published: (2024)

FairSTG: Countering performance heterogeneity via collaborative sample-level optimization
by: Lin, Gengyu, et al.
Published: (2024)

Adversarial Reasoning at Jailbreaking Time
by: Sabbaghi, Mahdi, et al.
Published: (2025)

Conflict-Aware Adversarial Training
by: Xue, Zhiyu, et al.
Published: (2024)

Adversarially Robust Decision Transformer
by: Tang, Xiaohang, et al.
Published: (2024)

Adversarial Training: A Survey
by: Zhao, Mengnan, et al.
Published: (2024)

Adversarial Attacks on Hyperbolic Networks
by: van Spengler, Max, et al.
Published: (2024)

A Closer Look at Adversarial Suffix Learning for Jailbreaking LLMs: Augmented Adversarial Trigger Learning
by: Wang, Zhe, et al.
Published: (2025)

Multi-Worker Selection based Distributed Swarm Learning for Edge IoT with Non-i.i.d. Data
by: Yao, Zhuoyu, et al.
Published: (2025)

Memory Determines Learning Direction: A Theory of Gradient-Based Optimization in State Space Models
by: Guan, JingChuan, et al.
Published: (2025)

Adversarial Diffusion for Robust Reinforcement Learning
by: Foffano, Daniele, et al.
Published: (2025)

Adversarial Training for Process Reward Models
by: Juneja, Gurusha, et al.
Published: (2025)

Algorithms for Adversarially Robust Deep Learning
by: Robey, Alexander
Published: (2025)