:: Library Catalog

Bibliographic Details
Main Authors:	Tang, Xiaohang; Marques, Afonso; Kamalaruban, Parameswaran; Bogunovic, Ilija
Format:	Preprint
Published:	2024
Subjects:	Machine Learning; Artificial Intelligence
Online Access:	https://arxiv.org/abs/2407.18414

Similar Items

wd1: Weighted Policy Optimization for Reasoning in Diffusion Language Models
by: Tang, Xiaohang, et al.
Published: (2025)

Robust Multi-Objective Controlled Decoding of Large Language Models
by: Son, Seongho, et al.
Published: (2025)

RSPO: Regularized Self-Play Alignment of Large Language Models
by: Tang, Xiaohang, et al.
Published: (2025)

LLM-WikiRace Benchmark: How Far Can LLMs Plan over Real-World Knowledge Graphs?
by: Ziomek, Juliusz, et al.
Published: (2026)

Proximal Curriculum with Task Correlations for Deep Reinforcement Learning
by: Tzannetos, Georgios, et al.
Published: (2024)

GDSD: Reinforcement Learning as Guided Denoiser Self-Distillation for Diffusion Language Models
by: Tang, Xiaohang, et al.
Published: (2026)

Corruption Robust Offline Reinforcement Learning with Human Feedback
by: Mandal, Debmalya, et al.
Published: (2024)

Emergent Bias and Fairness in Multi-Agent Decision Systems
by: Madigan, Maeve, et al.
Published: (2025)

Distributionally Robust Model-based Reinforcement Learning with Large State Spaces
by: Ramesh, Shyam Sundhar, et al.
Published: (2023)

Learning Personalized Decision Support Policies
by: Bhatt, Umang, et al.
Published: (2023)

Synthetic Data is Sufficient for Zero-Shot Visual Generalization from Offline Data
by: Güzel, Ahmet H., et al.
Published: (2025)

PROWL: Prioritized Regret-Driven Optimization for World Model Learning
by: Güzel, Ahmet H., et al.
Published: (2026)

Robust Decision Aggregation with Adversarial Experts
by: Guo, Yongkang, et al.
Published: (2024)

Robust Bayesian Optimisation with Unbounded Corruptions
by: Ezzerg, Abdelhamid, et al.
Published: (2025)

Imagined Autocurricula
by: Güzel, Ahmet H., et al.
Published: (2025)

This Is Your Doge, If It Please You: Exploring Deception and Robustness in Mixture of LLMs
by: Wolf, Lorenz, et al.
Published: (2025)

Sample Efficient Preference Alignment in LLMs via Active Exploration
by: Mehta, Viraj, et al.
Published: (2023)

Multi-Task GRPO: Reliable LLM Reasoning Across Tasks
by: Ramesh, Shyam Sundhar, et al.
Published: (2026)

Informativeness of Reward Functions in Reinforcement Learning
by: Devidze, Rati, et al.
Published: (2024)

Curriculum Design for Trajectory-Constrained Agent: Compressing Chain-of-Thought Tokens in LLMs
by: Tzannetos, Georgios, et al.
Published: (2025)

REDUCR: Robust Data Downsampling Using Class Priority Reweighting
by: Bankes, William, et al.
Published: (2023)

Robustness Tokens: Towards Adversarial Robustness of Transformers
by: Pulfer, Brian, et al.
Published: (2025)

Robust Lagrangian and Adversarial Policy Gradient for Robust Constrained Markov Decision Processes
by: Bossens, David M.
Published: (2023)

Robustness-enhanced Uplift Modeling with Adversarial Feature Desensitization
by: Sun, Zexu, et al.
Published: (2023)

Sim2Act: Robust Simulation-to-Decision Learning via Adversarial Calibration and Group-Relative Perturbation
by: Cao, Hongyu, et al.
Published: (2026)

Investigating the Impact of Quantization on Adversarial Robustness
by: Li, Qun, et al.
Published: (2024)

Sample-efficient Bayesian Optimisation Using Known Invariances
by: Brown, Theodore, et al.
Published: (2024)

Adversarial Preference Learning for Robust LLM Alignment
by: Wang, Yuanfu, et al.
Published: (2025)

Explainable Transformer-Based Email Phishing Classification with Adversarial Robustness
by: P, Sajad U
Published: (2025)

Adversarial Examples Might be Avoidable: The Role of Data Concentration in Adversarial Robustness
by: Pal, Ambar, et al.
Published: (2023)

How Worst-Case Are Adversarial Attacks? Linking Adversarial and Perturbation Robustness
by: Rossolini, Giulio
Published: (2026)

Beyond the Known: Decision Making with Counterfactual Reasoning Decision Transformer
by: Nguyen, Minh Hoang, et al.
Published: (2025)

Maintaining Adversarial Robustness in Continuous Learning
by: Ru, Xiaolei, et al.
Published: (2024)

Adversarial Robustness Overestimation and Instability in TRADES
by: Li, Jonathan Weiping, et al.
Published: (2024)

Adversarial Diffusion for Robust Reinforcement Learning
by: Foffano, Daniele, et al.
Published: (2025)

Algorithms for Adversarially Robust Deep Learning
by: Robey, Alexander
Published: (2025)

Bridging Symmetry and Robustness: On the Role of Equivariance in Enhancing Adversarial Robustness
by: Wang, Longwei, et al.
Published: (2025)

Decision Transformer vs. Decision Mamba: Analysing the Complexity of Sequential Decision Making in Atari Games
by: Yan, Ke
Published: (2024)

Decision Predicate Graphs: Enhancing Interpretability in Tree Ensembles
by: Arrighi, Leonardo, et al.
Published: (2024)

Optimistic Regret Bounds for Online Learning in Adversarial Markov Decision Processes
by: Moon, Sang Bin, et al.
Published: (2024)