| Main Authors: | Tang, Xiaohang, Marques, Afonso, Kamalaruban, Parameswaran, Bogunovic, Ilija |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Online Access: | https://arxiv.org/abs/2407.18414 |
Similar Items
wd1: Weighted Policy Optimization for Reasoning in Diffusion Language Models
by: Tang, Xiaohang, et al.
Published: (2025)
Robust Multi-Objective Controlled Decoding of Large Language Models
by: Son, Seongho, et al.
Published: (2025)
RSPO: Regularized Self-Play Alignment of Large Language Models
by: Tang, Xiaohang, et al.
Published: (2025)
LLM-WikiRace Benchmark: How Far Can LLMs Plan over Real-World Knowledge Graphs?
by: Ziomek, Juliusz, et al.
Published: (2026)
Proximal Curriculum with Task Correlations for Deep Reinforcement Learning
by: Tzannetos, Georgios, et al.
Published: (2024)
GDSD: Reinforcement Learning as Guided Denoiser Self-Distillation for Diffusion Language Models
by: Tang, Xiaohang, et al.
Published: (2026)
Corruption Robust Offline Reinforcement Learning with Human Feedback
by: Mandal, Debmalya, et al.
Published: (2024)
Emergent Bias and Fairness in Multi-Agent Decision Systems
by: Madigan, Maeve, et al.
Published: (2025)
Distributionally Robust Model-based Reinforcement Learning with Large State Spaces
by: Ramesh, Shyam Sundhar, et al.
Published: (2023)
Learning Personalized Decision Support Policies
by: Bhatt, Umang, et al.
Published: (2023)
Synthetic Data is Sufficient for Zero-Shot Visual Generalization from Offline Data
by: Güzel, Ahmet H., et al.
Published: (2025)
PROWL: Prioritized Regret-Driven Optimization for World Model Learning
by: Güzel, Ahmet H., et al.
Published: (2026)
Robust Decision Aggregation with Adversarial Experts
by: Guo, Yongkang, et al.
Published: (2024)
Robust Bayesian Optimisation with Unbounded Corruptions
by: Ezzerg, Abdelhamid, et al.
Published: (2025)
Imagined Autocurricula
by: Güzel, Ahmet H., et al.
Published: (2025)
This Is Your Doge, If It Please You: Exploring Deception and Robustness in Mixture of LLMs
by: Wolf, Lorenz, et al.
Published: (2025)
Sample Efficient Preference Alignment in LLMs via Active Exploration
by: Mehta, Viraj, et al.
Published: (2023)
Multi-Task GRPO: Reliable LLM Reasoning Across Tasks
by: Ramesh, Shyam Sundhar, et al.
Published: (2026)
Informativeness of Reward Functions in Reinforcement Learning
by: Devidze, Rati, et al.
Published: (2024)
Curriculum Design for Trajectory-Constrained Agent: Compressing Chain-of-Thought Tokens in LLMs
by: Tzannetos, Georgios, et al.
Published: (2025)
REDUCR: Robust Data Downsampling Using Class Priority Reweighting
by: Bankes, William, et al.
Published: (2023)
Robustness Tokens: Towards Adversarial Robustness of Transformers
by: Pulfer, Brian, et al.
Published: (2025)
Robust Lagrangian and Adversarial Policy Gradient for Robust Constrained Markov Decision Processes
by: Bossens, David M.
Published: (2023)
Robustness-enhanced Uplift Modeling with Adversarial Feature Desensitization
by: Sun, Zexu, et al.
Published: (2023)
Sim2Act: Robust Simulation-to-Decision Learning via Adversarial Calibration and Group-Relative Perturbation
by: Cao, Hongyu, et al.
Published: (2026)
Investigating the Impact of Quantization on Adversarial Robustness
by: Li, Qun, et al.
Published: (2024)
Sample-efficient Bayesian Optimisation Using Known Invariances
by: Brown, Theodore, et al.
Published: (2024)
Adversarial Preference Learning for Robust LLM Alignment
by: Wang, Yuanfu, et al.
Published: (2025)
Explainable Transformer-Based Email Phishing Classification with Adversarial Robustness
by: P, Sajad U
Published: (2025)
Adversarial Examples Might be Avoidable: The Role of Data Concentration in Adversarial Robustness
by: Pal, Ambar, et al.
Published: (2023)
How Worst-Case Are Adversarial Attacks? Linking Adversarial and Perturbation Robustness
by: Rossolini, Giulio
Published: (2026)
Beyond the Known: Decision Making with Counterfactual Reasoning Decision Transformer
by: Nguyen, Minh Hoang, et al.
Published: (2025)
Maintaining Adversarial Robustness in Continuous Learning
by: Ru, Xiaolei, et al.
Published: (2024)
Adversarial Robustness Overestimation and Instability in TRADES
by: Li, Jonathan Weiping, et al.
Published: (2024)
Adversarial Diffusion for Robust Reinforcement Learning
by: Foffano, Daniele, et al.
Published: (2025)
Algorithms for Adversarially Robust Deep Learning
by: Robey, Alexander
Published: (2025)
Bridging Symmetry and Robustness: On the Role of Equivariance in Enhancing Adversarial Robustness
by: Wang, Longwei, et al.
Published: (2025)
Decision Transformer vs. Decision Mamba: Analysing the Complexity of Sequential Decision Making in Atari Games
by: Yan, Ke
Published: (2024)
Decision Predicate Graphs: Enhancing Interpretability in Tree Ensembles
by: Arrighi, Leonardo, et al.
Published: (2024)
Optimistic Regret Bounds for Online Learning in Adversarial Markov Decision Processes
by: Moon, Sang Bin, et al.
Published: (2024)