| Main Authors: | Pedley, James, Etheridge, Benjamin, Roberts, Stephen J., Quinzan, Francesco |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2510.12939 |
Similar Items
Robust Reinforcement Learning from Human Feedback for Large Language Models Fine-Tuning
by: Ye, Kai, et al.
Published: (2025)
Model Merging by Output-Space Projection
by: Evans, Bethan, et al.
Published: (2026)
Learning Decision Policies with Instrumental Variables through Double Machine Learning
by: Shao, Daqian, et al.
Published: (2024)
Double Machine Learning for Conditional Moment Restrictions: IV Regression, Proximal Causal Learning and Beyond
by: Shao, Daqian, et al.
Published: (2025)
Learning Counterfactually Invariant Predictors
by: Quinzan, Francesco, et al.
Published: (2022)
Double Machine Learning Based Structure Identification from Temporal Data
by: Angelis, Emmanouil, et al.
Published: (2023)
Detecting Changes in Causal Dependence with Kernels and Copulas
by: Gavioli-Akilagun, Shakeel, et al.
Published: (2026)
Doubly Robust Alignment for Large Language Models
by: Xu, Erhan, et al.
Published: (2025)
AI Alignment in Medical Imaging: Unveiling Hidden Biases Through Counterfactual Analysis
by: Ma, Haroui, et al.
Published: (2025)
Evolving Robustness--Exploration Trade-off in Online Reinforcement Learning via Quantile Bayesian Risk MDPs
by: Song, Meichen, et al.
Published: (2026)
Enhancing Certifiable Semantic Robustness via Robust Pruning of Deep Neural Networks
by: Hu, Hanjiang, et al.
Published: (2025)
To Distill or Decide? Understanding the Algorithmic Trade-off in Partially Observable Reinforcement Learning
by: Song, Yuda, et al.
Published: (2025)
Learning to Orchestrate Agents under Uncertainty
by: Oliver, Mary Chriselda Antony, et al.
Published: (2026)
Where Do Reasoning Models Refuse?
by: Yamaguchi, Kureha, et al.
Published: (2025)
CAMP in the Odyssey: Provably Robust Reinforcement Learning with Certified Radius Maximization
by: Wang, Derui, et al.
Published: (2025)
BASIS: Batchwise Advantage Estimation from Single-Rollout Information Sharing for LLM Reasoning
by: Gong, Shijin, et al.
Published: (2026)
Representation Invariance and Allocation: When Subgroup Balance Matters
by: Alloula, Anissa, et al.
Published: (2025)
Robustness in Text-Attributed Graph Learning: Insights, Trade-offs, and New Defenses
by: Lei, Runlin, et al.
Published: (2025)
Navigating Trade-offs: Policy Summarization for Multi-Objective Reinforcement Learning
by: Osika, Zuzanna, et al.
Published: (2024)
On the Trade-offs between Adversarial Robustness and Actionable Explanations
by: Krishna, Satyapriya, et al.
Published: (2023)
Deep Learning for Options Trading: An End-To-End Approach
by: Tan, Wee Ling, et al.
Published: (2024)
A Fundamental Accuracy--Robustness Trade-off in Regression and Classification
by: Bahmani, Sohail
Published: (2024)
Architecture Selection via the Trade-off Between Accuracy and Robustness
by: Deng, Zhun, et al.
Published: (2019)
Multi-level Certified Defense Against Poisoning Attacks in Offline Reinforcement Learning
by: Liu, Shijie, et al.
Published: (2025)
DeePM: Regime-Robust Deep Learning for Systematic Macro Portfolio Management
by: Wood, Kieran, et al.
Published: (2026)
RS-Reg: Probabilistic and Robust Certified Regression Through Randomized Smoothing
by: Rekavandi, Aref Miri, et al.
Published: (2024)
Learning Better Certified Models from Empirically-Robust Teachers
by: De Palma, Alessandro
Published: (2026)
Certifiably-Robust Federated Adversarial Learning via Randomized Smoothing
by: Chen, Cheng, et al.
Published: (2021)
Position: Certified Robustness Does Not (Yet) Imply Model Security
by: Cullen, Andrew C., et al.
Published: (2025)
A Recipe for Improved Certifiable Robustness
by: Hu, Kai, et al.
Published: (2023)
Tackling Length Inflation Without Trade-offs: Group Relative Reward Rescaling for Reinforcement Learning
by: Li, Zichao, et al.
Published: (2026)
On the Trade-off between Flatness and Optimization in Distributed Learning
by: Cao, Ying, et al.
Published: (2024)
LightningRL: Breaking the Accuracy-Parallelism Trade-off of Block-wise dLLMs via Reinforcement Learning
by: Hu, Yanzhe, et al.
Published: (2026)
Graph Representational Learning: When Does More Expressivity Hurt Generalization?
by: Maskey, Sohir, et al.
Published: (2025)
When More Experts Hurt: Underfitting in Multi-Expert Learning to Defer
by: Liu, Shuqi, et al.
Published: (2026)
Certifiably Robust Encoding Schemes
by: Saxena, Aman, et al.
Published: (2024)
Conformal Linguistic Calibration: Trading-off between Factuality and Specificity
by: Jiang, Zhengping, et al.
Published: (2025)
Safety-Oriented Pruning and Interpretation of Reinforcement Learning Policies
by: Gross, Dennis, et al.
Published: (2024)
Robust Shielding for Safe Reinforcement Learning
by: Court, Edwin Hamel-De le, et al.
Published: (2026)
Robust off-policy Reinforcement Learning via Soft Constrained Adversary
by: Nakanishi, Kosuke, et al.
Published: (2024)