:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Pedley, James, Etheridge, Benjamin, Roberts, Stephen J., Quinzan, Francesco
Format:	Preprint
Published:	2025
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2510.12939
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Robust Reinforcement Learning from Human Feedback for Large Language Models Fine-Tuning
by: Ye, Kai, et al.
Published: (2025)

Model Merging by Output-Space Projection
by: Evans, Bethan, et al.
Published: (2026)

Learning Decision Policies with Instrumental Variables through Double Machine Learning
by: Shao, Daqian, et al.
Published: (2024)

Double Machine Learning for Conditional Moment Restrictions: IV Regression, Proximal Causal Learning and Beyond
by: Shao, Daqian, et al.
Published: (2025)

Learning Counterfactually Invariant Predictors
by: Quinzan, Francesco, et al.
Published: (2022)

Double Machine Learning Based Structure Identification from Temporal Data
by: Angelis, Emmanouil, et al.
Published: (2023)

Detecting Changes in Causal Dependence with Kernels and Copulas
by: Gavioli-Akilagun, Shakeel, et al.
Published: (2026)

Doubly Robust Alignment for Large Language Models
by: Xu, Erhan, et al.
Published: (2025)

AI Alignment in Medical Imaging: Unveiling Hidden Biases Through Counterfactual Analysis
by: Ma, Haroui, et al.
Published: (2025)

Evolving Robustness--Exploration Trade-off in Online Reinforcement Learning via Quantile Bayesian Risk MDPs
by: Song, Meichen, et al.
Published: (2026)

Enhancing Certifiable Semantic Robustness via Robust Pruning of Deep Neural Networks
by: Hu, Hanjiang, et al.
Published: (2025)

To Distill or Decide? Understanding the Algorithmic Trade-off in Partially Observable Reinforcement Learning
by: Song, Yuda, et al.
Published: (2025)

Learning to Orchestrate Agents under Uncertainty
by: Oliver, Mary Chriselda Antony, et al.
Published: (2026)

Where Do Reasoning Models Refuse?
by: Yamaguchi, Kureha, et al.
Published: (2025)

CAMP in the Odyssey: Provably Robust Reinforcement Learning with Certified Radius Maximization
by: Wang, Derui, et al.
Published: (2025)

BASIS: Batchwise Advantage Estimation from Single-Rollout Information Sharing for LLM Reasoning
by: Gong, Shijin, et al.
Published: (2026)

Representation Invariance and Allocation: When Subgroup Balance Matters
by: Alloula, Anissa, et al.
Published: (2025)

Robustness in Text-Attributed Graph Learning: Insights, Trade-offs, and New Defenses
by: Lei, Runlin, et al.
Published: (2025)

Navigating Trade-offs: Policy Summarization for Multi-Objective Reinforcement Learning
by: Osika, Zuzanna, et al.
Published: (2024)

On the Trade-offs between Adversarial Robustness and Actionable Explanations
by: Krishna, Satyapriya, et al.
Published: (2023)

Deep Learning for Options Trading: An End-To-End Approach
by: Tan, Wee Ling, et al.
Published: (2024)

A Fundamental Accuracy--Robustness Trade-off in Regression and Classification
by: Bahmani, Sohail
Published: (2024)

Architecture Selection via the Trade-off Between Accuracy and Robustness
by: Deng, Zhun, et al.
Published: (2019)

Multi-level Certified Defense Against Poisoning Attacks in Offline Reinforcement Learning
by: Liu, Shijie, et al.
Published: (2025)

DeePM: Regime-Robust Deep Learning for Systematic Macro Portfolio Management
by: Wood, Kieran, et al.
Published: (2026)

RS-Reg: Probabilistic and Robust Certified Regression Through Randomized Smoothing
by: Rekavandi, Aref Miri, et al.
Published: (2024)

Learning Better Certified Models from Empirically-Robust Teachers
by: De Palma, Alessandro
Published: (2026)

Certifiably-Robust Federated Adversarial Learning via Randomized Smoothing
by: Chen, Cheng, et al.
Published: (2021)

Position: Certified Robustness Does Not (Yet) Imply Model Security
by: Cullen, Andrew C., et al.
Published: (2025)

A Recipe for Improved Certifiable Robustness
by: Hu, Kai, et al.
Published: (2023)

Tackling Length Inflation Without Trade-offs: Group Relative Reward Rescaling for Reinforcement Learning
by: Li, Zichao, et al.
Published: (2026)

On the Trade-off between Flatness and Optimization in Distributed Learning
by: Cao, Ying, et al.
Published: (2024)

LightningRL: Breaking the Accuracy-Parallelism Trade-off of Block-wise dLLMs via Reinforcement Learning
by: Hu, Yanzhe, et al.
Published: (2026)

Graph Representational Learning: When Does More Expressivity Hurt Generalization?
by: Maskey, Sohir, et al.
Published: (2025)

When More Experts Hurt: Underfitting in Multi-Expert Learning to Defer
by: Liu, Shuqi, et al.
Published: (2026)

Certifiably Robust Encoding Schemes
by: Saxena, Aman, et al.
Published: (2024)

Conformal Linguistic Calibration: Trading-off between Factuality and Specificity
by: Jiang, Zhengping, et al.
Published: (2025)

Safety-Oriented Pruning and Interpretation of Reinforcement Learning Policies
by: Gross, Dennis, et al.
Published: (2024)

Robust Shielding for Safe Reinforcement Learning
by: Court, Edwin Hamel-De le, et al.
Published: (2026)

Robust off-policy Reinforcement Learning via Soft Constrained Adversary
by: Nakanishi, Kosuke, et al.
Published: (2024)