Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Pedley, James, Etheridge, Benjamin, Roberts, Stephen J., Quinzan, Francesco
Format:	Preprint
Published:	2025
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2510.12939
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866915555539156992
author	Pedley, James Etheridge, Benjamin Roberts, Stephen J. Quinzan, Francesco
author_facet	Pedley, James Etheridge, Benjamin Roberts, Stephen J. Quinzan, Francesco
contents	Reinforcement learning (RL) policies deployed in real-world environments must remain reliable under adversarial perturbations. At the same time, modern deep RL agents are heavily over-parameterized, raising costs and fragility concerns. While pruning has been shown to improve robustness in supervised learning, its role in adversarial RL remains poorly understood. We develop the first theoretical framework for certified robustness under pruning in state-adversarial Markov decision processes (SA-MDPs). For Gaussian and categorical policies with Lipschitz networks, we prove that element-wise pruning can only tighten certified robustness bounds; pruning never makes the policy less robust. Building on this, we derive a novel three-term regret decomposition that disentangles clean-task performance, pruning-induced performance loss, and robustness gains, exposing a fundamental performance--robustness frontier. Empirically, we evaluate magnitude and micro-pruning schedules on continuous-control benchmarks with strong policy-aware adversaries. Across tasks, pruning consistently uncovers reproducible ``sweet spots'' at moderate sparsity levels, where robustness improves substantially without harming - and sometimes even enhancing - clean performance. These results position pruning not merely as a compression tool but as a structural intervention for robust RL.
format	Preprint
id	arxiv_https___arxiv_org_abs_2510_12939
institution	arXiv
publishDate	2025
record_format	arxiv
spellingShingle	Pruning Cannot Hurt Robustness: Certified Trade-offs in Reinforcement Learning Pedley, James Etheridge, Benjamin Roberts, Stephen J. Quinzan, Francesco Machine Learning Reinforcement learning (RL) policies deployed in real-world environments must remain reliable under adversarial perturbations. At the same time, modern deep RL agents are heavily over-parameterized, raising costs and fragility concerns. While pruning has been shown to improve robustness in supervised learning, its role in adversarial RL remains poorly understood. We develop the first theoretical framework for certified robustness under pruning in state-adversarial Markov decision processes (SA-MDPs). For Gaussian and categorical policies with Lipschitz networks, we prove that element-wise pruning can only tighten certified robustness bounds; pruning never makes the policy less robust. Building on this, we derive a novel three-term regret decomposition that disentangles clean-task performance, pruning-induced performance loss, and robustness gains, exposing a fundamental performance--robustness frontier. Empirically, we evaluate magnitude and micro-pruning schedules on continuous-control benchmarks with strong policy-aware adversaries. Across tasks, pruning consistently uncovers reproducible ``sweet spots'' at moderate sparsity levels, where robustness improves substantially without harming - and sometimes even enhancing - clean performance. These results position pruning not merely as a compression tool but as a structural intervention for robust RL.
title	Pruning Cannot Hurt Robustness: Certified Trade-offs in Reinforcement Learning
topic	Machine Learning
url	https://arxiv.org/abs/2510.12939

Similar Items