Saved in:
| Main Authors: | Caron, Alberto, Hicks, Chris, Mavroudis, Vasilios |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2507.02639 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Towards Causal Model-Based Policy Optimization
by: Caron, Alberto, et al.
Published: (2025)
by: Caron, Alberto, et al.
Published: (2025)
A View on Out-of-Distribution Identification from a Statistical Testing Theory Perspective
by: Caron, Alberto, et al.
Published: (2024)
by: Caron, Alberto, et al.
Published: (2024)
Inherently Interpretable and Uncertainty-Aware Models for Online Learning in Cyber-Security Problems
by: Kolicic, Benjamin, et al.
Published: (2024)
by: Kolicic, Benjamin, et al.
Published: (2024)
Entity-based Reinforcement Learning for Autonomous Cyber Defence
by: Thompson, Isaac Symes, et al.
Published: (2024)
by: Thompson, Isaac Symes, et al.
Published: (2024)
Beyond Training-time Poisoning: Component-level and Post-training Backdoors in Deep Reinforcement Learning
by: Vyas, Sanyam, et al.
Published: (2025)
by: Vyas, Sanyam, et al.
Published: (2025)
Beyond Rewards in Reinforcement Learning for Cyber Defence
by: Bates, Elizabeth, et al.
Published: (2026)
by: Bates, Elizabeth, et al.
Published: (2026)
Mitigating Deep Reinforcement Learning Backdoors in the Neural Activation Space
by: Vyas, Sanyam, et al.
Published: (2024)
by: Vyas, Sanyam, et al.
Published: (2024)
Autonomous Network Defence using Reinforcement Learning
by: Foley, Myles, et al.
Published: (2024)
by: Foley, Myles, et al.
Published: (2024)
Nearest Neighbour with Bandit Feedback
by: Pasteris, Stephen, et al.
Published: (2023)
by: Pasteris, Stephen, et al.
Published: (2023)
Fairness with Exponential Weights
by: Pasteris, Stephen, et al.
Published: (2024)
by: Pasteris, Stephen, et al.
Published: (2024)
Extraction Propagation
by: Pasteris, Stephen, et al.
Published: (2024)
by: Pasteris, Stephen, et al.
Published: (2024)
Less is more? Rewards in RL for Cyber Defence
by: Bates, Elizabeth, et al.
Published: (2025)
by: Bates, Elizabeth, et al.
Published: (2025)
Online Convex Optimisation: The Optimal Switching Regret for all Segmentations Simultaneously
by: Pasteris, Stephen, et al.
Published: (2024)
by: Pasteris, Stephen, et al.
Published: (2024)
DRMD: Deep Reinforcement Learning for Malware Detection under Concept Drift
by: McFadden, Shae, et al.
Published: (2025)
by: McFadden, Shae, et al.
Published: (2025)
SoK: The Pitfalls of Deep Reinforcement Learning for Cybersecurity
by: McFadden, Shae, et al.
Published: (2026)
by: McFadden, Shae, et al.
Published: (2026)
An Attentive Graph Agent for Topology-Adaptive Cyber Defence
by: Sandoval, Ilya Orson, et al.
Published: (2025)
by: Sandoval, Ilya Orson, et al.
Published: (2025)
Environment Complexity and Nash Equilibria in a Sequential Social Dilemma
by: Yasir, Mustafa, et al.
Published: (2024)
by: Yasir, Mustafa, et al.
Published: (2024)
CybORG++: An Enhanced Gym for the Development of Autonomous Cyber Agents
by: Emerson, Harry, et al.
Published: (2024)
by: Emerson, Harry, et al.
Published: (2024)
Poisoning Attacks on LLMs Require a Near-constant Number of Poison Samples
by: Souly, Alexandra, et al.
Published: (2025)
by: Souly, Alexandra, et al.
Published: (2025)
Zero-Trust Network Access (ZTNA)
by: Mavroudis, Vasilios
Published: (2024)
by: Mavroudis, Vasilios
Published: (2024)
What if we could hot swap our Biometrics?
by: Crowcroft, Jon, et al.
Published: (2025)
by: Crowcroft, Jon, et al.
Published: (2025)
Efficient Exploration in Deep Reinforcement Learning: A Novel Bayesian Actor-Critic Algorithm
by: Rozanov, Nikolai
Published: (2024)
by: Rozanov, Nikolai
Published: (2024)
Active Exploration in Bayesian Model-based Reinforcement Learning for Robot Manipulation
by: Plou, Carlos, et al.
Published: (2024)
by: Plou, Carlos, et al.
Published: (2024)
Optimistic World Models: Efficient Exploration in Model-Based Deep Reinforcement Learning
by: Mete, Akshay, et al.
Published: (2026)
by: Mete, Akshay, et al.
Published: (2026)
Preference-Guided Reinforcement Learning for Efficient Exploration
by: Wang, Guojian, et al.
Published: (2024)
by: Wang, Guojian, et al.
Published: (2024)
Quantifying Mix Network Privacy Erosion with Generative Models
by: Mavroudis, Vasilios, et al.
Published: (2025)
by: Mavroudis, Vasilios, et al.
Published: (2025)
Efficient Reinforcement Learning for Large Language Models with Intrinsic Exploration
by: Sun, Yan, et al.
Published: (2025)
by: Sun, Yan, et al.
Published: (2025)
Offline Model-Based Reinforcement Learning with Anti-Exploration
by: Srinivasan, Padmanaba, et al.
Published: (2024)
by: Srinivasan, Padmanaba, et al.
Published: (2024)
Efficient Reinforcement Learning via Decoupling Exploration and Utilization
by: Yang, Jingpu, et al.
Published: (2023)
by: Yang, Jingpu, et al.
Published: (2023)
Hyper: Hyperparameter Robust Efficient Exploration in Reinforcement Learning
by: Wang, Yiran, et al.
Published: (2024)
by: Wang, Yiran, et al.
Published: (2024)
Goal-Guided Efficient Exploration via Large Language Model in Reinforcement Learning
by: Qi, Yajie, et al.
Published: (2025)
by: Qi, Yajie, et al.
Published: (2025)
Hybrid Belief Reinforcement Learning for Efficient Coordinated Spatial Exploration
by: Rizvi, Danish, et al.
Published: (2026)
by: Rizvi, Danish, et al.
Published: (2026)
Model-Free Active Exploration in Reinforcement Learning
by: Russo, Alessio, et al.
Published: (2024)
by: Russo, Alessio, et al.
Published: (2024)
Provably Efficient Exploration in Inverse Constrained Reinforcement Learning
by: Yue, Bo, et al.
Published: (2024)
by: Yue, Bo, et al.
Published: (2024)
Efficient Preference-Based Reinforcement Learning: Randomized Exploration Meets Experimental Design
by: Schlaginhaufen, Andreas, et al.
Published: (2025)
by: Schlaginhaufen, Andreas, et al.
Published: (2025)
Learning-Driven Exploration for Reinforcement Learning
by: Usama, Muhammad, et al.
Published: (2019)
by: Usama, Muhammad, et al.
Published: (2019)
Analysis of Publicly Accessible Operational Technology and Associated Risks
by: Rodda, Matthew, et al.
Published: (2025)
by: Rodda, Matthew, et al.
Published: (2025)
HonestCyberEval: An AI Cyber Risk Benchmark for Automated Software Exploitation
by: Ristea, Dan, et al.
Published: (2024)
by: Ristea, Dan, et al.
Published: (2024)
Referential Security as a New Paradigm for AI Evaluations
by: Ristea, Dan, et al.
Published: (2026)
by: Ristea, Dan, et al.
Published: (2026)
One Pic is All it Takes: Poisoning Visual Document Retrieval Augmented Generation with a Single Image
by: Shereen, Ezzeldin, et al.
Published: (2025)
by: Shereen, Ezzeldin, et al.
Published: (2025)
Similar Items
-
Towards Causal Model-Based Policy Optimization
by: Caron, Alberto, et al.
Published: (2025) -
A View on Out-of-Distribution Identification from a Statistical Testing Theory Perspective
by: Caron, Alberto, et al.
Published: (2024) -
Inherently Interpretable and Uncertainty-Aware Models for Online Learning in Cyber-Security Problems
by: Kolicic, Benjamin, et al.
Published: (2024) -
Entity-based Reinforcement Learning for Autonomous Cyber Defence
by: Thompson, Isaac Symes, et al.
Published: (2024) -
Beyond Training-time Poisoning: Component-level and Post-training Backdoors in Deep Reinforcement Learning
by: Vyas, Sanyam, et al.
Published: (2025)