| Main Authors: | Ashcraft, Chace; Karra, Kiran; Carney, Josh; Drenkow, Nathan |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2504.08943 |
Similar Items
Backdoors in DRL: Four Environments Focusing on In-distribution Triggers
by: Ashcraft, Chace, et al.
Published: (2025)
Causality-Driven Audits of Model Robustness
by: Drenkow, Nathan, et al.
Published: (2024)
Trusted Weights, Treacherous Optimizations? Optimization-Triggered Backdoor Attacks on LLMs
by: Wang, Yifei, et al.
Published: (2026)
A Causal Framework for Aligning Image Quality Metrics and Deep Neural Network Robustness
by: Drenkow, Nathan, et al.
Published: (2025)
Deep Dive into Model-free Reinforcement Learning for Biological and Robotic Systems: Theory and Practice
by: Jiao, Yusheng, et al.
Published: (2024)
POLO: Preference-Guided Multi-Turn Reinforcement Learning for Lead Optimization
by: Wang, Ziqing, et al.
Published: (2025)
Towards Virtual Clinical Trials of Radiology AI with Conditional Generative Modeling
by: Killeen, Benjamin D., et al.
Published: (2025)
Detecting Dataset Bias in Medical AI: A Generalized and Modality-Agnostic Auditing Framework
by: Drenkow, Nathan, et al.
Published: (2025)
Turn-based Multi-Agent Reinforcement Learning Model Checking
by: Gross, Dennis
Published: (2025)
Large Language Models are Highly Aligned with Human Ratings of Emotional Stimuli
by: Ogg, Mattson, et al.
Published: (2025)
BubbleSpec: Turning Long-Tail Bubbles into Speculative Rollout Drafts for Synchronous Reinforcement Learning
by: Xu, Yuhang, et al.
Published: (2026)
Beyond Binary: Turning Partial Success into Dense Verifiable Rewards for Reinforcement Learning in Code Generation
by: Wang, Longwen, et al.
Published: (2026)
SLEA-RL: Step-Level Experience Augmented Reinforcement Learning for Multi-Turn Agentic Training
by: Wang, Prince Zizhuang, et al.
Published: (2026)
An Invitation to Deep Reinforcement Learning
by: Jaeger, Bernhard, et al.
Published: (2023)
Why Goal-Conditioned Reinforcement Learning Works: Relation to Dual Control
by: Lawrence, Nathan P., et al.
Published: (2025)
Causal-Paced Deep Reinforcement Learning
by: Cho, Geonwoo, et al.
Published: (2025)
Recursive Deep Inverse Reinforcement Learning
by: Ghanem, Paul, et al.
Published: (2025)
Behavior-Consistent Deep Reinforcement Learning
by: Hussing, Marcel, et al.
Published: (2026)
Satisficing Exploration for Deep Reinforcement Learning
by: Arumugam, Dilip, et al.
Published: (2024)
Rethinking Plasticity in Deep Reinforcement Learning
by: He, Zhiqiang
Published: (2026)
Understanding and Diagnosing Deep Reinforcement Learning
by: Korkmaz, Ezgi
Published: (2024)
Angel or Demon: Investigating the Plasticity Interventions' Impact on Backdoor Threats in Deep Reinforcement Learning
by: Ma, Oubo, et al.
Published: (2026)
Towards Interpretable Deep Reinforcement Learning Models via Inverse Reinforcement Learning
by: Xie, Sean, et al.
Published: (2022)
When to ASK: Uncertainty-Gated Language Assistance for Reinforcement Learning
by: Monteiro, Juarez, et al.
Published: (2026)
RLFactory: A Plug-and-Play Reinforcement Learning Post-Training Framework for LLM Multi-Turn Tool-Use
by: Chai, Jiajun, et al.
Published: (2025)
Learning Markov State Abstractions for Deep Reinforcement Learning
by: Allen, Cameron, et al.
Published: (2021)
Deep Reinforcement Learning with Gradient Eligibility Traces
by: Elelimy, Esraa, et al.
Published: (2025)
A Survey on Explainable Deep Reinforcement Learning
by: Cheng, Zelei, et al.
Published: (2025)
A Practical Introduction to Deep Reinforcement Learning
by: Sun, Yinghan, et al.
Published: (2025)
On The Presence of Double-Descent in Deep Reinforcement Learning
by: Veselý, Viktor, et al.
Published: (2025)
Adaptive Data Exploitation in Deep Reinforcement Learning
by: Yuan, Mingqi, et al.
Published: (2025)
Is Exploration or Optimization the Problem for Deep Reinforcement Learning?
by: Berseth, Glen
Published: (2025)
Understanding and Improving Hyperbolic Deep Reinforcement Learning
by: Klein, Timo, et al.
Published: (2025)
Fast Value Tracking for Deep Reinforcement Learning
by: Shih, Frank, et al.
Published: (2024)
Optimizing Automatic Differentiation with Deep Reinforcement Learning
by: Lohoff, Jamie, et al.
Published: (2024)
Streaming Deep Reinforcement Learning Finally Works
by: Elsayed, Mohamed, et al.
Published: (2024)
Weight Clipping for Deep Continual and Reinforcement Learning
by: Elsayed, Mohamed, et al.
Published: (2024)
Learning Interpretable Policies in Hindsight-Observable POMDPs through Partially Supervised Reinforcement Learning
by: Lanier, Michael, et al.
Published: (2024)
RAGEN: Understanding Self-Evolution in LLM Agents via Multi-Turn Reinforcement Learning
by: Wang, Zihan, et al.
Published: (2025)
Odysseus: Scaling VLMs to 100+ Turn Decision-Making in Games via Reinforcement Learning
by: Shi, Chengshuai, et al.
Published: (2026)