| Main Authors: | Zheng, Xiang, Ma, Xingjun, Shen, Chao, Wang, Cong |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2407.09247 |
Similar Items
CALM: Curiosity-Driven Auditing for Large Language Models
by: Zheng, Xiang, et al.
Published: (2025)
CURIOUS: Intrinsically Motivated Modular Multi-Goal Reinforcement Learning
by: Colas, Cédric, et al.
Published: (2018)
Surprise-Adaptive Intrinsic Motivation for Unsupervised Reinforcement Learning
by: Hugessen, Adriana, et al.
Published: (2024)
Fostering Intrinsic Motivation in Reinforcement Learning with Pretrained Foundation Models
by: Andres, Alain, et al.
Published: (2024)
Deliberative Searcher: Improving LLM Reliability via Reinforcement Learning with constraints
by: Yin, Zhenyun, et al.
Published: (2025)
Defense-to-Attack: Bypassing Weak Defenses Enables Stronger Jailbreaks in Vision-Language Models
by: Zhao, Yunhan, et al.
Published: (2025)
Beyond Surface Judgments: Human-Grounded Risk Evaluation of LLM-Generated Disinformation
by: Xu, Zonghuan, et al.
Published: (2026)
Minding Motivation: The Effect of Intrinsic Motivation on Agent Behaviors
by: Villalobos-Arias, Leonardo, et al.
Published: (2025)
Joint Intrinsic Motivation for Coordinated Exploration in Multi-Agent Deep Reinforcement Learning
by: Toquebiau, Maxime, et al.
Published: (2024)
Reinforcement Learning with Intrinsically Motivated Feedback Graph for Lost-sales Inventory Control
by: Liu, Zifan, et al.
Published: (2024)
Autotelic Agents with Intrinsically Motivated Goal-Conditioned Reinforcement Learning: a Short Survey
by: Colas, Cédric, et al.
Published: (2020)
Autotelic Reinforcement Learning: Exploring Intrinsic Motivations for Skill Acquisition in Open-Ended Environments
by: Srivastava, Prakhar, et al.
Published: (2025)
Just Ask: Curious Code Agents Reveal System Prompts in Frontier LLMs
by: Zheng, Xiang, et al.
Published: (2026)
MetaVIM: Meta Variationally Intrinsic Motivated Reinforcement Learning for Decentralized Traffic Signal Control
by: Zhu, Liwen, et al.
Published: (2021)
From Order to Distribution: A Spectral Characterization of Forgetting in Continual Learning
by: Xu, Zonghuan, et al.
Published: (2026)
Image-Based Deep Reinforcement Learning with Intrinsically Motivated Stimuli: On the Execution of Complex Robotic Tasks
by: Valencia, David, et al.
Published: (2024)
Learning Formal Mathematics From Intrinsic Motivation
by: Poesia, Gabriel, et al.
Published: (2024)
Toward Evaluating Robustness of Reinforcement Learning with Adversarial Policy
by: Zheng, Xiang, et al.
Published: (2023)
BlueSuffix: Reinforced Blue Teaming for Vision-Language Models Against Jailbreak Attacks
by: Zhao, Yunhan, et al.
Published: (2024)
Learning Off-policy with Model-based Intrinsic Motivation For Active Online Exploration
by: Wang, Yibo, et al.
Published: (2024)
Count Counts: Motivating Exploration in LLM Reasoning with Count-based Intrinsic Rewards
by: Zhang, Xuan, et al.
Published: (2025)
Plasticine: Accelerating Research in Plasticity-Motivated Deep Reinforcement Learning
by: Yuan, Mingqi, et al.
Published: (2025)
Intrinsically-Motivated Humans and Agents in Open-World Exploration
by: Lidayan, Aly, et al.
Published: (2025)
OptiLeak: Efficient Prompt Reconstruction via Reinforcement Learning in Multi-tenant LLM Services
by: Wang, Longxiang, et al.
Published: (2026)
ICPO: Intrinsic Confidence-Driven Group Relative Preference Optimization for Efficient Reinforcement Learning
by: Wang, Jinpeng, et al.
Published: (2025)
M3-BENCH: Process-Aware Evaluation of LLM Agents' Social Behaviors in Mixed-Motive Games
by: Xie, Sixiong, et al.
Published: (2026)
An Offline Adaptation Framework for Constrained Multi-Objective Reinforcement Learning
by: Lin, Qian, et al.
Published: (2024)
Decentralized Traffic Flow Optimization Through Intrinsic Motivation
by: Papala, Himaja, et al.
Published: (2025)
Enhancing End-to-End Multi-Task Dialogue Systems: A Study on Intrinsic Motivation Reinforcement Learning Algorithms for Improved Training and Adaptability
by: Kamuni, Navin, et al.
Published: (2024)
RedTopic: Toward Topic-Diverse Red Teaming of Large Language Models
by: Ding, Jiale, et al.
Published: (2025)
DropVLA: An Action-Level Backdoor Attack on Vision-Language-Action Models
by: Xu, Zonghuan, et al.
Published: (2025)
JT-Safe: Intrinsically Enhancing the Safety and Trustworthiness of LLMs
by: Feng, Junlan, et al.
Published: (2025)
Intrinsic-Motivation Multi-Robot Social Formation Navigation with Coordinated Exploration
by: Fu, Hao, et al.
Published: (2025)
Towards a Formal Theory of the Need for Competence via Computational Intrinsic Motivation
by: Lintunen, Erik M., et al.
Published: (2025)
Internal Safety Collapse in Frontier Large Language Models
by: Wu, Yutao, et al.
Published: (2026)
Efficient Reinforcement Learning for Large Language Models with Intrinsic Exploration
by: Sun, Yan, et al.
Published: (2025)
Experience is the Best Teacher: Motivating Effective Exploration in Reinforcement Learning for LLMs
by: Zhang, Wenjian, et al.
Published: (2026)
BAMDP Shaping: a Unified Framework for Intrinsic Motivation and Reward Shaping
by: Lidayan, Aly, et al.
Published: (2024)
Decentralized Consensus Inference-based Hierarchical Reinforcement Learning for Multi-Constrained UAV Pursuit-Evasion Game
by: Yuming, Xiang, et al.
Published: (2025)
Adaptive Correlation-Weighted Intrinsic Rewards for Reinforcement Learning
by: Nguyen, Viet Bac, et al.
Published: (2026)