:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Zheng, Xiang, Ma, Xingjun, Shen, Chao, Wang, Cong
Format:	Preprint
Published:	2024
Subjects:	Artificial Intelligence
Online Access:	https://arxiv.org/abs/2407.09247
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

CALM: Curiosity-Driven Auditing for Large Language Models
by: Zheng, Xiang, et al.
Published: (2025)

CURIOUS: Intrinsically Motivated Modular Multi-Goal Reinforcement Learning
by: Colas, Cédric, et al.
Published: (2018)

Surprise-Adaptive Intrinsic Motivation for Unsupervised Reinforcement Learning
by: Hugessen, Adriana, et al.
Published: (2024)

Fostering Intrinsic Motivation in Reinforcement Learning with Pretrained Foundation Models
by: Andres, Alain, et al.
Published: (2024)

Deliberative Searcher: Improving LLM Reliability via Reinforcement Learning with constraints
by: Yin, Zhenyun, et al.
Published: (2025)

Defense-to-Attack: Bypassing Weak Defenses Enables Stronger Jailbreaks in Vision-Language Models
by: Zhao, Yunhan, et al.
Published: (2025)

Beyond Surface Judgments: Human-Grounded Risk Evaluation of LLM-Generated Disinformation
by: Xu, Zonghuan, et al.
Published: (2026)

Minding Motivation: The Effect of Intrinsic Motivation on Agent Behaviors
by: Villalobos-Arias, Leonardo, et al.
Published: (2025)

Joint Intrinsic Motivation for Coordinated Exploration in Multi-Agent Deep Reinforcement Learning
by: Toquebiau, Maxime, et al.
Published: (2024)

Reinforcement Learning with Intrinsically Motivated Feedback Graph for Lost-sales Inventory Control
by: Liu, Zifan, et al.
Published: (2024)

Autotelic Agents with Intrinsically Motivated Goal-Conditioned Reinforcement Learning: a Short Survey
by: Colas, Cédric, et al.
Published: (2020)

Autotelic Reinforcement Learning: Exploring Intrinsic Motivations for Skill Acquisition in Open-Ended Environments
by: Srivastava, Prakhar, et al.
Published: (2025)

Just Ask: Curious Code Agents Reveal System Prompts in Frontier LLMs
by: Zheng, Xiang, et al.
Published: (2026)

MetaVIM: Meta Variationally Intrinsic Motivated Reinforcement Learning for Decentralized Traffic Signal Control
by: Zhu, Liwen, et al.
Published: (2021)

From Order to Distribution: A Spectral Characterization of Forgetting in Continual Learning
by: Xu, Zonghuan, et al.
Published: (2026)

Image-Based Deep Reinforcement Learning with Intrinsically Motivated Stimuli: On the Execution of Complex Robotic Tasks
by: Valencia, David, et al.
Published: (2024)

Learning Formal Mathematics From Intrinsic Motivation
by: Poesia, Gabriel, et al.
Published: (2024)

Toward Evaluating Robustness of Reinforcement Learning with Adversarial Policy
by: Zheng, Xiang, et al.
Published: (2023)

BlueSuffix: Reinforced Blue Teaming for Vision-Language Models Against Jailbreak Attacks
by: Zhao, Yunhan, et al.
Published: (2024)

Learning Off-policy with Model-based Intrinsic Motivation For Active Online Exploration
by: Wang, Yibo, et al.
Published: (2024)

Count Counts: Motivating Exploration in LLM Reasoning with Count-based Intrinsic Rewards
by: Zhang, Xuan, et al.
Published: (2025)

Plasticine: Accelerating Research in Plasticity-Motivated Deep Reinforcement Learning
by: Yuan, Mingqi, et al.
Published: (2025)

Intrinsically-Motivated Humans and Agents in Open-World Exploration
by: Lidayan, Aly, et al.
Published: (2025)

OptiLeak: Efficient Prompt Reconstruction via Reinforcement Learning in Multi-tenant LLM Services
by: Wang, Longxiang, et al.
Published: (2026)

ICPO: Intrinsic Confidence-Driven Group Relative Preference Optimization for Efficient Reinforcement Learning
by: Wang, Jinpeng, et al.
Published: (2025)

M3-BENCH: Process-Aware Evaluation of LLM Agents' Social Behaviors in Mixed-Motive Games
by: Xie, Sixiong, et al.
Published: (2026)

An Offline Adaptation Framework for Constrained Multi-Objective Reinforcement Learning
by: Lin, Qian, et al.
Published: (2024)

Decentralized Traffic Flow Optimization Through Intrinsic Motivation
by: Papala, Himaja, et al.
Published: (2025)

Enhancing End-to-End Multi-Task Dialogue Systems: A Study on Intrinsic Motivation Reinforcement Learning Algorithms for Improved Training and Adaptability
by: Kamuni, Navin, et al.
Published: (2024)

RedTopic: Toward Topic-Diverse Red Teaming of Large Language Models
by: Ding, Jiale, et al.
Published: (2025)

DropVLA: An Action-Level Backdoor Attack on Vision-Language-Action Models
by: Xu, Zonghuan, et al.
Published: (2025)

JT-Safe: Intrinsically Enhancing the Safety and Trustworthiness of LLMs
by: Feng, Junlan, et al.
Published: (2025)

Intrinsic-Motivation Multi-Robot Social Formation Navigation with Coordinated Exploration
by: Fu, Hao, et al.
Published: (2025)

Towards a Formal Theory of the Need for Competence via Computational Intrinsic Motivation
by: Lintunen, Erik M., et al.
Published: (2025)

Internal Safety Collapse in Frontier Large Language Models
by: Wu, Yutao, et al.
Published: (2026)

Efficient Reinforcement Learning for Large Language Models with Intrinsic Exploration
by: Sun, Yan, et al.
Published: (2025)

Experience is the Best Teacher: Motivating Effective Exploration in Reinforcement Learning for LLMs
by: Zhang, Wenjian, et al.
Published: (2026)

BAMDP Shaping: a Unified Framework for Intrinsic Motivation and Reward Shaping
by: Lidayan, Aly, et al.
Published: (2024)

Decentralized Consensus Inference-based Hierarchical Reinforcement Learning for Multi-Constrained UAV Pursuit-Evasion Game
by: Yuming, Xiang, et al.
Published: (2025)

Adaptive Correlation-Weighted Intrinsic Rewards for Reinforcement Learning
by: Nguyen, Viet Bac, et al.
Published: (2026)