:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Dai, Yanqi, Hu, Huanran, Wang, Lei, Jin, Shengjie, Chen, Xu, Lu, Zhiwu
Format:	Preprint
Published:	2024
Subjects:	Artificial Intelligence
Online Access:	https://arxiv.org/abs/2408.04203
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

CharacterBox: Evaluating the Role-Playing Capabilities of LLMs in Text-Based Virtual Worlds
by: Wang, Lei, et al.
Published: (2024)

Adaptive Task Balancing for Visual Instruction Tuning via Inter-Task Contribution and Intra-Task Difficulty
by: Dai, Yanqi, et al.
Published: (2024)

Harder Is Better: Boosting Mathematical Reasoning via Difficulty-Aware GRPO and Multi-Aspect Question Reformulation
by: Dai, Yanqi, et al.
Published: (2026)

VoxRole: A Comprehensive Benchmark for Evaluating Speech-Based Role-Playing Agents
by: Wu, Weihao, et al.
Published: (2025)

MINDECHO: Role-Playing Language Agents for Key Opinion Leaders
by: Xu, Rui, et al.
Published: (2024)

CoSER: A Comprehensive Literary Dataset and Framework for Training and Evaluating LLM Role-Playing and Persona Simulation
by: Wang, Xintao, et al.
Published: (2025)

DynSess: Dynamic Session-Level Evaluation and Optimization Framework for Role-Playing Agents
by: Zhang, Rongsheng, et al.
Published: (2026)

TravelEval: A Comprehensive Benchmarking Framework for Evaluating LLM-Powered Travel Planning Agents
by: Chen, Weiyi, et al.
Published: (2026)

AdaMARP: An Adaptive Multi-Agent Interaction Framework for General Immersive Role-Playing
by: Xu, Zhenhua, et al.
Published: (2026)

MultiEditor: Controllable Multimodal Object Editing for Driving Scenarios Using 3D Gaussian Splatting Priors
by: Lu, Shouyi, et al.
Published: (2025)

From Persona to Personalization: A Survey on Role-Playing Language Agents
by: Chen, Jiangjie, et al.
Published: (2024)

Guess What I am Thinking: A Benchmark for Inner Thought Reasoning of Role-Playing Language Agents
by: Xu, Rui, et al.
Published: (2025)

Identity-Driven Hierarchical Role-Playing Agents
by: Sun, Libo, et al.
Published: (2024)

TCMBench: A Comprehensive Benchmark for Evaluating Large Language Models in Traditional Chinese Medicine
by: Yue, Wenjing, et al.
Published: (2024)

CharacterGPT: A Persona Reconstruction Framework for Role-Playing Agents
by: Park, Jeiyoon, et al.
Published: (2024)

Enhancing Persona Following at Decoding Time via Dynamic Importance Estimation for Role-Playing Agents
by: Liu, Yuxin, et al.
Published: (2026)

Character is Destiny: Can Role-Playing Language Agents Make Persona-Driven Decisions?
by: Xu, Rui, et al.
Published: (2024)

RoleCDE:Benchmarking and Mitigating Role-Alignment Trade-offs in Role-Playing Agents
by: Lai, Huayi, et al.
Published: (2026)

AISafetyLab: A Comprehensive Framework for AI Safety Evaluation and Improvement
by: Zhang, Zhexin, et al.
Published: (2025)

MemBench: Towards More Comprehensive Evaluation on the Memory of LLM-based Agents
by: Tan, Haoran, et al.
Published: (2025)

Skill Drift Is Contract Violation: Proactive Maintenance for LLM Agent Skill Libraries
by: Fan, Linfeng, et al.
Published: (2026)

DPRF: A Generalizable Dynamic Persona Refinement Framework for Optimizing Behavior Alignment Between Personalized LLM Role-Playing Agents and Humans
by: Yao, Bingsheng, et al.
Published: (2025)

LARP: Language-Agent Role Play for Open-World Games
by: Yan, Ming, et al.
Published: (2023)

RPA-Check: A Multi-Stage Automated Framework for Evaluating Dynamic LLM-based Role-Playing Agents
by: Rosati, Riccardo, et al.
Published: (2026)

OpenAgentSafety: A Comprehensive Framework for Evaluating Real-World AI Agent Safety
by: Vijayvargiya, Sanidhya, et al.
Published: (2025)

DEPO: Dual-Efficiency Preference Optimization for LLM Agents
by: Chen, Sirui, et al.
Published: (2025)

AgentLens: Revealing The Lucky Pass Problem in SWE-Agent Evaluation
by: Sahoo, Priyam, et al.
Published: (2026)

VideoRewardBench: Comprehensive Evaluation of Multimodal Reward Models for Video Understanding
by: Zhang, Zhihong, et al.
Published: (2025)

Evaluating LLM-Generated Versus Human-Authored Responses in Role-Play Dialogues
by: Lu, Dongxu, et al.
Published: (2025)

A Multi-Task Role-Playing Agent Capable of Imitating Character Linguistic Styles
by: Chen, Siyuan, et al.
Published: (2024)

Role-Playing Evaluation for Large Language Models
by: Boudouri, Yassine El, et al.
Published: (2025)

Alignment Dynamics in LLM Fine-Tuning
by: Huang, Yuhan, et al.
Published: (2026)

Fine-Grained Behavior Simulation with Role-Playing Large Language Model on Social Media
by: Li, Kun, et al.
Published: (2024)

Memory as Asset: From Agent-centric to Human-centric Memory Management
by: Pan, Yanqi, et al.
Published: (2026)

DERM-3R: A Resource-Efficient Multimodal Agents Framework for Dermatologic Diagnosis and Treatment in Real-World Clinical Settings
by: Chen, Ziwen, et al.
Published: (2026)

An Empirical Study of Agent Developer Practices in AI Agent Frameworks
by: Wang, Yanlin, et al.
Published: (2025)

Open Role-Playing with Delta-Engines
by: Wu, Hongqiu, et al.
Published: (2024)

Emotional RAG: Enhancing Role-Playing Agents through Emotional Retrieval
by: Huang, Le, et al.
Published: (2024)

Bridging Writing Manner Gap in Visual Instruction Tuning by Creating LLM-aligned Instructions
by: Jing, Dong, et al.
Published: (2025)

Learning to play: A Multimodal Agent for 3D Game-Play
by: Yue, Yuguang, et al.
Published: (2025)