Saved in:
| Main Authors: | Dai, Yanqi, Hu, Huanran, Wang, Lei, Jin, Shengjie, Chen, Xu, Lu, Zhiwu |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2408.04203 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
CharacterBox: Evaluating the Role-Playing Capabilities of LLMs in Text-Based Virtual Worlds
by: Wang, Lei, et al.
Published: (2024)
by: Wang, Lei, et al.
Published: (2024)
Adaptive Task Balancing for Visual Instruction Tuning via Inter-Task Contribution and Intra-Task Difficulty
by: Dai, Yanqi, et al.
Published: (2024)
by: Dai, Yanqi, et al.
Published: (2024)
Harder Is Better: Boosting Mathematical Reasoning via Difficulty-Aware GRPO and Multi-Aspect Question Reformulation
by: Dai, Yanqi, et al.
Published: (2026)
by: Dai, Yanqi, et al.
Published: (2026)
VoxRole: A Comprehensive Benchmark for Evaluating Speech-Based Role-Playing Agents
by: Wu, Weihao, et al.
Published: (2025)
by: Wu, Weihao, et al.
Published: (2025)
MINDECHO: Role-Playing Language Agents for Key Opinion Leaders
by: Xu, Rui, et al.
Published: (2024)
by: Xu, Rui, et al.
Published: (2024)
CoSER: A Comprehensive Literary Dataset and Framework for Training and Evaluating LLM Role-Playing and Persona Simulation
by: Wang, Xintao, et al.
Published: (2025)
by: Wang, Xintao, et al.
Published: (2025)
DynSess: Dynamic Session-Level Evaluation and Optimization Framework for Role-Playing Agents
by: Zhang, Rongsheng, et al.
Published: (2026)
by: Zhang, Rongsheng, et al.
Published: (2026)
TravelEval: A Comprehensive Benchmarking Framework for Evaluating LLM-Powered Travel Planning Agents
by: Chen, Weiyi, et al.
Published: (2026)
by: Chen, Weiyi, et al.
Published: (2026)
AdaMARP: An Adaptive Multi-Agent Interaction Framework for General Immersive Role-Playing
by: Xu, Zhenhua, et al.
Published: (2026)
by: Xu, Zhenhua, et al.
Published: (2026)
MultiEditor: Controllable Multimodal Object Editing for Driving Scenarios Using 3D Gaussian Splatting Priors
by: Lu, Shouyi, et al.
Published: (2025)
by: Lu, Shouyi, et al.
Published: (2025)
From Persona to Personalization: A Survey on Role-Playing Language Agents
by: Chen, Jiangjie, et al.
Published: (2024)
by: Chen, Jiangjie, et al.
Published: (2024)
Guess What I am Thinking: A Benchmark for Inner Thought Reasoning of Role-Playing Language Agents
by: Xu, Rui, et al.
Published: (2025)
by: Xu, Rui, et al.
Published: (2025)
Identity-Driven Hierarchical Role-Playing Agents
by: Sun, Libo, et al.
Published: (2024)
by: Sun, Libo, et al.
Published: (2024)
TCMBench: A Comprehensive Benchmark for Evaluating Large Language Models in Traditional Chinese Medicine
by: Yue, Wenjing, et al.
Published: (2024)
by: Yue, Wenjing, et al.
Published: (2024)
CharacterGPT: A Persona Reconstruction Framework for Role-Playing Agents
by: Park, Jeiyoon, et al.
Published: (2024)
by: Park, Jeiyoon, et al.
Published: (2024)
Enhancing Persona Following at Decoding Time via Dynamic Importance Estimation for Role-Playing Agents
by: Liu, Yuxin, et al.
Published: (2026)
by: Liu, Yuxin, et al.
Published: (2026)
Character is Destiny: Can Role-Playing Language Agents Make Persona-Driven Decisions?
by: Xu, Rui, et al.
Published: (2024)
by: Xu, Rui, et al.
Published: (2024)
RoleCDE:Benchmarking and Mitigating Role-Alignment Trade-offs in Role-Playing Agents
by: Lai, Huayi, et al.
Published: (2026)
by: Lai, Huayi, et al.
Published: (2026)
AISafetyLab: A Comprehensive Framework for AI Safety Evaluation and Improvement
by: Zhang, Zhexin, et al.
Published: (2025)
by: Zhang, Zhexin, et al.
Published: (2025)
MemBench: Towards More Comprehensive Evaluation on the Memory of LLM-based Agents
by: Tan, Haoran, et al.
Published: (2025)
by: Tan, Haoran, et al.
Published: (2025)
Skill Drift Is Contract Violation: Proactive Maintenance for LLM Agent Skill Libraries
by: Fan, Linfeng, et al.
Published: (2026)
by: Fan, Linfeng, et al.
Published: (2026)
DPRF: A Generalizable Dynamic Persona Refinement Framework for Optimizing Behavior Alignment Between Personalized LLM Role-Playing Agents and Humans
by: Yao, Bingsheng, et al.
Published: (2025)
by: Yao, Bingsheng, et al.
Published: (2025)
LARP: Language-Agent Role Play for Open-World Games
by: Yan, Ming, et al.
Published: (2023)
by: Yan, Ming, et al.
Published: (2023)
RPA-Check: A Multi-Stage Automated Framework for Evaluating Dynamic LLM-based Role-Playing Agents
by: Rosati, Riccardo, et al.
Published: (2026)
by: Rosati, Riccardo, et al.
Published: (2026)
OpenAgentSafety: A Comprehensive Framework for Evaluating Real-World AI Agent Safety
by: Vijayvargiya, Sanidhya, et al.
Published: (2025)
by: Vijayvargiya, Sanidhya, et al.
Published: (2025)
DEPO: Dual-Efficiency Preference Optimization for LLM Agents
by: Chen, Sirui, et al.
Published: (2025)
by: Chen, Sirui, et al.
Published: (2025)
AgentLens: Revealing The Lucky Pass Problem in SWE-Agent Evaluation
by: Sahoo, Priyam, et al.
Published: (2026)
by: Sahoo, Priyam, et al.
Published: (2026)
VideoRewardBench: Comprehensive Evaluation of Multimodal Reward Models for Video Understanding
by: Zhang, Zhihong, et al.
Published: (2025)
by: Zhang, Zhihong, et al.
Published: (2025)
Evaluating LLM-Generated Versus Human-Authored Responses in Role-Play Dialogues
by: Lu, Dongxu, et al.
Published: (2025)
by: Lu, Dongxu, et al.
Published: (2025)
A Multi-Task Role-Playing Agent Capable of Imitating Character Linguistic Styles
by: Chen, Siyuan, et al.
Published: (2024)
by: Chen, Siyuan, et al.
Published: (2024)
Role-Playing Evaluation for Large Language Models
by: Boudouri, Yassine El, et al.
Published: (2025)
by: Boudouri, Yassine El, et al.
Published: (2025)
Alignment Dynamics in LLM Fine-Tuning
by: Huang, Yuhan, et al.
Published: (2026)
by: Huang, Yuhan, et al.
Published: (2026)
Fine-Grained Behavior Simulation with Role-Playing Large Language Model on Social Media
by: Li, Kun, et al.
Published: (2024)
by: Li, Kun, et al.
Published: (2024)
Memory as Asset: From Agent-centric to Human-centric Memory Management
by: Pan, Yanqi, et al.
Published: (2026)
by: Pan, Yanqi, et al.
Published: (2026)
DERM-3R: A Resource-Efficient Multimodal Agents Framework for Dermatologic Diagnosis and Treatment in Real-World Clinical Settings
by: Chen, Ziwen, et al.
Published: (2026)
by: Chen, Ziwen, et al.
Published: (2026)
An Empirical Study of Agent Developer Practices in AI Agent Frameworks
by: Wang, Yanlin, et al.
Published: (2025)
by: Wang, Yanlin, et al.
Published: (2025)
Open Role-Playing with Delta-Engines
by: Wu, Hongqiu, et al.
Published: (2024)
by: Wu, Hongqiu, et al.
Published: (2024)
Emotional RAG: Enhancing Role-Playing Agents through Emotional Retrieval
by: Huang, Le, et al.
Published: (2024)
by: Huang, Le, et al.
Published: (2024)
Bridging Writing Manner Gap in Visual Instruction Tuning by Creating LLM-aligned Instructions
by: Jing, Dong, et al.
Published: (2025)
by: Jing, Dong, et al.
Published: (2025)
Learning to play: A Multimodal Agent for 3D Game-Play
by: Yue, Yuguang, et al.
Published: (2025)
by: Yue, Yuguang, et al.
Published: (2025)
Similar Items
-
CharacterBox: Evaluating the Role-Playing Capabilities of LLMs in Text-Based Virtual Worlds
by: Wang, Lei, et al.
Published: (2024) -
Adaptive Task Balancing for Visual Instruction Tuning via Inter-Task Contribution and Intra-Task Difficulty
by: Dai, Yanqi, et al.
Published: (2024) -
Harder Is Better: Boosting Mathematical Reasoning via Difficulty-Aware GRPO and Multi-Aspect Question Reformulation
by: Dai, Yanqi, et al.
Published: (2026) -
VoxRole: A Comprehensive Benchmark for Evaluating Speech-Based Role-Playing Agents
by: Wu, Weihao, et al.
Published: (2025) -
MINDECHO: Role-Playing Language Agents for Key Opinion Leaders
by: Xu, Rui, et al.
Published: (2024)