| Main Authors: | He, Lewei, Shi, Tianyu, Huang, Pengran, Chen, Bingzhi, Chen, Qianglong, Pan, Jiahui |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Online Access: | https://arxiv.org/abs/2409.18014 |
Similar Items
NaturalGAIA: A Verifiable Benchmark and Hierarchical Framework for Long-Horizon GUI Tasks
by: Zheng, Zihan, et al.
Published: (2025)
On the Roles of LLMs in Planning: Embedding LLMs into Planning Graphs
by: Zhuo, Hankz Hankui, et al.
Published: (2024)
SOLAR-RL: Semi-Online Long-horizon Assignment Reinforcement Learning
by: Wang, Jichao, et al.
Published: (2026)
Reward-Decomposed Reinforcement Learning for Immersive Video Role-Playing
by: Wang, Miao, et al.
Published: (2026)
RoleCraft-GLM: Advancing Personalized Role-Playing in Large Language Models
by: Tao, Meiling, et al.
Published: (2023)
eSapiens's DEREK Module: Deep Extraction & Reasoning Engine for Knowledge with LLMs
by: Shi, Isaac, et al.
Published: (2025)
Scaling In-Context Online Learning Capability of LLMs via Cross-Episode Meta-RL
by: Lin, Xiaofeng, et al.
Published: (2026)
The Role of Deep Learning Regularizations on Actors in Offline RL
by: Tarasov, Denis, et al.
Published: (2024)
FALCON: Feedback-driven Adaptive Long/short-term memory reinforced Coding Optimization system
by: Li, Zeyuan, et al.
Published: (2024)
Context Misleads LLMs: The Role of Context Filtering in Maintaining Safe Alignment of LLMs
by: Kim, Jinhwa, et al.
Published: (2025)
TriPlay-RL: Tri-Role Self-Play Reinforcement Learning for LLM Safety Alignment
by: Tan, Zhewen, et al.
Published: (2026)
The Role of Diversity in In-Context Learning for Large Language Models
by: Xiao, Wenyang, et al.
Published: (2025)
Is PRM Necessary? Problem-Solving RL Implicitly Induces PRM Capability in LLMs
by: Feng, Zhangying, et al.
Published: (2025)
Offline Reinforcement Learning: Role of State Aggregation and Trajectory Data
by: Jia, Zeyu, et al.
Published: (2024)
Sub-Scaling Laws: On the Role of Data Density and Training Strategies in LLMs
by: Chen, Zhengyu, et al.
Published: (2025)
Assigning Distinct Roles to Quantized and Low-Rank Matrices Toward Optimal Weight Decomposition
by: Cho, Yoonjun, et al.
Published: (2025)
Understanding LLM Performance Degradation in Multi-Instance Processing: The Roles of Instance Count and Context Length
by: Chen, Jingxuan, et al.
Published: (2026)
Iterative Zoom-In: Temporal Interval Exploration for Long Video Understanding
by: Li, Chenglin, et al.
Published: (2025)
Adapting Large Multimodal Models to Distribution Shifts: The Role of In-Context Learning
by: Zhou, Guanglin, et al.
Published: (2024)
HER: Human-like Reasoning and Reinforcement Learning for LLM Role-playing
by: Du, Chengyu, et al.
Published: (2026)
Longer Context, Deeper Thinking: Uncovering the Role of Long-Context Ability in Reasoning
by: Yang, Wang, et al.
Published: (2025)
When Correct Demonstrations Hurt: Rethinking the Role of Exemplars in In-Context Learning
by: Qiu, Chenghao, et al.
Published: (2026)
The Role of Environment Access in Agnostic Reinforcement Learning
by: Krishnamurthy, Akshay, et al.
Published: (2025)
DebFlow: Automating Agent Creation via Agent Debate
by: Su, Jinwei, et al.
Published: (2025)
Temporal Dependencies in In-Context Learning: The Role of Induction Heads
by: Bajaj, Anooshka, et al.
Published: (2026)
RoCo: Role-Based LLMs Collaboration for Automatic Heuristic Design
by: Xu, Jiawei, et al.
Published: (2025)
ProCeedRL: Process Critic with Exploratory Demonstration Reinforcement Learning for LLM Agentic Reasoning
by: Gao, Jingyue, et al.
Published: (2026)
Why Distillation can Outperform Zero-RL: The Role of Flexible Reasoning
by: Hu, Xiao, et al.
Published: (2025)
eSapiens: A Platform for Secure and Auditable Retrieval-Augmented Generation
by: Shi, Isaac, et al.
Published: (2025)
Evolving Roles of LLMs in Scientific Innovation: Assistant, Collaborator, Scientist, and Evaluator
by: Zhang, Haoxuan, et al.
Published: (2025)
On the Role of Transformer Feed-Forward Layers in Nonlinear In-Context Learning
by: Sun, Haoyuan, et al.
Published: (2025)
Causal Understanding by LLMs: The Role of Uncertainty
by: Lithgow-Serrano, Oscar, et al.
Published: (2025)
THOR: Transformer Heuristics for On-Demand Retrieval
by: Shi, Isaac, et al.
Published: (2025)
Role-Based Fault Tolerance System for LLM RL Post-Training
by: Chen, Zhenqian, et al.
Published: (2025)
The Role of Open-Source LLMs in Shaping the Future of GeoAI
by: Huang, Xiao, et al.
Published: (2025)
Dissecting Role Cognition in Medical LLMs via Neuronal Ablation
by: Liang, Xun, et al.
Published: (2025)
An Overlooked Role of Context-Sensitive Dendrites
by: Raza, Mohsin, et al.
Published: (2024)
Too Good to be Bad: On the Failure of LLMs to Role-Play Villains
by: Yi, Zihao, et al.
Published: (2025)
Rethinking the Chain-of-Thought: The Roles of In-Context Learning and Pre-trained Priors
by: Yang, Hao, et al.
Published: (2025)
CharacterBox: Evaluating the Role-Playing Capabilities of LLMs in Text-Based Virtual Worlds
by: Wang, Lei, et al.
Published: (2024)