| Main Authors: | Liu, Jiale, Zeng, Yifan, Zhang, Shaokun, Zhang, Chi, Højmark-Bertelsen, Malte, Gadeberg, Marie Normann, Wang, Huazheng, Wu, Qingyun |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2505.03973 |
Similar Items
Memory-Augmented Agent Training for Business Document Understanding
by: Liu, Jiale, et al.
Published: (2024)
Which Agent Causes Task Failures and When? On Automated Failure Attribution of LLM Multi-Agent Systems
by: Zhang, Shaokun, et al.
Published: (2025)
AutoDefense: Multi-Agent LLM Defense against Jailbreak Attacks
by: Zeng, Yifan, et al.
Published: (2024)
Offline Training of Language Model Agents with Functions as Learnable Weights
by: Zhang, Shaokun, et al.
Published: (2024)
Adaptive In-conversation Team Building for Language Model Agents
by: Song, Linxin, et al.
Published: (2024)
StateFlow: Enhancing LLM Task-Solving through State-Driven Workflows
by: Wu, Yiran, et al.
Published: (2024)
LLM-RankFusion: Mitigating Intrinsic Inconsistency in LLM-based Ranking
by: Zeng, Yifan, et al.
Published: (2024)
SWAA: Sliding Window Attention Adaptation for Efficient and Quality Preserving Long Context Processing
by: Yu, Yijiong, et al.
Published: (2025)
Embodied LLM Agents Learn to Cooperate in Organized Teams
by: Guo, Xudong, et al.
Published: (2024)
IDEAL: Influence-Driven Selective Annotations Empower In-Context Learners in Large Language Models
by: Zhang, Shaokun, et al.
Published: (2023)
When Does Multi-Agent RL Improve LLM Workflows? Workflow, Scale, and Policy-Sharing Tradeoffs
by: Zeng, Yifan, et al.
Published: (2026)
MathChat: Converse to Tackle Challenging Math Problems with LLM Agents
by: Wu, Yiran, et al.
Published: (2023)
Do Images Speak Louder than Words? Investigating the Effect of Textual Misinformation in VLMs
by: Zhang, Chi, et al.
Published: (2026)
Hierarchical Divide-and-Conquer for Fine-Grained Alignment in LLM-Based Medical Evaluation
by: Zheng, Shunfan, et al.
Published: (2025)
EcoAct: Economic Agent Determines When to Register What Action
by: Zhang, Shaokun, et al.
Published: (2024)
Forecasting Frontier Language Model Agent Capabilities
by: Pimpale, Govind, et al.
Published: (2025)
Fine-Grained Preference Optimization Improves Spatial Reasoning in VLMs
by: Shen, Yifan, et al.
Published: (2025)
AutoMLGen: Navigating Fine-Grained Optimization for Coding Agents
by: Du, Shangheng, et al.
Published: (2025)
Checkpoint Merging via Bayesian Optimization in LLM Pretraining
by: Liu, Deyuan, et al.
Published: (2024)
A Common Pitfall of Margin-based Language Model Alignment: Gradient Entanglement
by: Yuan, Hui, et al.
Published: (2024)
SimpleDoc: Multi-Modal Document Understanding with Dual-Cue Page Retrieval and Iterative Refinement
by: Jain, Chelsi, et al.
Published: (2025)
BEST-Route: Adaptive LLM Routing with Test-Time Optimal Compute
by: Ding, Dujian, et al.
Published: (2025)
Mitigating Training Imbalance in LLM Fine-Tuning via Selective Parameter Merging
by: Ju, Yiming, et al.
Published: (2024)
Verified Critical Step Optimization for LLM Agents
by: Li, Mukai, et al.
Published: (2026)
Refined Coreset Selection: Towards Minimal Coreset Size under Model Performance Constraints
by: Xia, Xiaobo, et al.
Published: (2023)
Nemotron-Research-Tool-N1: Exploring Tool-Using Language Models with Reinforced Reasoning
by: Zhang, Shaokun, et al.
Published: (2025)
KVSlimmer: Theoretical Insights and Practical Optimizations for Asymmetric KV Merging
by: Liu, Lianjun, et al.
Published: (2026)
EVOCHAMBER: Test-Time Co-evolution of Multi-Agent System at Individual, Team, and Population Scales
by: Zhang, Yaolun, et al.
Published: (2026)
Chem-FINESE: Validating Fine-Grained Few-shot Entity Extraction through Text Reconstruction
by: Wang, Qingyun, et al.
Published: (2024)
Attention Illuminates LLM Reasoning: The Preplan-and-Anchor Rhythm Enables Fine-Grained Policy Optimization
by: Li, Yang, et al.
Published: (2025)
Dynamic Fisher-weighted Model Merging via Bayesian Optimization
by: Lee, Sanwoo, et al.
Published: (2025)
Towards better Human-Agent Alignment: Assessing Task Utility in LLM-Powered Applications
by: Arabzadeh, Negar, et al.
Published: (2024)
MPO: Boosting LLM Agents with Meta Plan Optimization
by: Xiong, Weimin, et al.
Published: (2025)
Long Context Scaling: Divide and Conquer via Multi-Agent Question-driven Collaboration
by: Xiao, Sibo, et al.
Published: (2025)
Charon: A Unified and Fine-Grained Simulator for Large-Scale LLM Training and Inference
by: Yang, Mengtian, et al.
Published: (2026)
Highly Optimized Kernels and Fine-Grained Codebooks for LLM Inference on Arm CPUs
by: Gope, Dibakar, et al.
Published: (2024)
Short Chains, Deep Thoughts: Balancing Reasoning Efficiency and Intra-Segment Capability via Split-Merge Optimization
by: Gui, Runquan, et al.
Published: (2026)
IF-CRITIC: Towards a Fine-Grained LLM Critic for Instruction-Following Evaluation
by: Wen, Bosi, et al.
Published: (2025)
Trial and Error: Exploration-Based Trajectory Optimization for LLM Agents
by: Song, Yifan, et al.
Published: (2024)
Scaling LLM Inference with Optimized Sample Compute Allocation
by: Zhang, Kexun, et al.
Published: (2024)