Saved in:
| Main Authors: | Wang, Hui, Zhang, Fafa, Zhang, Xiaoyu, Mu, Chaoxu |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2511.21706 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
One for All: A General Framework of LLMs-based Multi-Criteria Decision Making on Human Expert Level
by: Wang, Hui, et al.
Published: (2025)
by: Wang, Hui, et al.
Published: (2025)
Planning of Heuristics: Strategic Planning on Large Language Models with Monte Carlo Tree Search for Automating Heuristic Optimization
by: Wang, Hui, et al.
Published: (2025)
by: Wang, Hui, et al.
Published: (2025)
CogMCTS: A Novel Cognitive-Guided Monte Carlo Tree Search Framework for Iterative Heuristic Evolution with Large Language Models
by: Wang, Hui, et al.
Published: (2025)
by: Wang, Hui, et al.
Published: (2025)
Generalized Nested Rollout Policy Adaptation with Limited Repetitions
by: Cazenave, Tristan
Published: (2024)
by: Cazenave, Tristan
Published: (2024)
Nested-ReFT: Efficient Reinforcement Learning for Large Language Model Fine-Tuning via Off-Policy Rollouts
by: Heuillet, Maxime, et al.
Published: (2025)
by: Heuillet, Maxime, et al.
Published: (2025)
DenseLoRA: Dense Low-Rank Adaptation of Large Language Models
by: Mu, Lin, et al.
Published: (2025)
by: Mu, Lin, et al.
Published: (2025)
Knowledgeable Agents by Offline Reinforcement Learning from Large Language Model Rollouts
by: Pang, Jing-Cheng, et al.
Published: (2024)
by: Pang, Jing-Cheng, et al.
Published: (2024)
Plug-and-Play Policy Planner for Large Language Model Powered Dialogue Agents
by: Deng, Yang, et al.
Published: (2023)
by: Deng, Yang, et al.
Published: (2023)
TLoRA: Task-aware Low Rank Adaptation of Large Language Models
by: Lin, Weicheng, et al.
Published: (2026)
by: Lin, Weicheng, et al.
Published: (2026)
Extracting Training Dialogue Data from Large Language Model based Task Bots
by: Zhang, Shuo, et al.
Published: (2026)
by: Zhang, Shuo, et al.
Published: (2026)
TSS GAZ PTP: Towards Improving Gumbel AlphaZero with Two-stage Self-play for Multi-constrained Electric Vehicle Routing Problems
by: Wang, Hui, et al.
Published: (2025)
by: Wang, Hui, et al.
Published: (2025)
DuetSim: Building User Simulator with Dual Large Language Models for Task-Oriented Dialogues
by: Luo, Xiang, et al.
Published: (2024)
by: Luo, Xiang, et al.
Published: (2024)
Task as Context Prompting for Accurate Medical Symptom Coding Using Large Language Models
by: He, Chengyang, et al.
Published: (2025)
by: He, Chengyang, et al.
Published: (2025)
Anti-Overestimation Dialogue Policy Learning for Task-Completion Dialogue System
by: Tian, Chang, et al.
Published: (2022)
by: Tian, Chang, et al.
Published: (2022)
Large Language Models as User-Agents for Evaluating Task-Oriented-Dialogue Systems
by: Kazi, Taaha, et al.
Published: (2024)
by: Kazi, Taaha, et al.
Published: (2024)
General365: Benchmarking General Reasoning in Large Language Models Across Diverse and Challenging Tasks
by: Liu, Junlin, et al.
Published: (2026)
by: Liu, Junlin, et al.
Published: (2026)
Driving Everywhere with Large Language Model Policy Adaptation
by: Li, Boyi, et al.
Published: (2024)
by: Li, Boyi, et al.
Published: (2024)
DFlow: Diverse Dialogue Flow Simulation with Large Language Models
by: Du, Wanyu, et al.
Published: (2024)
by: Du, Wanyu, et al.
Published: (2024)
Leveraging Graph Structures and Large Language Models for End-to-End Synthetic Task-Oriented Dialogues
by: Medjad, Maya, et al.
Published: (2025)
by: Medjad, Maya, et al.
Published: (2025)
Tree-Planner: Efficient Close-loop Task Planning with Large Language Models
by: Hu, Mengkang, et al.
Published: (2023)
by: Hu, Mengkang, et al.
Published: (2023)
A Prompt-driven Task Planning Method for Multi-drones based on Large Language Model
by: Liu, Yaohua
Published: (2024)
by: Liu, Yaohua
Published: (2024)
SPEC-RL: Accelerating On-Policy Reinforcement Learning with Speculative Rollouts
by: Liu, Bingshuai, et al.
Published: (2025)
by: Liu, Bingshuai, et al.
Published: (2025)
Knowledge Graph Fusion with Large Language Models for Accurate, Explainable Manufacturing Process Planning
by: Hoang, Danny, et al.
Published: (2025)
by: Hoang, Danny, et al.
Published: (2025)
Reasoning Pattern Matters: Learning to Reason without Human Rationales
by: Pang, Chaoxu, et al.
Published: (2025)
by: Pang, Chaoxu, et al.
Published: (2025)
Multi-Party Supervised Fine-tuning of Language Models for Multi-Party Dialogue Generation
by: Wang, Xiaoyu, et al.
Published: (2024)
by: Wang, Xiaoyu, et al.
Published: (2024)
Evaluating Large Language Models in Analysing Classroom Dialogue
by: Long, Yun, et al.
Published: (2024)
by: Long, Yun, et al.
Published: (2024)
DND: Boosting Large Language Models with Dynamic Nested Depth
by: Chen, Tieyuan, et al.
Published: (2025)
by: Chen, Tieyuan, et al.
Published: (2025)
Efficient Task Adaptation in Large Language Models via Selective Parameter Optimization
by: Wan, Weijie, et al.
Published: (2026)
by: Wan, Weijie, et al.
Published: (2026)
Prompting Fairness: Integrating Causality to Debias Large Language Models
by: Li, Jingling, et al.
Published: (2024)
by: Li, Jingling, et al.
Published: (2024)
SELP: Generating Safe and Efficient Task Plans for Robot Agents with Large Language Models
by: Wu, Yi, et al.
Published: (2024)
by: Wu, Yi, et al.
Published: (2024)
Large Language Models as Generalizable Policies for Embodied Tasks
by: Szot, Andrew, et al.
Published: (2023)
by: Szot, Andrew, et al.
Published: (2023)
Task-Aligned Tool Recommendation for Large Language Models
by: Gao, Hang, et al.
Published: (2024)
by: Gao, Hang, et al.
Published: (2024)
Large Language Models as Planning Domain Generators
by: Oswald, James, et al.
Published: (2024)
by: Oswald, James, et al.
Published: (2024)
Aligning Large Language Models with Healthcare Stakeholders: A Pathway to Trustworthy AI Integration
by: Ding, Kexin, et al.
Published: (2025)
by: Ding, Kexin, et al.
Published: (2025)
Sparsity Induction for Accurate Post-Training Pruning of Large Language Models
by: Jiang, Minhao, et al.
Published: (2026)
by: Jiang, Minhao, et al.
Published: (2026)
Contextual Attention Modulation: Towards Efficient Multi-Task Adaptation in Large Language Models
by: Pan, Dayan, et al.
Published: (2025)
by: Pan, Dayan, et al.
Published: (2025)
TaskBench: Benchmarking Large Language Models for Task Automation
by: Shen, Yongliang, et al.
Published: (2023)
by: Shen, Yongliang, et al.
Published: (2023)
Adaptive Rollout Allocation for Online Reinforcement Learning with Verifiable Rewards
by: Nguyen, Hieu Trung, et al.
Published: (2026)
by: Nguyen, Hieu Trung, et al.
Published: (2026)
A Survey of the Evolution of Language Model-Based Dialogue Systems: Data, Task and Models
by: Wang, Hongru, et al.
Published: (2023)
by: Wang, Hongru, et al.
Published: (2023)
DiffAgent: Fast and Accurate Text-to-Image API Selection with Large Language Model
by: Zhao, Lirui, et al.
Published: (2024)
by: Zhao, Lirui, et al.
Published: (2024)
Similar Items
-
One for All: A General Framework of LLMs-based Multi-Criteria Decision Making on Human Expert Level
by: Wang, Hui, et al.
Published: (2025) -
Planning of Heuristics: Strategic Planning on Large Language Models with Monte Carlo Tree Search for Automating Heuristic Optimization
by: Wang, Hui, et al.
Published: (2025) -
CogMCTS: A Novel Cognitive-Guided Monte Carlo Tree Search Framework for Iterative Heuristic Evolution with Large Language Models
by: Wang, Hui, et al.
Published: (2025) -
Generalized Nested Rollout Policy Adaptation with Limited Repetitions
by: Cazenave, Tristan
Published: (2024) -
Nested-ReFT: Efficient Reinforcement Learning for Large Language Model Fine-Tuning via Off-Policy Rollouts
by: Heuillet, Maxime, et al.
Published: (2025)