| Main Authors: | Du, Weihong, Liu, Jia, Wen, Zujie, Jin, Dingnan, Liang, Hongru, Lei, Wenqiang |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Online Access: | https://arxiv.org/abs/2408.03633 |
Similar Items
PAGED: A Benchmark for Procedural Graphs Extraction from Documents
by: Du, Weihong, et al.
Published: (2024)
Strength Lies in Differences! Improving Strategy Planning for Non-collaborative Dialogues via Diversified User Simulation
by: Zhang, Tong, et al.
Published: (2024)
BAR: A Backward Reasoning based Agent for Complex Minecraft Tasks
by: Du, Weihong, et al.
Published: (2025)
CLAMBER: A Benchmark of Identifying and Clarifying Ambiguous Information Needs in Large Language Models
by: Zhang, Tong, et al.
Published: (2024)
STYLE: Improving Domain Transferability of Asking Clarification Questions in Large Language Model Powered Conversational Agents
by: Chen, Yue, et al.
Published: (2024)
SCOP: Evaluating the Comprehension Process of Large Language Models from a Cognitive View
by: Xiao, Yongjie, et al.
Published: (2025)
DiffScore: Text Evaluation Beyond Autoregressive Likelihood
by: Lai, Wen, et al.
Published: (2026)
ClueAnchor: Clue-Anchored Knowledge Reasoning Exploration and Optimization for Retrieval-Augmented Generation
by: Chen, Hao, et al.
Published: (2025)
Can Large Language Models Understand Internet Buzzwords Through User-Generated Content
by: Huang, Chen, et al.
Published: (2025)
iAgent: LLM Agent as a Shield between User and Recommender Systems
by: Xu, Wujiang, et al.
Published: (2025)
User-Assistant Bias in LLMs
by: Pan, Xu, et al.
Published: (2025)
MultiLingPoT: Enhancing Mathematical Reasoning with Multilingual Program Fine-tuning
by: Li, Nianqi, et al.
Published: (2024)
AMOR: A Recipe for Building Adaptable Modular Knowledge Agents Through Process Feedback
by: Guan, Jian, et al.
Published: (2024)
A-MEM: Agentic Memory for LLM Agents
by: Xu, Wujiang, et al.
Published: (2025)
ProPerSim: Developing Proactive and Personalized AI Assistants through User-Assistant Simulation
by: Kim, Jiho, et al.
Published: (2025)
Clue-Instruct: Text-Based Clue Generation for Educational Crossword Puzzles
by: Zugarini, Andrea, et al.
Published: (2024)
User Modeling Challenges in Interactive AI Assistant Systems
by: Su, Megan, et al.
Published: (2024)
I-MCTS: Enhancing Agentic AutoML via Introspective Monte Carlo Tree Search
by: Liang, Zujie, et al.
Published: (2025)
PaperHelper: Knowledge-Based LLM QA Paper Reading Assistant
by: Yin, Congrui, et al.
Published: (2025)
FSMR: A Feature Swapping Multi-modal Reasoning Approach with Joint Textual and Visual Clues
by: Li, Shuang, et al.
Published: (2024)
Counting Clues: A Lightweight Probabilistic Baseline Can Match an LLM
by: Jia, Furong, et al.
Published: (2025)
Quantifying the Utility of User Simulators for Building Collaborative LLM Assistants
by: Suh, Joseph, et al.
Published: (2026)
InftyThink+: Effective and Efficient Infinite-Horizon Reasoning via Reinforcement Learning
by: Yan, Yuchen, et al.
Published: (2026)
QUILL: Quotation Generation Enhancement of Large Language Models
by: Xiao, Jin, et al.
Published: (2024)
Less is More: Compact Clue Selection for Efficient Retrieval-Augmented Generation Reasoning
by: Zhang, Qianchi, et al.
Published: (2025)
Read and Reap the Rewards: Learning to Play Atari with the Help of Instruction Manuals
by: Wu, Yue, et al.
Published: (2023)
Concept -- An Evaluation Protocol on Conversational Recommender Systems with System-centric and User-centric Factors
by: Huang, Chen, et al.
Published: (2024)
AMaPO: Adaptive Margin-attached Preference Optimization for Language Model Alignment
by: Deng, Ruibo, et al.
Published: (2025)
Mem-PAL: Towards Memory-based Personalized Dialogue Assistants for Long-term User-Agent Interaction
by: Huang, Zhaopei, et al.
Published: (2025)
ARAIDA: Analogical Reasoning-Augmented Interactive Data Annotation
by: Huang, Chen, et al.
Published: (2024)
Towards High Data Efficiency in Reinforcement Learning with Verifiable Reward
by: Tang, Xinyu, et al.
Published: (2025)
Proactive Conversational Assistant for a Procedural Manual Task based on Audio and IMU
by: Mahfuz, Rehana, et al.
Published: (2026)
Fusion-Eval: Integrating Assistant Evaluators with LLMs
by: Shu, Lei, et al.
Published: (2023)
Listening to Patients: A Framework of Detecting and Mitigating Patient Misreport for Medical Dialogue Generation
by: Qin, Lang, et al.
Published: (2024)
Document-level Causal Relation Extraction with Knowledge-guided Binary Question Answering
by: Wang, Zimu, et al.
Published: (2024)
LifeSim: Long-Horizon User Life Simulator for Personalized Assistant Evaluation
by: Duan, Feiyu, et al.
Published: (2026)
CARE: Privacy-Compliant Agentic Reasoning with Evidence Discordance
by: Liu, Haochen, et al.
Published: (2026)
Multi-Granular Multimodal Clue Fusion for Meme Understanding
by: Zheng, Li, et al.
Published: (2025)
MlingConf: A Comprehensive Study of Multilingual Confidence Estimation on Large Language Models
by: Xue, Boyang, et al.
Published: (2024)
CRAFTQA: A Code-Driven Adaptive Framework for Complex Structured Data Reasoning
by: Gan, Chengtao, et al.
Published: (2026)