Saved in:
| Main Authors: | Mao, Manqing, Ting, Paishun, Xiang, Yijian, Xu, Mingyang, Chen, Julia, Lin, Jianzhe |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2401.04883 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
GroupGPT: A Token-efficient and Privacy-preserving Agentic Framework for Multi-User Chat Assistant
by: Shen, Zhuokang, et al.
Published: (2026)
by: Shen, Zhuokang, et al.
Published: (2026)
User-Assistant Bias in LLMs
by: Pan, Xu, et al.
Published: (2025)
by: Pan, Xu, et al.
Published: (2025)
Learning How To Ask: Cycle-Consistency Refines Prompts in Multimodal Foundation Models
by: Diesendruck, Maurice, et al.
Published: (2024)
by: Diesendruck, Maurice, et al.
Published: (2024)
Facilitating Multi-turn Function Calling for LLMs via Compositional Instruction Tuning
by: Chen, Mingyang, et al.
Published: (2024)
by: Chen, Mingyang, et al.
Published: (2024)
Multi-trait User Simulation with Adaptive Decoding for Conversational Task Assistants
by: Ferreira, Rafael, et al.
Published: (2024)
by: Ferreira, Rafael, et al.
Published: (2024)
Humanlike Multi-user Agent (HUMA): Designing a Deceptively Human AI Facilitator for Group Chats
by: Jacniacki, Mateusz, et al.
Published: (2025)
by: Jacniacki, Mateusz, et al.
Published: (2025)
Can LLMs Infer Conversational Agent Users' Personality Traits from Chat History?
by: Cögendez, Derya, et al.
Published: (2026)
by: Cögendez, Derya, et al.
Published: (2026)
When Benchmarks Leak: Inference-Time Decontamination for LLMs
by: Chai, Jianzhe, et al.
Published: (2026)
by: Chai, Jianzhe, et al.
Published: (2026)
Conversational Assistants to support Heart Failure Patients: comparing a Neurosymbolic Architecture with ChatGPT
by: Tayal, Anuja, et al.
Published: (2025)
by: Tayal, Anuja, et al.
Published: (2025)
GRP: Goal-Reversed Prompting for Zero-Shot Evaluation with LLMs
by: Song, Mingyang, et al.
Published: (2025)
by: Song, Mingyang, et al.
Published: (2025)
MAC: A Multi-Agent Framework for Interactive User Clarification in Multi-turn Conversations
by: Acikgoz, Emre Can, et al.
Published: (2025)
by: Acikgoz, Emre Can, et al.
Published: (2025)
Synthetic Users, Real Differences: an Evaluation Framework for User Simulation in Multi-Turn Conversations
by: Liu, Yu Lu, et al.
Published: (2026)
by: Liu, Yu Lu, et al.
Published: (2026)
Question Suggestion for Conversational Shopping Assistants Using Product Metadata
by: Vedula, Nikhita, et al.
Published: (2024)
by: Vedula, Nikhita, et al.
Published: (2024)
Think Globally, Group Locally: Evaluating LLMs Using Multi-Lingual Word Grouping Games
by: Guerra-Solano, César, et al.
Published: (2025)
by: Guerra-Solano, César, et al.
Published: (2025)
AgentGroupChat: An Interactive Group Chat Simulacra For Better Eliciting Emergent Behavior
by: Gu, Zhouhong, et al.
Published: (2024)
by: Gu, Zhouhong, et al.
Published: (2024)
SimulatorArena: Are User Simulators Reliable Proxies for Multi-Turn Evaluation of AI Assistants?
by: Dou, Yao, et al.
Published: (2025)
by: Dou, Yao, et al.
Published: (2025)
Can Many-Shot In-Context Learning Help LLMs as Evaluators? A Preliminary Empirical Study
by: Song, Mingyang, et al.
Published: (2024)
by: Song, Mingyang, et al.
Published: (2024)
RMTBench: Benchmarking LLMs Through Multi-Turn User-Centric Role-Playing
by: Xiang, Hao, et al.
Published: (2025)
by: Xiang, Hao, et al.
Published: (2025)
Detecting Ambiguities to Guide Query Rewrite for Robust Conversations in Enterprise AI Assistants
by: Tanjim, Md Mehrab, et al.
Published: (2025)
by: Tanjim, Md Mehrab, et al.
Published: (2025)
PRISM: Probability Reallocation with In-Span Masking for Knowledge-Sensitive Alignment
by: Xu, Chenning, et al.
Published: (2026)
by: Xu, Chenning, et al.
Published: (2026)
Beyond the Illusion of Consensus: From Surface Heuristics to Knowledge-Grounded Evaluation in LLM-as-a-Judge
by: Song, Mingyang, et al.
Published: (2026)
by: Song, Mingyang, et al.
Published: (2026)
Assessing Web Search Credibility and Response Groundedness in Chat Assistants
by: Vykopal, Ivan, et al.
Published: (2025)
by: Vykopal, Ivan, et al.
Published: (2025)
SafeChat: A Framework for Building Trustworthy Collaborative Assistants and a Case Study of its Usefulness
by: Srivastava, Biplav, et al.
Published: (2025)
by: Srivastava, Biplav, et al.
Published: (2025)
Counting-Stars: A Multi-evidence, Position-aware, and Scalable Benchmark for Evaluating Long-Context Large Language Models
by: Song, Mingyang, et al.
Published: (2024)
by: Song, Mingyang, et al.
Published: (2024)
BatchPrompt: Accomplish more with less
by: Lin, Jianzhe, et al.
Published: (2023)
by: Lin, Jianzhe, et al.
Published: (2023)
Exploring a New Competency Modeling Process with Large Language Models
by: Du, Silin, et al.
Published: (2026)
by: Du, Silin, et al.
Published: (2026)
FlowKV: Enhancing Multi-Turn Conversational Coherence in LLMs via Isolated Key-Value Cache Management
by: Liu, Xiang, et al.
Published: (2025)
by: Liu, Xiang, et al.
Published: (2025)
PRISM of Opinions: A Persona-Reasoned Multimodal Framework for User-centric Conversational Stance Detection
by: Wang, Bingbing, et al.
Published: (2025)
by: Wang, Bingbing, et al.
Published: (2025)
Model Merging in the Era of Large Language Models: Methods, Applications, and Future Directions
by: Song, Mingyang, et al.
Published: (2026)
by: Song, Mingyang, et al.
Published: (2026)
Walk Before You Run! Concise LLM Reasoning via Reinforcement Learning
by: Song, Mingyang, et al.
Published: (2025)
by: Song, Mingyang, et al.
Published: (2025)
A Survey of Query Optimization in Large Language Models
by: Song, Mingyang, et al.
Published: (2024)
by: Song, Mingyang, et al.
Published: (2024)
ChatRetriever: Adapting Large Language Models for Generalized and Robust Conversational Dense Retrieval
by: Mao, Kelong, et al.
Published: (2024)
by: Mao, Kelong, et al.
Published: (2024)
LLAMAPIE: Proactive In-Ear Conversation Assistants
by: Chen, Tuochao, et al.
Published: (2025)
by: Chen, Tuochao, et al.
Published: (2025)
Beware of Words: Evaluating the Lexical Diversity of Conversational LLMs using ChatGPT as Case Study
by: Martínez, Gonzalo, et al.
Published: (2024)
by: Martínez, Gonzalo, et al.
Published: (2024)
Prosa: Rubric-Based Evaluation of LLMs on Real User Chats in Brazilian Portuguese
by: Junior, Roseval Malaquias, et al.
Published: (2026)
by: Junior, Roseval Malaquias, et al.
Published: (2026)
Proactive Hearing Assistants that Isolate Egocentric Conversations
by: Hu, Guilin, et al.
Published: (2025)
by: Hu, Guilin, et al.
Published: (2025)
Note2Chat: Improving LLMs for Multi-Turn Clinical History Taking Using Medical Notes
by: Zhou, Yang, et al.
Published: (2026)
by: Zhou, Yang, et al.
Published: (2026)
ChatSOS: Vector Database Augmented Generative Question Answering Assistant in Safety Engineering
by: Tang, Haiyang, et al.
Published: (2024)
by: Tang, Haiyang, et al.
Published: (2024)
The MUCA Roma
ProPerSim: Developing Proactive and Personalized AI Assistants through User-Assistant Simulation
by: Kim, Jiho, et al.
Published: (2025)
by: Kim, Jiho, et al.
Published: (2025)
Similar Items
-
GroupGPT: A Token-efficient and Privacy-preserving Agentic Framework for Multi-User Chat Assistant
by: Shen, Zhuokang, et al.
Published: (2026) -
User-Assistant Bias in LLMs
by: Pan, Xu, et al.
Published: (2025) -
Learning How To Ask: Cycle-Consistency Refines Prompts in Multimodal Foundation Models
by: Diesendruck, Maurice, et al.
Published: (2024) -
Facilitating Multi-turn Function Calling for LLMs via Compositional Instruction Tuning
by: Chen, Mingyang, et al.
Published: (2024) -
Multi-trait User Simulation with Adaptive Decoding for Conversational Task Assistants
by: Ferreira, Rafael, et al.
Published: (2024)