:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Wang, Ziyi, Lu, Yuxuan, Li, Wenbo, Amini, Amirali, Sun, Bo, Bart, Yakov, Lyu, Weimin, Gesi, Jiri, Wang, Tian, Huang, Jing, Su, Yu, Ehsan, Upol, Alikhani, Malihe, Li, Toby Jia-Jun, Chilton, Lydia, Wang, Dakuo
Format:	Preprint
Published:	2025
Subjects:	Computation and Language Human-Computer Interaction
Online Access:	https://arxiv.org/abs/2506.05606
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

LLM Agent Meets Agentic AI: Can LLM Agents Simulate Customers to Evaluate Agentic-AI-based Shopping Assistants?
by: Sun, Lu, et al.
Published: (2025)

Shop-R1: Rewarding LLMs to Simulate Human Behavior in Online Shopping via Reinforcement Learning
by: Zhang, Yimeng, et al.
Published: (2025)

UXAgent: A System for Simulating Usability Testing of Web Design with LLM Agents
by: Lu, Yuxuan, et al.
Published: (2025)

UXAgent: An LLM Agent-Based Usability Testing Framework for Web Design
by: Lu, Yuxuan, et al.
Published: (2025)

Customer-R1: Personalized Simulation of Human Behaviors via RL-based LLM Agent in Online Shopping
by: Wang, Ziyi, et al.
Published: (2025)

Evaluating Theory of (an uncertain) Mind: Predicting the Uncertain Beliefs of Others in Conversation Forecasting
by: Sicilia, Anthony, et al.
Published: (2024)

Eliciting Uncertainty in Chain-of-Thought to Mitigate Bias against Forecasting Harmful User Behaviors
by: Sicilia, Anthony, et al.
Published: (2024)

Explainable AI Reloaded: Challenging the XAI Status Quo in the Era of Large Language Models
by: Ehsan, Upol, et al.
Published: (2024)

Beyond Self-learned Attention: Mitigating Attention Bias in Transformer-based Models Using Attention Guidance
by: Gesi, Jiri, et al.
Published: (2024)

Multi-Agent-as-Judge: Aligning LLM-Agent-Based Automated Evaluation with Multi-Dimensional Human Evaluation
by: Chen, Jiaju, et al.
Published: (2025)

Schemex: Discovering Design Patterns from Examples through Iterative Abstraction and Refinement
by: Wang, Sitong, et al.
Published: (2025)

Studying and Mitigating Biases in Sign Language Understanding Models
by: Atwell, Katherine, et al.
Published: (2024)

An Active Learning Framework for Inclusive Generation by Large Language Models
by: Hassan, Sabit, et al.
Published: (2024)

Accounting for Sycophancy in Language Model Uncertainty Estimation
by: Sicilia, Anthony, et al.
Published: (2024)

Active Learning for Robust and Representative LLM Generation in Safety-Critical Scenarios
by: Hassan, Sabit, et al.
Published: (2024)

See, Think, Act: Online Shopper Behavior Simulation with VLM Agents
by: Zhang, Yimeng, et al.
Published: (2025)

MoodSmith: Enabling Mood-Consistent Multimedia for AI-Generated Advocacy Campaigns
by: Menon, Samia, et al.
Published: (2024)

HumBEL: A Human-in-the-Loop Approach for Evaluating Demographic Factors of Language Models in Human-Machine Conversations
by: Sicilia, Anthony, et al.
Published: (2023)

WEBSERV: A Full-Stack and RL-Ready Web Environment for Training Web Agents at Scale
by: Lu, Yuxuan, et al.
Published: (2025)

Rewriting Video: Text-Driven Reauthoring of Video Footage
by: Wang, Sitong, et al.
Published: (2026)

Can LLM Agents Simulate Multi-Turn Human Behavior? Evidence from Real Online Customer Behavior Data
by: Lu, Yuxuan, et al.
Published: (2025)

Copying style, Extracting value: Illustrators' Perception of AI Style Transfer and its Impact on Creative Labor
by: Porquet, Julien, et al.
Published: (2024)

Simulating Human Strategic Behavior: Comparing Single and Multi-agent LLMs
by: Sreedhar, Karthik, et al.
Published: (2024)

Trajectory2Task: Training Robust Tool-Calling Agents with Synthesized Yet Verifiable Data for Complex User Intents
by: Wang, Ziyi, et al.
Published: (2026)

BASIL: Bayesian Assessment of Sycophancy in LLMs
by: Atwell, Katherine, et al.
Published: (2025)

"Nothing about us without us": Perspectives of Global Deaf and Hard-of-hearing Community Members on Sign Language Technologies
by: Atwell, Katherine, et al.
Published: (2025)

Fairness at Every Intersection: Uncovering and Mitigating Intersectional Biases in Multimodal Clinical Predictions
by: Ramachandranpillai, Resmi, et al.
Published: (2024)

SiLVERScore: Semantically-Aware Embeddings for Sign Language Generation Evaluation
by: Imai, Saki, et al.
Published: (2025)

Measuring How (Not Just Whether) VLMs Build Common Ground
by: Imai, Saki, et al.
Published: (2025)

Including Facial Expressions in Contextual Embeddings for Sign Language Generation
by: Viegas, Carla, et al.
Published: (2022)

Towards a Design Guideline for RPA Evaluation: A Survey of Large Language Model-Based Role-Playing Agents
by: Chen, Chaoran, et al.
Published: (2025)

Towards Robustness Analysis of E-Commerce Ranking System
by: Wang, Ningfei, et al.
Published: (2024)

Eliciting Topic Hierarchies from Large Language Models
by: Li, Grace, et al.
Published: (2023)

Human-centered explanation does not fit all: The interplay of sociotechnical, cognitive, and individual factors in the effect AI explanations in algorithmic decision-making
by: Ahn, Yongsu, et al.
Published: (2025)

Firefly: Illuminating Large-Scale Verified Tool-Call Data Generation from Real APIs
by: Lu, Yuxuan, et al.
Published: (2026)

Beyond Training: How Workers Discover Value in Enterprise AI
by: Sahni, Riya, et al.
Published: (2025)

AI Humor Generation: Cognitive, Social and Creative Skills for Effective Humor
by: Kim, Sean, et al.
Published: (2025)

Through the Lens of Human-Human Collaboration: A Configurable Research Platform for Exploring Human-Agent Collaboration
by: Yao, Bingsheng, et al.
Published: (2025)

Coherence-Driven Multimodal Safety Dialogue with Active Learning for Embodied Agents
by: Hassan, Sabit, et al.
Published: (2024)

Modeling Intensification for Sign Language Generation: A Computational Approach
by: İnan, Mert, et al.
Published: (2022)