Saved in:
| Main Authors: | Wang, Ziyi, Lu, Yuxuan, Li, Wenbo, Amini, Amirali, Sun, Bo, Bart, Yakov, Lyu, Weimin, Gesi, Jiri, Wang, Tian, Huang, Jing, Su, Yu, Ehsan, Upol, Alikhani, Malihe, Li, Toby Jia-Jun, Chilton, Lydia, Wang, Dakuo |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2506.05606 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
LLM Agent Meets Agentic AI: Can LLM Agents Simulate Customers to Evaluate Agentic-AI-based Shopping Assistants?
by: Sun, Lu, et al.
Published: (2025)
by: Sun, Lu, et al.
Published: (2025)
Shop-R1: Rewarding LLMs to Simulate Human Behavior in Online Shopping via Reinforcement Learning
by: Zhang, Yimeng, et al.
Published: (2025)
by: Zhang, Yimeng, et al.
Published: (2025)
UXAgent: A System for Simulating Usability Testing of Web Design with LLM Agents
by: Lu, Yuxuan, et al.
Published: (2025)
by: Lu, Yuxuan, et al.
Published: (2025)
UXAgent: An LLM Agent-Based Usability Testing Framework for Web Design
by: Lu, Yuxuan, et al.
Published: (2025)
by: Lu, Yuxuan, et al.
Published: (2025)
Customer-R1: Personalized Simulation of Human Behaviors via RL-based LLM Agent in Online Shopping
by: Wang, Ziyi, et al.
Published: (2025)
by: Wang, Ziyi, et al.
Published: (2025)
Evaluating Theory of (an uncertain) Mind: Predicting the Uncertain Beliefs of Others in Conversation Forecasting
by: Sicilia, Anthony, et al.
Published: (2024)
by: Sicilia, Anthony, et al.
Published: (2024)
Eliciting Uncertainty in Chain-of-Thought to Mitigate Bias against Forecasting Harmful User Behaviors
by: Sicilia, Anthony, et al.
Published: (2024)
by: Sicilia, Anthony, et al.
Published: (2024)
Explainable AI Reloaded: Challenging the XAI Status Quo in the Era of Large Language Models
by: Ehsan, Upol, et al.
Published: (2024)
by: Ehsan, Upol, et al.
Published: (2024)
Beyond Self-learned Attention: Mitigating Attention Bias in Transformer-based Models Using Attention Guidance
by: Gesi, Jiri, et al.
Published: (2024)
by: Gesi, Jiri, et al.
Published: (2024)
Multi-Agent-as-Judge: Aligning LLM-Agent-Based Automated Evaluation with Multi-Dimensional Human Evaluation
by: Chen, Jiaju, et al.
Published: (2025)
by: Chen, Jiaju, et al.
Published: (2025)
Schemex: Discovering Design Patterns from Examples through Iterative Abstraction and Refinement
by: Wang, Sitong, et al.
Published: (2025)
by: Wang, Sitong, et al.
Published: (2025)
Studying and Mitigating Biases in Sign Language Understanding Models
by: Atwell, Katherine, et al.
Published: (2024)
by: Atwell, Katherine, et al.
Published: (2024)
An Active Learning Framework for Inclusive Generation by Large Language Models
by: Hassan, Sabit, et al.
Published: (2024)
by: Hassan, Sabit, et al.
Published: (2024)
Accounting for Sycophancy in Language Model Uncertainty Estimation
by: Sicilia, Anthony, et al.
Published: (2024)
by: Sicilia, Anthony, et al.
Published: (2024)
Active Learning for Robust and Representative LLM Generation in Safety-Critical Scenarios
by: Hassan, Sabit, et al.
Published: (2024)
by: Hassan, Sabit, et al.
Published: (2024)
See, Think, Act: Online Shopper Behavior Simulation with VLM Agents
by: Zhang, Yimeng, et al.
Published: (2025)
by: Zhang, Yimeng, et al.
Published: (2025)
MoodSmith: Enabling Mood-Consistent Multimedia for AI-Generated Advocacy Campaigns
by: Menon, Samia, et al.
Published: (2024)
by: Menon, Samia, et al.
Published: (2024)
HumBEL: A Human-in-the-Loop Approach for Evaluating Demographic Factors of Language Models in Human-Machine Conversations
by: Sicilia, Anthony, et al.
Published: (2023)
by: Sicilia, Anthony, et al.
Published: (2023)
WEBSERV: A Full-Stack and RL-Ready Web Environment for Training Web Agents at Scale
by: Lu, Yuxuan, et al.
Published: (2025)
by: Lu, Yuxuan, et al.
Published: (2025)
Rewriting Video: Text-Driven Reauthoring of Video Footage
by: Wang, Sitong, et al.
Published: (2026)
by: Wang, Sitong, et al.
Published: (2026)
Can LLM Agents Simulate Multi-Turn Human Behavior? Evidence from Real Online Customer Behavior Data
by: Lu, Yuxuan, et al.
Published: (2025)
by: Lu, Yuxuan, et al.
Published: (2025)
Copying style, Extracting value: Illustrators' Perception of AI Style Transfer and its Impact on Creative Labor
by: Porquet, Julien, et al.
Published: (2024)
by: Porquet, Julien, et al.
Published: (2024)
Simulating Human Strategic Behavior: Comparing Single and Multi-agent LLMs
by: Sreedhar, Karthik, et al.
Published: (2024)
by: Sreedhar, Karthik, et al.
Published: (2024)
Trajectory2Task: Training Robust Tool-Calling Agents with Synthesized Yet Verifiable Data for Complex User Intents
by: Wang, Ziyi, et al.
Published: (2026)
by: Wang, Ziyi, et al.
Published: (2026)
BASIL: Bayesian Assessment of Sycophancy in LLMs
by: Atwell, Katherine, et al.
Published: (2025)
by: Atwell, Katherine, et al.
Published: (2025)
"Nothing about us without us": Perspectives of Global Deaf and Hard-of-hearing Community Members on Sign Language Technologies
by: Atwell, Katherine, et al.
Published: (2025)
by: Atwell, Katherine, et al.
Published: (2025)
Fairness at Every Intersection: Uncovering and Mitigating Intersectional Biases in Multimodal Clinical Predictions
by: Ramachandranpillai, Resmi, et al.
Published: (2024)
by: Ramachandranpillai, Resmi, et al.
Published: (2024)
SiLVERScore: Semantically-Aware Embeddings for Sign Language Generation Evaluation
by: Imai, Saki, et al.
Published: (2025)
by: Imai, Saki, et al.
Published: (2025)
Measuring How (Not Just Whether) VLMs Build Common Ground
by: Imai, Saki, et al.
Published: (2025)
by: Imai, Saki, et al.
Published: (2025)
Including Facial Expressions in Contextual Embeddings for Sign Language Generation
by: Viegas, Carla, et al.
Published: (2022)
by: Viegas, Carla, et al.
Published: (2022)
Towards a Design Guideline for RPA Evaluation: A Survey of Large Language Model-Based Role-Playing Agents
by: Chen, Chaoran, et al.
Published: (2025)
by: Chen, Chaoran, et al.
Published: (2025)
Towards Robustness Analysis of E-Commerce Ranking System
by: Wang, Ningfei, et al.
Published: (2024)
by: Wang, Ningfei, et al.
Published: (2024)
Eliciting Topic Hierarchies from Large Language Models
by: Li, Grace, et al.
Published: (2023)
by: Li, Grace, et al.
Published: (2023)
Human-centered explanation does not fit all: The interplay of sociotechnical, cognitive, and individual factors in the effect AI explanations in algorithmic decision-making
by: Ahn, Yongsu, et al.
Published: (2025)
by: Ahn, Yongsu, et al.
Published: (2025)
Firefly: Illuminating Large-Scale Verified Tool-Call Data Generation from Real APIs
by: Lu, Yuxuan, et al.
Published: (2026)
by: Lu, Yuxuan, et al.
Published: (2026)
Beyond Training: How Workers Discover Value in Enterprise AI
by: Sahni, Riya, et al.
Published: (2025)
by: Sahni, Riya, et al.
Published: (2025)
AI Humor Generation: Cognitive, Social and Creative Skills for Effective Humor
by: Kim, Sean, et al.
Published: (2025)
by: Kim, Sean, et al.
Published: (2025)
Through the Lens of Human-Human Collaboration: A Configurable Research Platform for Exploring Human-Agent Collaboration
by: Yao, Bingsheng, et al.
Published: (2025)
by: Yao, Bingsheng, et al.
Published: (2025)
Coherence-Driven Multimodal Safety Dialogue with Active Learning for Embodied Agents
by: Hassan, Sabit, et al.
Published: (2024)
by: Hassan, Sabit, et al.
Published: (2024)
Modeling Intensification for Sign Language Generation: A Computational Approach
by: İnan, Mert, et al.
Published: (2022)
by: İnan, Mert, et al.
Published: (2022)
Similar Items
-
LLM Agent Meets Agentic AI: Can LLM Agents Simulate Customers to Evaluate Agentic-AI-based Shopping Assistants?
by: Sun, Lu, et al.
Published: (2025) -
Shop-R1: Rewarding LLMs to Simulate Human Behavior in Online Shopping via Reinforcement Learning
by: Zhang, Yimeng, et al.
Published: (2025) -
UXAgent: A System for Simulating Usability Testing of Web Design with LLM Agents
by: Lu, Yuxuan, et al.
Published: (2025) -
UXAgent: An LLM Agent-Based Usability Testing Framework for Web Design
by: Lu, Yuxuan, et al.
Published: (2025) -
Customer-R1: Personalized Simulation of Human Behaviors via RL-based LLM Agent in Online Shopping
by: Wang, Ziyi, et al.
Published: (2025)