:: Library Catalog

Bibliographic Details
Main Authors:	Li, Jiahao Nick, Zhang, Zhuohao Jerry, Ma, Jiaju
Format:	Preprint
Published:	2024
Subjects:	Human-Computer Interaction; Artificial Intelligence
Online Access:	https://arxiv.org/abs/2409.08250

Similar Items

OmniActions: Predicting Digital Actions in Response to Real-World Multimodal Sensory Inputs with LLMs
by: Li, Jiahao Nick, et al.
Published: (2024)

SensorChat: Answering Qualitative and Quantitative Questions during Long-Term Multimodal Sensor Interactions
by: Yu, Xiaofan, et al.
Published: (2025)

TreeHop: Generate and Filter Next Query Embeddings Efficiently for Multi-hop Question Answering
by: Li, Zhonghao, et al.
Published: (2025)

Question Answering for Decisionmaking in Green Building Design: A Multimodal Data Reasoning Method Driven by Large Language Models
by: Li, Yihui, et al.
Published: (2024)

OmniGUI: Benchmarking GUI Agents in Omni-Modal Smartphone Environments
by: Henry, Felix, et al.
Published: (2026)

Understanding and Supporting Formal Email Exchange by Answering AI-Generated Questions
by: Miura, Yusuke, et al.
Published: (2025)

RAG-VR: Leveraging Retrieval-Augmented Generation for 3D Question Answering in VR Environments
by: Ding, Shiyi, et al.
Published: (2025)

Enabling Personalized Long-term Interactions in LLM-based Agents through Persistent Memory and User Profiles
by: Westhäußer, Rebecca, et al.
Published: (2025)

Personalized to Persuade: The Effects of Contextualization and Warmth on Trust and Reliance in Conversational AI
by: Yazan, Mert, et al.
Published: (2026)

OpenOmni: A Collaborative Open Source Tool for Building Future-Ready Multimodal Conversational Agents
by: Sun, Qiang, et al.
Published: (2024)

Say It My Way: Exploring Control in Conversational Visual Question Answering with Blind Users
by: Zeraati, Farnaz Zamiri, et al.
Published: (2026)

QACP: An Annotated Question Answering Dataset for Assisting Chinese Python Programming Learners
by: Xiao, Rui, et al.
Published: (2024)

ClearFairy: Capturing Creative Workflows through Decision Structuring, In-Situ Questioning, and Rationale Inference
by: Son, Kihoon, et al.
Published: (2025)

A Survey of Large Language Model Agents for Question Answering
by: Yue, Murong
Published: (2025)

MapAgent: Trajectory-Constructed Memory-Augmented Planning for Mobile Task Automation
by: Kong, Yi, et al.
Published: (2025)

Open-Ended Multi-Modal Relational Reasoning for Video Question Answering
by: Luo, Haozheng, et al.
Published: (2020)

Enabling On-Device LLMs Personalization with Smartphone Sensing
by: Zhang, Shiquan, et al.
Published: (2024)

CUPID: Evaluating Personalized and Contextualized Alignment of LLMs from Interactions
by: Kim, Tae Soo, et al.
Published: (2025)

Evaluating Contextually Personalized Programming Exercises Created with Generative AI
by: Logacheva, Evanfiya, et al.
Published: (2024)

AgentEconomist: An End-to-end Agentic System Translating Economic Intuitions into Executable Computational Experiments
by: Chen, Jiaju, et al.
Published: (2026)

Can LLMs Address Mental Health Questions? A Comparison with Human Therapists
by: Wang, Synthia, et al.
Published: (2025)

Comparing RAG and GraphRAG for Page-Level Retrieval Question Answering on Math Textbook
by: Chen, Eason, et al.
Published: (2025)

OmniResponse: Online Multimodal Conversational Response Generation in Dyadic Interactions
by: Luo, Cheng, et al.
Published: (2025)

RealitySummary: Exploring On-Demand Mixed Reality Text Summarization and Question Answering using Large Language Models
by: Gunturu, Aditya, et al.
Published: (2024)

AI, Take the Wheel: What Drives Delegation and Trust in Human-Computer Cooperative Question Answering?
by: Gor, Maharshi, et al.
Published: (2026)

OmniACT: A Dataset and Benchmark for Enabling Multimodal Generalist Autonomous Agents for Desktop and Web
by: Kapoor, Raghav, et al.
Published: (2024)

See or Recall: A Sanity Check for the Role of Vision in Solving Visualization Question Answer Tasks with Multimodal LLMs
by: Li, Zhimin, et al.
Published: (2025)

Gensors: Authoring Personalized Visual Sensors with Multimodal Foundation Models and Reasoning
by: Liu, Michael Xieyang, et al.
Published: (2025)

Contextualized Counterspeech: Strategies for Adaptation, Personalization, and Evaluation
by: Cima, Lorenzo, et al.
Published: (2024)

GhostWriter: Augmenting Collaborative Human-AI Writing Experiences Through Personalization and Agency
by: Yeh, Catherine, et al.
Published: (2024)

What Do People Want to Know About Artificial Intelligence (AI)? The Importance of Answering End-User Questions to Explain Autonomous Vehicle (AV) Decisions
by: Molaei, Somayeh, et al.
Published: (2025)

AffectAI-Capture: A Reproducible Multimodal Protocol for Small-Group Meeting Research
by: Seikavandi, Meisam Jamshidi, et al.
Published: (2026)

Resonance: Drawing from Memories to Imagine Positive Futures through AI-Augmented Journaling
by: Zulfikar, Wazeer, et al.
Published: (2025)

Towards Real-World Validity in Generative AI Benchmarks: Understanding and Designing Domain-Centered Evaluations for Journalism Practitioners
by: Li, Charlotte, et al.
Published: (2025)

PersonaAI: Leveraging Retrieval-Augmented Generation and Personalized Context for AI-Driven Digital Avatars
by: Kimara, Elvis, et al.
Published: (2025)

Through the Lens of Human-Human Collaboration: A Configurable Research Platform for Exploring Human-Agent Collaboration
by: Yao, Bingsheng, et al.
Published: (2025)

Cognitive Prosthetic: An AI-Enabled Multimodal System for Episodic Recall in Knowledge Work
by: Obiuwevwi, Lawrence, et al.
Published: (2026)

NaviSense: A Multimodal Assistive Mobile application for Object Retrieval by Persons with Visual Impairment
by: Sridhar, Ajay Narayanan, et al.
Published: (2025)

Are We Asking the Right Questions? On Ambiguity in Natural Language Queries for Tabular Data Analysis
by: Gomm, Daniel, et al.
Published: (2025)

PILAR: Personalizing Augmented Reality Interactions with LLM-based Human-Centric and Trustworthy Explanations for Daily Use Cases
by: Kundu, Ripan Kumar, et al.
Published: (2025)