| Main Authors: | Li, Jiahao Nick, Zhang, Zhuohao Jerry, Ma, Jiaju |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2409.08250 |
Similar Items
OmniActions: Predicting Digital Actions in Response to Real-World Multimodal Sensory Inputs with LLMs
by: Li, Jiahao Nick, et al.
Published: (2024)
SensorChat: Answering Qualitative and Quantitative Questions during Long-Term Multimodal Sensor Interactions
by: Yu, Xiaofan, et al.
Published: (2025)
TreeHop: Generate and Filter Next Query Embeddings Efficiently for Multi-hop Question Answering
by: Li, Zhonghao, et al.
Published: (2025)
Question Answering for Decisionmaking in Green Building Design: A Multimodal Data Reasoning Method Driven by Large Language Models
by: Li, Yihui, et al.
Published: (2024)
OmniGUI: Benchmarking GUI Agents in Omni-Modal Smartphone Environments
by: Henry, Felix, et al.
Published: (2026)
Understanding and Supporting Formal Email Exchange by Answering AI-Generated Questions
by: Miura, Yusuke, et al.
Published: (2025)
RAG-VR: Leveraging Retrieval-Augmented Generation for 3D Question Answering in VR Environments
by: Ding, Shiyi, et al.
Published: (2025)
Enabling Personalized Long-term Interactions in LLM-based Agents through Persistent Memory and User Profiles
by: Westhäußer, Rebecca, et al.
Published: (2025)
Personalized to Persuade: The Effects of Contextualization and Warmth on Trust and Reliance in Conversational AI
by: Yazan, Mert, et al.
Published: (2026)
OpenOmni: A Collaborative Open Source Tool for Building Future-Ready Multimodal Conversational Agents
by: Sun, Qiang, et al.
Published: (2024)
Say It My Way: Exploring Control in Conversational Visual Question Answering with Blind Users
by: Zeraati, Farnaz Zamiri, et al.
Published: (2026)
QACP: An Annotated Question Answering Dataset for Assisting Chinese Python Programming Learners
by: Xiao, Rui, et al.
Published: (2024)
ClearFairy: Capturing Creative Workflows through Decision Structuring, In-Situ Questioning, and Rationale Inference
by: Son, Kihoon, et al.
Published: (2025)
A Survey of Large Language Model Agents for Question Answering
by: Yue, Murong
Published: (2025)
MapAgent: Trajectory-Constructed Memory-Augmented Planning for Mobile Task Automation
by: Kong, Yi, et al.
Published: (2025)
Open-Ended Multi-Modal Relational Reasoning for Video Question Answering
by: Luo, Haozheng, et al.
Published: (2020)
Enabling On-Device LLMs Personalization with Smartphone Sensing
by: Zhang, Shiquan, et al.
Published: (2024)
CUPID: Evaluating Personalized and Contextualized Alignment of LLMs from Interactions
by: Kim, Tae Soo, et al.
Published: (2025)
Evaluating Contextually Personalized Programming Exercises Created with Generative AI
by: Logacheva, Evanfiya, et al.
Published: (2024)
AgentEconomist: An End-to-end Agentic System Translating Economic Intuitions into Executable Computational Experiments
by: Chen, Jiaju, et al.
Published: (2026)
Can LLMs Address Mental Health Questions? A Comparison with Human Therapists
by: Wang, Synthia, et al.
Published: (2025)
Comparing RAG and GraphRAG for Page-Level Retrieval Question Answering on Math Textbook
by: Chen, Eason, et al.
Published: (2025)
OmniResponse: Online Multimodal Conversational Response Generation in Dyadic Interactions
by: Luo, Cheng, et al.
Published: (2025)
RealitySummary: Exploring On-Demand Mixed Reality Text Summarization and Question Answering using Large Language Models
by: Gunturu, Aditya, et al.
Published: (2024)
AI, Take the Wheel: What Drives Delegation and Trust in Human-Computer Cooperative Question Answering?
by: Gor, Maharshi, et al.
Published: (2026)
OmniACT: A Dataset and Benchmark for Enabling Multimodal Generalist Autonomous Agents for Desktop and Web
by: Kapoor, Raghav, et al.
Published: (2024)
See or Recall: A Sanity Check for the Role of Vision in Solving Visualization Question Answer Tasks with Multimodal LLMs
by: Li, Zhimin, et al.
Published: (2025)
Gensors: Authoring Personalized Visual Sensors with Multimodal Foundation Models and Reasoning
by: Liu, Michael Xieyang, et al.
Published: (2025)
Contextualized Counterspeech: Strategies for Adaptation, Personalization, and Evaluation
by: Cima, Lorenzo, et al.
Published: (2024)
GhostWriter: Augmenting Collaborative Human-AI Writing Experiences Through Personalization and Agency
by: Yeh, Catherine, et al.
Published: (2024)
What Do People Want to Know About Artificial Intelligence (AI)? The Importance of Answering End-User Questions to Explain Autonomous Vehicle (AV) Decisions
by: Molaei, Somayeh, et al.
Published: (2025)
AffectAI-Capture: A Reproducible Multimodal Protocol for Small-Group Meeting Research
by: Seikavandi, Meisam Jamshidi, et al.
Published: (2026)
Resonance: Drawing from Memories to Imagine Positive Futures through AI-Augmented Journaling
by: Zulfikar, Wazeer, et al.
Published: (2025)
Towards Real-World Validity in Generative AI Benchmarks: Understanding and Designing Domain-Centered Evaluations for Journalism Practitioners
by: Li, Charlotte, et al.
Published: (2025)
PersonaAI: Leveraging Retrieval-Augmented Generation and Personalized Context for AI-Driven Digital Avatars
by: Kimara, Elvis, et al.
Published: (2025)
Through the Lens of Human-Human Collaboration: A Configurable Research Platform for Exploring Human-Agent Collaboration
by: Yao, Bingsheng, et al.
Published: (2025)
Cognitive Prosthetic: An AI-Enabled Multimodal System for Episodic Recall in Knowledge Work
by: Obiuwevwi, Lawrence, et al.
Published: (2026)
NaviSense: A Multimodal Assistive Mobile application for Object Retrieval by Persons with Visual Impairment
by: Sridhar, Ajay Narayanan, et al.
Published: (2025)
Are We Asking the Right Questions? On Ambiguity in Natural Language Queries for Tabular Data Analysis
by: Gomm, Daniel, et al.
Published: (2025)
PILAR: Personalizing Augmented Reality Interactions with LLM-based Human-Centric and Trustworthy Explanations for Daily Use Cases
by: Kundu, Ripan Kumar, et al.
Published: (2025)