:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Huang, Jun, Liu, Yan
Format:	Preprint
Published:	2024
Subjects:	Computer Vision and Pattern Recognition Artificial Intelligence Human-Computer Interaction Machine Learning
Online Access:	https://arxiv.org/abs/2404.15564
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Predicting and Explaining Mobile UI Tappability with Vision Modeling and Saliency Analysis
by: Schoop, Eldon, et al.
Published: (2022)

Revision Matters: Generative Design Guided by Revision Edits
by: Li, Tao, et al.
Published: (2024)

Improving Prototypical Visual Explanations with Reward Reweighing, Reselection, and Retraining
by: Li, Aaron J., et al.
Published: (2023)

I-CEE: Tailoring Explanations of Image Classification Models to User Expertise
by: Rong, Yao, et al.
Published: (2023)

Interaction as Explanation: A User Interaction-based Method for Explaining Image Classification Models
by: Yun, Hyeonggeun
Published: (2024)

DAVE: Distribution-aware Attribution via ViT Gradient Decomposition
by: Wróbel, Adam, et al.
Published: (2026)

Fusing Forces: Deep-Human-Guided Refinement of Segmentation Masks
by: Sterzinger, Rafael, et al.
Published: (2024)

From Feature Importance to Natural Language Explanations Using LLMs with RAG
by: Tekkesinoglu, Sule, et al.
Published: (2024)

EyeFormer: Predicting Personalized Scanpaths with Transformer-Guided Reinforcement Learning
by: Jiang, Yue, et al.
Published: (2024)

Dodgersort: Uncertainty-Aware VLM-Guided Human-in-the-Loop Pairwise Ranking
by: Park, Yujin, et al.
Published: (2026)

InterVLS: Interactive Model Understanding and Improvement with Vision-Language Surrogates
by: Huang, Jinbin, et al.
Published: (2023)

OpenDriver: An Open-Road Driver State Detection Dataset
by: Liu, Delong, et al.
Published: (2023)

AI Guide Dog: Egocentric Path Prediction on Smartphone
by: Jadhav, Aishwarya, et al.
Published: (2025)

AKRMap: Adaptive Kernel Regression for Trustworthy Visualization of Cross-Modal Embeddings
by: Ye, Yilin, et al.
Published: (2025)

Instruction-Guided Editing Controls for Images and Multimedia: A Survey in LLM era
by: Nguyen, Thanh Tam, et al.
Published: (2024)

Lost in Edits? A $λ$-Compass for AIGC Provenance
by: You, Wenhao, et al.
Published: (2025)

GPT Sonograpy: Hand Gesture Decoding from Forearm Ultrasound Images via VLM
by: Bimbraw, Keshav, et al.
Published: (2024)

ThermoHands: A Benchmark for 3D Hand Pose Estimation from Egocentric Thermal Images
by: Ding, Fangqiang, et al.
Published: (2024)

Gesture Matters: Pedestrian Gesture Recognition for AVs Through Skeleton Pose Evaluation
by: Mahdi, Alif Rizqullah, et al.
Published: (2026)

Learning User Embeddings from Human Gaze for Personalised Saliency Prediction
by: Strohm, Florian, et al.
Published: (2024)

ChainReaction: Causal Chain-Guided Reasoning for Modular and Explainable Causal-Why Video Question Answering
by: Parmar, Paritosh, et al.
Published: (2025)

Analysis of the 2024 BraTS Meningioma Radiotherapy Planning Automated Segmentation Challenge
by: LaBella, Dominic, et al.
Published: (2024)

How Good is ChatGPT at Audiovisual Deepfake Detection: A Comparative Study of ChatGPT, AI Models and Human Perception
by: Shahzad, Sahibzada Adil, et al.
Published: (2024)

Human-Agent Joint Learning for Efficient Robot Manipulation Skill Acquisition
by: Luo, Shengcheng, et al.
Published: (2024)

Realtime Dynamic Gaze Target Tracking and Depth-Level Estimation
by: Seraj, Esmaeil, et al.
Published: (2024)

Improving Uncertainty-Error Correspondence in Deep Bayesian Medical Image Segmentation
by: Mody, Prerak, et al.
Published: (2024)

Object Recognition in Human Computer Interaction:- A Comparative Analysis
by: Ranade, Kaushik, et al.
Published: (2024)

Graph4GUI: Graph Neural Networks for Representing Graphical User Interfaces
by: Jiang, Yue, et al.
Published: (2024)

Deep Generative Domain Adaptation with Temporal Attention for Cross-User Activity Recognition
by: Ye, Xiaozhou, et al.
Published: (2024)

Generating Synthetic Satellite Imagery for Rare Objects: An Empirical Comparison of Models and Metrics
by: Nguyen, Tuong Vy, et al.
Published: (2024)

Generating Synthetic Satellite Imagery With Deep-Learning Text-to-Image Models -- Technical Challenges and Implications for Monitoring and Verification
by: Nguyen, Tuong Vy, et al.
Published: (2024)

Deep Generative Domain Adaptation with Temporal Relation Knowledge for Cross-User Activity Recognition
by: Ye, Xiaozhou, et al.
Published: (2024)

Training a Vision Language Model as Smartphone Assistant
by: Dorka, Nicolai, et al.
Published: (2024)

Semantic Approach to Quantifying the Consistency of Diffusion Model Image Generation
by: Bent, Brinnae
Published: (2024)

MT3DNet: Multi-Task learning Network for 3D Surgical Scene Reconstruction
by: Parab, Mithun, et al.
Published: (2024)

A Study of Acquisition Functions for Medical Imaging Deep Active Learning
by: Dossou, Bonaventure F. P.
Published: (2024)

RadioActive: 3D Radiological Interactive Segmentation Benchmark
by: Ulrich, Constantin, et al.
Published: (2024)

EmoGene: Audio-Driven Emotional 3D Talking-Head Generation
by: Wang, Wenqing, et al.
Published: (2024)

HERO: Human-Feedback Efficient Reinforcement Learning for Online Diffusion Model Finetuning
by: Hiranaka, Ayano, et al.
Published: (2024)

Looking for a better fit? An Incremental Learning Multimodal Object Referencing Framework adapting to Individual Drivers
by: Gomaa, Amr, et al.
Published: (2024)