Saved in:
| Main Authors: | Huang, Jun, Liu, Yan |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2404.15564 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Predicting and Explaining Mobile UI Tappability with Vision Modeling and Saliency Analysis
by: Schoop, Eldon, et al.
Published: (2022)
by: Schoop, Eldon, et al.
Published: (2022)
Revision Matters: Generative Design Guided by Revision Edits
by: Li, Tao, et al.
Published: (2024)
by: Li, Tao, et al.
Published: (2024)
Improving Prototypical Visual Explanations with Reward Reweighing, Reselection, and Retraining
by: Li, Aaron J., et al.
Published: (2023)
by: Li, Aaron J., et al.
Published: (2023)
I-CEE: Tailoring Explanations of Image Classification Models to User Expertise
by: Rong, Yao, et al.
Published: (2023)
by: Rong, Yao, et al.
Published: (2023)
Interaction as Explanation: A User Interaction-based Method for Explaining Image Classification Models
by: Yun, Hyeonggeun
Published: (2024)
by: Yun, Hyeonggeun
Published: (2024)
DAVE: Distribution-aware Attribution via ViT Gradient Decomposition
by: Wróbel, Adam, et al.
Published: (2026)
by: Wróbel, Adam, et al.
Published: (2026)
Fusing Forces: Deep-Human-Guided Refinement of Segmentation Masks
by: Sterzinger, Rafael, et al.
Published: (2024)
by: Sterzinger, Rafael, et al.
Published: (2024)
From Feature Importance to Natural Language Explanations Using LLMs with RAG
by: Tekkesinoglu, Sule, et al.
Published: (2024)
by: Tekkesinoglu, Sule, et al.
Published: (2024)
EyeFormer: Predicting Personalized Scanpaths with Transformer-Guided Reinforcement Learning
by: Jiang, Yue, et al.
Published: (2024)
by: Jiang, Yue, et al.
Published: (2024)
Dodgersort: Uncertainty-Aware VLM-Guided Human-in-the-Loop Pairwise Ranking
by: Park, Yujin, et al.
Published: (2026)
by: Park, Yujin, et al.
Published: (2026)
InterVLS: Interactive Model Understanding and Improvement with Vision-Language Surrogates
by: Huang, Jinbin, et al.
Published: (2023)
by: Huang, Jinbin, et al.
Published: (2023)
OpenDriver: An Open-Road Driver State Detection Dataset
by: Liu, Delong, et al.
Published: (2023)
by: Liu, Delong, et al.
Published: (2023)
AI Guide Dog: Egocentric Path Prediction on Smartphone
by: Jadhav, Aishwarya, et al.
Published: (2025)
by: Jadhav, Aishwarya, et al.
Published: (2025)
AKRMap: Adaptive Kernel Regression for Trustworthy Visualization of Cross-Modal Embeddings
by: Ye, Yilin, et al.
Published: (2025)
by: Ye, Yilin, et al.
Published: (2025)
Instruction-Guided Editing Controls for Images and Multimedia: A Survey in LLM era
by: Nguyen, Thanh Tam, et al.
Published: (2024)
by: Nguyen, Thanh Tam, et al.
Published: (2024)
Lost in Edits? A $λ$-Compass for AIGC Provenance
by: You, Wenhao, et al.
Published: (2025)
by: You, Wenhao, et al.
Published: (2025)
GPT Sonograpy: Hand Gesture Decoding from Forearm Ultrasound Images via VLM
by: Bimbraw, Keshav, et al.
Published: (2024)
by: Bimbraw, Keshav, et al.
Published: (2024)
ThermoHands: A Benchmark for 3D Hand Pose Estimation from Egocentric Thermal Images
by: Ding, Fangqiang, et al.
Published: (2024)
by: Ding, Fangqiang, et al.
Published: (2024)
Gesture Matters: Pedestrian Gesture Recognition for AVs Through Skeleton Pose Evaluation
by: Mahdi, Alif Rizqullah, et al.
Published: (2026)
by: Mahdi, Alif Rizqullah, et al.
Published: (2026)
Learning User Embeddings from Human Gaze for Personalised Saliency Prediction
by: Strohm, Florian, et al.
Published: (2024)
by: Strohm, Florian, et al.
Published: (2024)
ChainReaction: Causal Chain-Guided Reasoning for Modular and Explainable Causal-Why Video Question Answering
by: Parmar, Paritosh, et al.
Published: (2025)
by: Parmar, Paritosh, et al.
Published: (2025)
Analysis of the 2024 BraTS Meningioma Radiotherapy Planning Automated Segmentation Challenge
by: LaBella, Dominic, et al.
Published: (2024)
by: LaBella, Dominic, et al.
Published: (2024)
How Good is ChatGPT at Audiovisual Deepfake Detection: A Comparative Study of ChatGPT, AI Models and Human Perception
by: Shahzad, Sahibzada Adil, et al.
Published: (2024)
by: Shahzad, Sahibzada Adil, et al.
Published: (2024)
Human-Agent Joint Learning for Efficient Robot Manipulation Skill Acquisition
by: Luo, Shengcheng, et al.
Published: (2024)
by: Luo, Shengcheng, et al.
Published: (2024)
Realtime Dynamic Gaze Target Tracking and Depth-Level Estimation
by: Seraj, Esmaeil, et al.
Published: (2024)
by: Seraj, Esmaeil, et al.
Published: (2024)
Improving Uncertainty-Error Correspondence in Deep Bayesian Medical Image Segmentation
by: Mody, Prerak, et al.
Published: (2024)
by: Mody, Prerak, et al.
Published: (2024)
Object Recognition in Human Computer Interaction:- A Comparative Analysis
by: Ranade, Kaushik, et al.
Published: (2024)
by: Ranade, Kaushik, et al.
Published: (2024)
Graph4GUI: Graph Neural Networks for Representing Graphical User Interfaces
by: Jiang, Yue, et al.
Published: (2024)
by: Jiang, Yue, et al.
Published: (2024)
Deep Generative Domain Adaptation with Temporal Attention for Cross-User Activity Recognition
by: Ye, Xiaozhou, et al.
Published: (2024)
by: Ye, Xiaozhou, et al.
Published: (2024)
Generating Synthetic Satellite Imagery for Rare Objects: An Empirical Comparison of Models and Metrics
by: Nguyen, Tuong Vy, et al.
Published: (2024)
by: Nguyen, Tuong Vy, et al.
Published: (2024)
Generating Synthetic Satellite Imagery With Deep-Learning Text-to-Image Models -- Technical Challenges and Implications for Monitoring and Verification
by: Nguyen, Tuong Vy, et al.
Published: (2024)
by: Nguyen, Tuong Vy, et al.
Published: (2024)
Deep Generative Domain Adaptation with Temporal Relation Knowledge for Cross-User Activity Recognition
by: Ye, Xiaozhou, et al.
Published: (2024)
by: Ye, Xiaozhou, et al.
Published: (2024)
Training a Vision Language Model as Smartphone Assistant
by: Dorka, Nicolai, et al.
Published: (2024)
by: Dorka, Nicolai, et al.
Published: (2024)
Semantic Approach to Quantifying the Consistency of Diffusion Model Image Generation
by: Bent, Brinnae
Published: (2024)
by: Bent, Brinnae
Published: (2024)
MT3DNet: Multi-Task learning Network for 3D Surgical Scene Reconstruction
by: Parab, Mithun, et al.
Published: (2024)
by: Parab, Mithun, et al.
Published: (2024)
A Study of Acquisition Functions for Medical Imaging Deep Active Learning
by: Dossou, Bonaventure F. P.
Published: (2024)
by: Dossou, Bonaventure F. P.
Published: (2024)
RadioActive: 3D Radiological Interactive Segmentation Benchmark
by: Ulrich, Constantin, et al.
Published: (2024)
by: Ulrich, Constantin, et al.
Published: (2024)
EmoGene: Audio-Driven Emotional 3D Talking-Head Generation
by: Wang, Wenqing, et al.
Published: (2024)
by: Wang, Wenqing, et al.
Published: (2024)
HERO: Human-Feedback Efficient Reinforcement Learning for Online Diffusion Model Finetuning
by: Hiranaka, Ayano, et al.
Published: (2024)
by: Hiranaka, Ayano, et al.
Published: (2024)
Looking for a better fit? An Incremental Learning Multimodal Object Referencing Framework adapting to Individual Drivers
by: Gomaa, Amr, et al.
Published: (2024)
by: Gomaa, Amr, et al.
Published: (2024)
Similar Items
-
Predicting and Explaining Mobile UI Tappability with Vision Modeling and Saliency Analysis
by: Schoop, Eldon, et al.
Published: (2022) -
Revision Matters: Generative Design Guided by Revision Edits
by: Li, Tao, et al.
Published: (2024) -
Improving Prototypical Visual Explanations with Reward Reweighing, Reselection, and Retraining
by: Li, Aaron J., et al.
Published: (2023) -
I-CEE: Tailoring Explanations of Image Classification Models to User Expertise
by: Rong, Yao, et al.
Published: (2023) -
Interaction as Explanation: A User Interaction-based Method for Explaining Image Classification Models
by: Yun, Hyeonggeun
Published: (2024)