:: Library Catalog

Bibliographic Details
Main Authors:	Rudakov, Evgenii; Shock, Jonathan; Lappi, Otto; Cowley, Benjamin Ultan
Format:	Preprint
Published:	2025
Subjects:	Human-Computer Interaction; Artificial Intelligence; Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2507.08028

Similar Items

Graph-Based Exploration for ARC-AGI-3 Interactive Reasoning Tasks
by: Rudakov, Evgenii, et al.
Published: (2025)

PyBatchRender: A Python Library for Batched 3D Rendering at Up to One Million FPS
by: Rudakov, Evgenii, et al.
Published: (2026)

SelfReDepth: Self-Supervised Real-Time Depth Restoration for Consumer-Grade Sensors
by: Duarte, Alexandre, et al.
Published: (2024)

Real-Time Feedback and Benchmark Dataset for Isometric Pose Evaluation
by: Jaiswal, Abhishek, et al.
Published: (2025)

StreamAvatar: Streaming Diffusion Models for Real-Time Interactive Human Avatars
by: Sun, Zhiyao, et al.
Published: (2025)

Real-Time Intuitive AI Drawing System for Collaboration: Enhancing Human Creativity through Formal and Contextual Intent Integration
by: Song, Jookyung, et al.
Published: (2025)

Scene-Aware Conversational ADAS with Generative AI for Real-Time Driver Assistance
by: Han, Kyungtae, et al.
Published: (2025)

Transfer Learning-based Real-time Handgun Detection
by: Elmir, Youssef
Published: (2023)

RITA: A Real-time Interactive Talking Avatars Framework
by: Cheng, Wuxinlin, et al.
Published: (2024)

Let's Go Real Talk: Spoken Dialogue Model for Face-to-Face Conversation
by: Park, Se Jin, et al.
Published: (2024)

Classification Metrics for Image Explanations: Towards Building Reliable XAI-Evaluations
by: Fresz, Benjamin, et al.
Published: (2024)

Scaling Computer-Use Grounding via User Interface Decomposition and Synthesis
by: Xie, Tianbao, et al.
Published: (2025)

Regressor-Guided Generative Image Editing Balances User Emotions to Reduce Time Spent Online
by: Gebhardt, Christoph, et al.
Published: (2025)

DAVE: Distribution-aware Attribution via ViT Gradient Decomposition
by: Wróbel, Adam, et al.
Published: (2026)

Negative Shanshui: Real-time Interactive Ink Painting Synthesis
by: Zhou, Aven-Le
Published: (2025)

In-Depth Analysis of Emotion Recognition through Knowledge-Based Large Language Models
by: Han, Bin, et al.
Published: (2024)

"I Can See Forever!": Evaluating Real-time VideoLLMs for Assisting Individuals with Visual Impairments
by: Zhang, Ziyi, et al.
Published: (2025)

Avatar Forcing: Real-Time Interactive Head Avatar Generation for Natural Conversation
by: Ki, Taekyung, et al.
Published: (2026)

Dermatologist-like explainable AI enhances melanoma diagnosis accuracy: eye-tracking study
by: Chanda, Tirtha, et al.
Published: (2024)

Real-Time Drivers' Drowsiness Detection and Analysis through Deep Learning
by: Zaman, ANK, et al.
Published: (2025)

Adaptive 3D UI Placement in Mixed Reality Using Deep Reinforcement Learning
by: Lu, Feiyu, et al.
Published: (2025)

ViEEG: Hierarchical Visual Neural Representation for EEG Brain Decoding
by: Liu, Minxu, et al.
Published: (2025)

Modelling the Interplay of Eye-Tracking Temporal Dynamics and Personality for Emotion Detection in Face-to-Face Settings
by: Seikavandi, Meisam J., et al.
Published: (2025)

"Jutters"
by: Driessen, Meike, et al.
Published: (2025)

Few-Shot VLM-Based G-Code and HMI Verification in CNC Machining
by: Pour, Yasaman Hashem, et al.
Published: (2025)

Achieving Effective Virtual Reality Interactions via Acoustic Gesture Recognition based on Large Language Models
by: Zhang, Xijie, et al.
Published: (2025)

SasMamba: A Lightweight Structure-Aware Stride State Space Model for 3D Human Pose Estimation
by: Cui, Hu, et al.
Published: (2025)

Pencils to Pixels: A Systematic Study of Creative Drawings across Children, Adults and AI
by: Nath, Surabhi S, et al.
Published: (2025)

Augmenting Image Annotation: A Human-LMM Collaborative Framework for Efficient Object Selection and Label Generation
by: Zhang, He, et al.
Published: (2025)

Milmer: a Framework for Multiple Instance Learning based Multimodal Emotion Recognition
by: Wang, Zaitian, et al.
Published: (2025)

Towards a Multimodal Document-grounded Conversational AI System for Education
by: Taneja, Karan, et al.
Published: (2025)

OLMD: Orientation-aware Long-term Motion Decoupling for Continuous Sign Language Recognition
by: Yu, Yiheng, et al.
Published: (2025)

OmniResponse: Online Multimodal Conversational Response Generation in Dyadic Interactions
by: Luo, Cheng, et al.
Published: (2025)

Towards Safer and Understandable Driver Intention Prediction
by: Karuppasamy, Mukilan, et al.
Published: (2025)

Reading Smiles: Proxy Bias in Foundation Models for Facial Emotion Recognition
by: Tsangko, Iosif, et al.
Published: (2025)

GazeLLM: Multimodal LLMs incorporating Human Visual Attention
by: Rekimoto, Jun
Published: (2025)

Automated Visual Attention Detection using Mobile Eye Tracking in Behavioral Classroom Studies
by: Bozkir, Efe, et al.
Published: (2025)

UI-UG: A Unified MLLM for UI Understanding and Generation
by: Yang, Hao, et al.
Published: (2025)

Pose-Robust Calibration Strategy for Point-of-Gaze Estimation on Mobile Phones
by: Zhao, Yujie, et al.
Published: (2025)

Advancing the Understanding and Evaluation of AR-Generated Scenes: When Vision-Language Models Shine and Stumble
by: Duan, Lin, et al.
Published: (2025)