| Main Authors: | Rudakov, Evgenii; Shock, Jonathan; Lappi, Otto; Cowley, Benjamin Ultan |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2507.08028 |
Similar Items
Graph-Based Exploration for ARC-AGI-3 Interactive Reasoning Tasks
by: Rudakov, Evgenii, et al.
Published: (2025)
PyBatchRender: A Python Library for Batched 3D Rendering at Up to One Million FPS
by: Rudakov, Evgenii, et al.
Published: (2026)
SelfReDepth: Self-Supervised Real-Time Depth Restoration for Consumer-Grade Sensors
by: Duarte, Alexandre, et al.
Published: (2024)
Real-Time Feedback and Benchmark Dataset for Isometric Pose Evaluation
by: Jaiswal, Abhishek, et al.
Published: (2025)
StreamAvatar: Streaming Diffusion Models for Real-Time Interactive Human Avatars
by: Sun, Zhiyao, et al.
Published: (2025)
Real-Time Intuitive AI Drawing System for Collaboration: Enhancing Human Creativity through Formal and Contextual Intent Integration
by: Song, Jookyung, et al.
Published: (2025)
Scene-Aware Conversational ADAS with Generative AI for Real-Time Driver Assistance
by: Han, Kyungtae, et al.
Published: (2025)
Transfer Learning-based Real-time Handgun Detection
by: Elmir, Youssef
Published: (2023)
RITA: A Real-time Interactive Talking Avatars Framework
by: Cheng, Wuxinlin, et al.
Published: (2024)
Let's Go Real Talk: Spoken Dialogue Model for Face-to-Face Conversation
by: Park, Se Jin, et al.
Published: (2024)
Classification Metrics for Image Explanations: Towards Building Reliable XAI-Evaluations
by: Fresz, Benjamin, et al.
Published: (2024)
Scaling Computer-Use Grounding via User Interface Decomposition and Synthesis
by: Xie, Tianbao, et al.
Published: (2025)
Regressor-Guided Generative Image Editing Balances User Emotions to Reduce Time Spent Online
by: Gebhardt, Christoph, et al.
Published: (2025)
DAVE: Distribution-aware Attribution via ViT Gradient Decomposition
by: Wróbel, Adam, et al.
Published: (2026)
Negative Shanshui: Real-time Interactive Ink Painting Synthesis
by: Zhou, Aven-Le
Published: (2025)
In-Depth Analysis of Emotion Recognition through Knowledge-Based Large Language Models
by: Han, Bin, et al.
Published: (2024)
"I Can See Forever!": Evaluating Real-time VideoLLMs for Assisting Individuals with Visual Impairments
by: Zhang, Ziyi, et al.
Published: (2025)
Avatar Forcing: Real-Time Interactive Head Avatar Generation for Natural Conversation
by: Ki, Taekyung, et al.
Published: (2026)
Dermatologist-like explainable AI enhances melanoma diagnosis accuracy: eye-tracking study
by: Chanda, Tirtha, et al.
Published: (2024)
Real-Time Drivers' Drowsiness Detection and Analysis through Deep Learning
by: Zaman, ANK, et al.
Published: (2025)
Adaptive 3D UI Placement in Mixed Reality Using Deep Reinforcement Learning
by: Lu, Feiyu, et al.
Published: (2025)
ViEEG: Hierarchical Visual Neural Representation for EEG Brain Decoding
by: Liu, Minxu, et al.
Published: (2025)
Modelling the Interplay of Eye-Tracking Temporal Dynamics and Personality for Emotion Detection in Face-to-Face Settings
by: Seikavandi, Meisam J., et al.
Published: (2025)
"Jutters"
by: Driessen, Meike, et al.
Published: (2025)
Few-Shot VLM-Based G-Code and HMI Verification in CNC Machining
by: Pour, Yasaman Hashem, et al.
Published: (2025)
Achieving Effective Virtual Reality Interactions via Acoustic Gesture Recognition based on Large Language Models
by: Zhang, Xijie, et al.
Published: (2025)
SasMamba: A Lightweight Structure-Aware Stride State Space Model for 3D Human Pose Estimation
by: Cui, Hu, et al.
Published: (2025)
Pencils to Pixels: A Systematic Study of Creative Drawings across Children, Adults and AI
by: Nath, Surabhi S, et al.
Published: (2025)
Augmenting Image Annotation: A Human-LMM Collaborative Framework for Efficient Object Selection and Label Generation
by: Zhang, He, et al.
Published: (2025)
Milmer: a Framework for Multiple Instance Learning based Multimodal Emotion Recognition
by: Wang, Zaitian, et al.
Published: (2025)
Towards a Multimodal Document-grounded Conversational AI System for Education
by: Taneja, Karan, et al.
Published: (2025)
OLMD: Orientation-aware Long-term Motion Decoupling for Continuous Sign Language Recognition
by: Yu, Yiheng, et al.
Published: (2025)
OmniResponse: Online Multimodal Conversational Response Generation in Dyadic Interactions
by: Luo, Cheng, et al.
Published: (2025)
Towards Safer and Understandable Driver Intention Prediction
by: Karuppasamy, Mukilan, et al.
Published: (2025)
Reading Smiles: Proxy Bias in Foundation Models for Facial Emotion Recognition
by: Tsangko, Iosif, et al.
Published: (2025)
GazeLLM: Multimodal LLMs incorporating Human Visual Attention
by: Rekimoto, Jun
Published: (2025)
Automated Visual Attention Detection using Mobile Eye Tracking in Behavioral Classroom Studies
by: Bozkir, Efe, et al.
Published: (2025)
UI-UG: A Unified MLLM for UI Understanding and Generation
by: Yang, Hao, et al.
Published: (2025)
Pose-Robust Calibration Strategy for Point-of-Gaze Estimation on Mobile Phones
by: Zhao, Yujie, et al.
Published: (2025)
Advancing the Understanding and Evaluation of AR-Generated Scenes: When Vision-Language Models Shine and Stumble
by: Duan, Lin, et al.
Published: (2025)