:: Library Catalog

Bibliographic Details
Main Authors:	Kerkouri, Mohamed Amine; Tliba, Marouane; Chetouani, Aladine; Bruno, Alessandro
Format:	Preprint
Published:	2026
Subjects:	Computer Vision and Pattern Recognition; Human-Computer Interaction
Online Access:	https://arxiv.org/abs/2602.22049

Similar Items

What They Saw, Not Just Where They Looked: Semantic Scanpath Similarity via VLMs and NLP metric
by: Kerkouri, Mohamed Amine, et al.
Published: (2026)

Quantization Effects on Neural Networks Perception: How would quantization change the perceptual field of vision models?
by: Kerkouri, Mohamed Amine, et al.
Published: (2024)

Modeling Beyond MOS: Quality Assessment Models Must Integrate Context, Reasoning, and Multimodality
by: Kerkouri, Mohamed Amine, et al.
Published: (2025)

Shifting Focus: From Global Semantics to Local Prominent Features in Swin-Transformer for Knee Osteoarthritis Severity Assessment
by: Sekhri, Aymen, et al.
Published: (2024)

Morphology-Aware KOA Classification: Integrating Graph Priors with Vision Models
by: Tliba, Marouane, et al.
Published: (2025)

Shifts in Doctors' Eye Movements Between Real and AI-Generated Medical Images
by: Wong, David C, et al.
Published: (2025)

UF-AMA: A unified framework for cross-domain emotion recognition via adaptive multimodal alignment
by: Wang, Zheng, et al.
Published: (2026)

Multi-face emotion detection for effective Human-Robot Interaction
by: Yahyaoui, Mohamed Ala, et al.
Published: (2025)

Several questions of visual generation in 2024
by: Gu, Shuyang
Published: (2024)

Real-Time Hand Gesture Recognition: Integrating Skeleton-Based Data Fusion and Multi-Stream CNN
by: Yusuf, Oluwaleke, et al.
Published: (2024)

CG-MER: A Card Game-based Multimodal dataset for Emotion Recognition
by: Farhat, Nessrine, et al.
Published: (2025)

Category-aware EEG image generation based on wavelet transform and contrast semantic loss
by: Zhang, Enshang, et al.
Published: (2025)

CT-DegradBench: A Physics-Informed Benchmark for CT Degradation Detection and Severity Estimation
by: Taifour, Yousra Nabila, et al.
Published: (2026)

Cross-user activity recognition using deep domain adaptation with temporal relation information
by: Ye, Xiaozhou, et al.
Published: (2024)

GroundUp: Rapid Sketch-Based 3D City Massing
by: Unlu, Gizem Esra, et al.
Published: (2024)

Exploring Thermography Technology: A Comprehensive Facial Dataset for Face Detection, Recognition, and Emotion
by: Abuhussein, Mohamed Fawzi Abdelshafie, et al.
Published: (2024)

Accurate online action and gesture recognition system using detectors and Deep SPD Siamese Networks
by: Akremi, Mohamed Sanim, et al.
Published: (2025)

Computer Vision for Objects used in Group Work: Challenges and Opportunities
by: Jung, Changsoo, et al.
Published: (2025)

Weak-Annotation of HAR Datasets using Vision Foundation Models
by: Bock, Marius, et al.
Published: (2024)

Generalized Pose Space Embeddings for Training In-the-Wild using Analysis-by-Synthesis
by: Borer, Dominik, et al.
Published: (2024)

CADDI: An in-Class Activity Detection Dataset using IMU data from low-cost sensors
by: Marquez-Carpintero, Luis, et al.
Published: (2025)

Unsupervised learning of Data-driven Facial Expression Coding System (DFECS) using keypoint tracking
by: Tripathi, Shivansh Chandra, et al.
Published: (2024)

GazeGPT: Augmenting Human Capabilities using Gaze-contingent Contextual AI for Smart Eyewear
by: Konrad, Robert, et al.
Published: (2024)

Is Medieval Distant Viewing Possible? : Extending and Enriching Annotation of Legacy Image Collections using Visual Analytics
by: Meinecke, Christofer, et al.
Published: (2022)

Implicit Search Intent Recognition using EEG and Eye Tracking: Novel Dataset and Cross-User Prediction
by: Sharma, Mansi, et al.
Published: (2025)

Deep Learning in Mild Cognitive Impairment Diagnosis using Eye Movements and Image Content in Visual Memory Tasks
by: Rocha, Tomás Silva Santos, et al.
Published: (2025)

Resource-Efficient Gesture Recognition using Low-Resolution Thermal Camera via Spiking Neural Networks and Sparse Segmentation
by: Safa, Ali, et al.
Published: (2024)

ASAP: Interpretable Analysis and Summarization of AI-generated Image Patterns at Scale
by: Huang, Jinbin, et al.
Published: (2024)

How good are humans at detecting AI-generated images? Learnings from an experiment
by: Roca, Thomas, et al.
Published: (2025)

Combining Transformers and CNNs for Efficient Object Detection in High-Resolution Satellite Imagery
by: Drapier, Nicolas, et al.
Published: (2025)

The Visual Experience Dataset: Over 200 Recorded Hours of Integrated Eye Movement, Odometry, and Egocentric Video
by: Greene, Michelle R., et al.
Published: (2024)

CoCoG-2: Controllable generation of visual stimuli for understanding human concept representation
by: Wei, Chen, et al.
Published: (2024)

Accurate Eye Tracking from Dense 3D Surface Reconstructions using Single-Shot Deflectometry
by: Wang, Jiazhang, et al.
Published: (2023)

Exploring Emotion Expression Recognition in Older Adults Interacting with a Virtual Coach
by: Palmero, Cristina, et al.
Published: (2023)

SCHEMA for Gemini 3 Pro Image: A Structured Methodology for Controlled AI Image Generation on Google's Native Multimodal Model
by: Cazzaniga, Luca
Published: (2026)

Viewpoint Recommendation for Point Cloud Labeling through Interaction Cost Modeling
by: Zhang, Yu, et al.
Published: (2026)

SymbolSight: Minimizing Inter-Symbol Interference for Reading with Prosthetic Vision
by: Lesner, Jasmine, et al.
Published: (2026)

Towards an End-to-End System for 3D Tracking of Physical Objects in Virtual Immersive Environments
by: Knapiński, Stanisław, et al.
Published: (2026)

MicroBi-ConvLSTM: An Ultra-Lightweight Efficient Model for Human Activity Recognition on Resource Constrained Devices
by: Mandal, Mridankan
Published: (2026)

Real-Time Cellist Postural Evaluation With On-Device Computer Vision
by: Wang, Paolo, et al.
Published: (2026)