Saved in:
| Main Authors: | Kerkouri, Mohamed Amine, Tliba, Marouane, Chetouani, Aladine, Bruno, Alessandro |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2602.22049 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
What They Saw, Not Just Where They Looked: Semantic Scanpath Similarity via VLMs and NLP metric
by: Kerkouri, Mohamed Amine, et al.
Published: (2026)
by: Kerkouri, Mohamed Amine, et al.
Published: (2026)
Quantization Effects on Neural Networks Perception: How would quantization change the perceptual field of vision models?
by: Kerkouri, Mohamed Amine, et al.
Published: (2024)
by: Kerkouri, Mohamed Amine, et al.
Published: (2024)
Modeling Beyond MOS: Quality Assessment Models Must Integrate Context, Reasoning, and Multimodality
by: Kerkouri, Mohamed Amine, et al.
Published: (2025)
by: Kerkouri, Mohamed Amine, et al.
Published: (2025)
Shifting Focus: From Global Semantics to Local Prominent Features in Swin-Transformer for Knee Osteoarthritis Severity Assessment
by: Sekhri, Aymen, et al.
Published: (2024)
by: Sekhri, Aymen, et al.
Published: (2024)
Morphology-Aware KOA Classification: Integrating Graph Priors with Vision Models
by: Tliba, Marouane, et al.
Published: (2025)
by: Tliba, Marouane, et al.
Published: (2025)
Shifts in Doctors' Eye Movements Between Real and AI-Generated Medical Images
by: Wong, David C, et al.
Published: (2025)
by: Wong, David C, et al.
Published: (2025)
UF-AMA: A unified framework for cross-domain emotion recognition via adaptive multimodal alignment
by: Wang, Zheng, et al.
Published: (2026)
by: Wang, Zheng, et al.
Published: (2026)
Multi-face emotion detection for effective Human-Robot Interaction
by: Yahyaoui, Mohamed Ala, et al.
Published: (2025)
by: Yahyaoui, Mohamed Ala, et al.
Published: (2025)
Several questions of visual generation in 2024
by: Gu, Shuyang
Published: (2024)
by: Gu, Shuyang
Published: (2024)
Real-Time Hand Gesture Recognition: Integrating Skeleton-Based Data Fusion and Multi-Stream CNN
by: Yusuf, Oluwaleke, et al.
Published: (2024)
by: Yusuf, Oluwaleke, et al.
Published: (2024)
CG-MER: A Card Game-based Multimodal dataset for Emotion Recognition
by: Farhat, Nessrine, et al.
Published: (2025)
by: Farhat, Nessrine, et al.
Published: (2025)
Category-aware EEG image generation based on wavelet transform and contrast semantic loss
by: Zhang, Enshang, et al.
Published: (2025)
by: Zhang, Enshang, et al.
Published: (2025)
CT-DegradBench: A Physics-Informed Benchmark for CT Degradation Detection and Severity Estimation
by: Taifour, Yousra Nabila, et al.
Published: (2026)
by: Taifour, Yousra Nabila, et al.
Published: (2026)
Cross-user activity recognition using deep domain adaptation with temporal relation information
by: Ye, Xiaozhou, et al.
Published: (2024)
by: Ye, Xiaozhou, et al.
Published: (2024)
GroundUp: Rapid Sketch-Based 3D City Massing
by: Unlu, Gizem Esra, et al.
Published: (2024)
by: Unlu, Gizem Esra, et al.
Published: (2024)
Exploring Thermography Technology: A Comprehensive Facial Dataset for Face Detection, Recognition, and Emotion
by: Abuhussein, Mohamed Fawzi Abdelshafie, et al.
Published: (2024)
by: Abuhussein, Mohamed Fawzi Abdelshafie, et al.
Published: (2024)
Accurate online action and gesture recognition system using detectors and Deep SPD Siamese Networks
by: Akremi, Mohamed Sanim, et al.
Published: (2025)
by: Akremi, Mohamed Sanim, et al.
Published: (2025)
Computer Vision for Objects used in Group Work: Challenges and Opportunities
by: Jung, Changsoo, et al.
Published: (2025)
by: Jung, Changsoo, et al.
Published: (2025)
Weak-Annotation of HAR Datasets using Vision Foundation Models
by: Bock, Marius, et al.
Published: (2024)
by: Bock, Marius, et al.
Published: (2024)
Generalized Pose Space Embeddings for Training In-the-Wild using Anaylis-by-Synthesis
by: Borer, Dominik, et al.
Published: (2024)
by: Borer, Dominik, et al.
Published: (2024)
CADDI: An in-Class Activity Detection Dataset using IMU data from low-cost sensors
by: Marquez-Carpintero, Luis, et al.
Published: (2025)
by: Marquez-Carpintero, Luis, et al.
Published: (2025)
Unsupervised learning of Data-driven Facial Expression Coding System (DFECS) using keypoint tracking
by: Tripathi, Shivansh Chandra, et al.
Published: (2024)
by: Tripathi, Shivansh Chandra, et al.
Published: (2024)
GazeGPT: Augmenting Human Capabilities using Gaze-contingent Contextual AI for Smart Eyewear
by: Konrad, Robert, et al.
Published: (2024)
by: Konrad, Robert, et al.
Published: (2024)
Is Medieval Distant Viewing Possible? : Extending and Enriching Annotation of Legacy Image Collections using Visual Analytics
by: Meinecke, Christofer, et al.
Published: (2022)
by: Meinecke, Christofer, et al.
Published: (2022)
Implicit Search Intent Recognition using EEG and Eye Tracking: Novel Dataset and Cross-User Prediction
by: Sharma, Mansi, et al.
Published: (2025)
by: Sharma, Mansi, et al.
Published: (2025)
Deep Learning in Mild Cognitive Impairment Diagnosis using Eye Movements and Image Content in Visual Memory Tasks
by: Rocha, Tomás Silva Santos, et al.
Published: (2025)
by: Rocha, Tomás Silva Santos, et al.
Published: (2025)
Resource-Efficient Gesture Recognition using Low-Resolution Thermal Camera via Spiking Neural Networks and Sparse Segmentation
by: Safa, Ali, et al.
Published: (2024)
by: Safa, Ali, et al.
Published: (2024)
ASAP: Interpretable Analysis and Summarization of AI-generated Image Patterns at Scale
by: Huang, Jinbin, et al.
Published: (2024)
by: Huang, Jinbin, et al.
Published: (2024)
How good are humans at detecting AI-generated images? Learnings from an experiment
by: Roca, Thomas, et al.
Published: (2025)
by: Roca, Thomas, et al.
Published: (2025)
Combining Transformers and CNNs for Efficient Object Detection in High-Resolution Satellite Imagery
by: Drapier, Nicolas, et al.
Published: (2025)
by: Drapier, Nicolas, et al.
Published: (2025)
The Visual Experience Dataset: Over 200 Recorded Hours of Integrated Eye Movement, Odometry, and Egocentric Video
by: Greene, Michelle R., et al.
Published: (2024)
by: Greene, Michelle R., et al.
Published: (2024)
CoCoG-2: Controllable generation of visual stimuli for understanding human concept representation
by: Wei, Chen, et al.
Published: (2024)
by: Wei, Chen, et al.
Published: (2024)
Accurate Eye Tracking from Dense 3D Surface Reconstructions using Single-Shot Deflectometry
by: Wang, Jiazhang, et al.
Published: (2023)
by: Wang, Jiazhang, et al.
Published: (2023)
Exploring Emotion Expression Recognition in Older Adults Interacting with a Virtual Coach
by: Palmero, Cristina, et al.
Published: (2023)
by: Palmero, Cristina, et al.
Published: (2023)
SCHEMA for Gemini 3 Pro Image: A Structured Methodology for Controlled AI Image Generation on Google's Native Multimodal Model
by: Cazzaniga, Luca
Published: (2026)
by: Cazzaniga, Luca
Published: (2026)
Viewpoint Recommendation for Point Cloud Labeling through Interaction Cost Modeling
by: Zhang, Yu, et al.
Published: (2026)
by: Zhang, Yu, et al.
Published: (2026)
SymbolSight: Minimizing Inter-Symbol Interference for Reading with Prosthetic Vision
by: Lesner, Jasmine, et al.
Published: (2026)
by: Lesner, Jasmine, et al.
Published: (2026)
Towards an End-to-End System for 3D Tracking of Physical Objects in Virtual Immersive Environments
by: Knapiński, Stanisław, et al.
Published: (2026)
by: Knapiński, Stanisław, et al.
Published: (2026)
MicroBi-ConvLSTM: An Ultra-Lightweight Efficient Model for Human Activity Recognition on Resource Constrained Devices
by: Mandal, Mridankan
Published: (2026)
by: Mandal, Mridankan
Published: (2026)
Real-Time Cellist Postural Evaluation With On-Device Computer Vision
by: Wang, Paolo, et al.
Published: (2026)
by: Wang, Paolo, et al.
Published: (2026)
Similar Items
-
What They Saw, Not Just Where They Looked: Semantic Scanpath Similarity via VLMs and NLP metric
by: Kerkouri, Mohamed Amine, et al.
Published: (2026) -
Quantization Effects on Neural Networks Perception: How would quantization change the perceptual field of vision models?
by: Kerkouri, Mohamed Amine, et al.
Published: (2024) -
Modeling Beyond MOS: Quality Assessment Models Must Integrate Context, Reasoning, and Multimodality
by: Kerkouri, Mohamed Amine, et al.
Published: (2025) -
Shifting Focus: From Global Semantics to Local Prominent Features in Swin-Transformer for Knee Osteoarthritis Severity Assessment
by: Sekhri, Aymen, et al.
Published: (2024) -
Morphology-Aware KOA Classification: Integrating Graph Priors with Vision Models
by: Tliba, Marouane, et al.
Published: (2025)