:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Simpsi, Andrea, Aspesi, Andrea, Mentasti, Simone, Merigo, Luca, Ongarello, Tommaso, Matteucci, Matteo
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2502.03057
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

EETnet: a CNN for Gaze Detection and Tracking for Smart-Eyewear
by: Aspesi, Andrea, et al.
Published: (2025)

SGR-OCC: Evolving Monocular Priors for Embodied 3D Occupancy Prediction via Soft-Gating Lifting and Semantic-Adaptive Geometric Refinement
by: Guo, Yiran, et al.
Published: (2026)

A Spatio-temporal Graph Network Allowing Incomplete Trajectory Input for Pedestrian Trajectory Prediction
by: Long, Juncen, et al.
Published: (2025)

From Pixels to Graphs: using Scene and Knowledge Graphs for HD-EPIC VQA Challenge
by: Taluzzi, Agnese, et al.
Published: (2025)

Polarization-resolved imaging improves eye tracking
by: Žurauskas, Mantas, et al.
Published: (2025)

Eyepiece-free pupil-optimized holographic near-eye displays
by: Zhou, Jie, et al.
Published: (2025)

Swap It Like Its Hot: Segmentation-based spoof attacks on eye-tracking images
by: Narkar, Anish S., et al.
Published: (2024)

Act, Think or Abstain: Complexity-Aware Adaptive Inference for Vision-Language-Action Models
by: Izzo, Riccardo Andrea, et al.
Published: (2026)

Rapidly deploying on-device eye tracking by distilling visual foundation models
by: Jiang, Cheng, et al.
Published: (2026)

A deep learning approach to track eye movements based on events
by: Seth, Chirag, et al.
Published: (2025)

BVMatch: Lidar-based Place Recognition Using Bird's-eye View Images
by: Luo, Lun, et al.
Published: (2021)

Lift-Attend-Splat: Bird's-eye-view camera-lidar fusion using transformers
by: Gunn, James, et al.
Published: (2023)

SCENEFORGE: Enhancing 3D-text alignment with Structured Scene Compositions
by: Sbrolli, Cristian, et al.
Published: (2025)

Few Shot Semantic Segmentation: a review of methodologies, benchmarks, and open challenges
by: Catalano, Nico, et al.
Published: (2023)

No Captions, No Problem: Captionless 3D-CLIP Alignment with Hard Negatives via CLIP Knowledge and LLMs
by: Sbrolli, Cristian, et al.
Published: (2024)

EmMixformer: Mix transformer for eye movement recognition
by: Qin, Huafeng, et al.
Published: (2024)

Dermatologist-like explainable AI enhances melanoma diagnosis accuracy: eye-tracking study
by: Chanda, Tirtha, et al.
Published: (2024)

The Empirical Impact of Forgetting and Transfer in Continual Visual Odometry
by: Cudrano, Paolo, et al.
Published: (2024)

One target to align them all: LiDAR, RGB and event cameras extrinsic calibration for Autonomous Driving
by: Bertogalli, Andrea, et al.
Published: (2025)

DOC-Depth: A novel approach for dense depth ground truth generation
by: de Moreau, Simon, et al.
Published: (2025)

Federated Knowledge Recycling: Privacy-Preserving Synthetic Data Sharing
by: Lomurno, Eugenio, et al.
Published: (2024)

Eye-for-an-eye: Appearance Transfer with Semantic Correspondence in Diffusion Models
by: Go, Sooyeon, et al.
Published: (2024)

VitalVideos-Europe: A dataset of face videos with PPG and blood pressure ground truths
by: Toye, Pieter-Jan
Published: (2023)

MARS: a Multimodal Alignment and Ranking System for Few-Shot Segmentation
by: Catalano, Nico, et al.
Published: (2025)

Detecting and refurbishing ground truth errors during training of deep learning-based echocardiography segmentation models
by: Islam, Iman, et al.
Published: (2026)

Can Shape-Infused Joint Embeddings Improve Image-Conditioned 3D Diffusion?
by: Sbrolli, Cristian, et al.
Published: (2024)

Auto-Comp: An Automated Pipeline for Scalable Compositional Probing of Contrastive Vision-Language Models
by: Sbrolli, Cristian, et al.
Published: (2026)

GazeSCRNN: Event-based Near-eye Gaze Tracking using a Spiking Neural Network
by: Groenen, Stijn, et al.
Published: (2025)

Decentralized LoRA augmented transformer with multi-scale feature learning for secured eye diagnosis
by: Borno, Md. Naimur Asif, et al.
Published: (2025)

Multi-modal Collaborative Optimization and Expansion Network for Event-assisted Single-eye Expression Recognition
by: Han, Runduo, et al.
Published: (2025)

Towards mitigating uncann(eye)ness in face swaps via gaze-centric loss terms
by: Wilson, Ethan, et al.
Published: (2024)

Confidence-aware multi-modality learning for eye disease screening
by: Zou, Ke, et al.
Published: (2024)

Harnessing the power of longitudinal medical imaging for eye disease prognosis using Transformer-based sequence modeling
by: Holste, Gregory, et al.
Published: (2024)

HiERO-StepG @ Ego4D Step Grounding Challenge: hierarchical activity understanding enables zero-shot step grounding
by: Zenotto, Andrea, et al.
Published: (2026)

LiDAR-Event Stereo Fusion with Hallucinations
by: Bartolomei, Luca, et al.
Published: (2024)

No Pose, No Problem in 4D: Feed-Forward Dynamic Gaussians from Unposed Multi-View Videos
by: Balice, Matteo, et al.
Published: (2026)

Dual input stream transformer for vertical drift correction in eye-tracking reading data
by: Mercier, Thomas M., et al.
Published: (2023)

Stable Diffusion Dataset Generation for Downstream Classification Tasks
by: Lomurno, Eugenio, et al.
Published: (2024)

Bird's-eye view safety monitoring for the construction top under the tower crane
by: Wang, Yanke, et al.
Published: (2025)

Improving Robustness of Vision-Language-Action Models by Restoring Corrupted Visual Inputs
by: Orjuela, Daniel Yezid Guarnizo, et al.
Published: (2026)