| Main Authors: | Simpsi, Andrea, Aspesi, Andrea, Mentasti, Simone, Merigo, Luca, Ongarello, Tommaso, Matteucci, Matteo |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2502.03057 |
Similar Items
EETnet: a CNN for Gaze Detection and Tracking for Smart-Eyewear
by: Aspesi, Andrea, et al.
Published: (2025)
SGR-OCC: Evolving Monocular Priors for Embodied 3D Occupancy Prediction via Soft-Gating Lifting and Semantic-Adaptive Geometric Refinement
by: Guo, Yiran, et al.
Published: (2026)
A Spatio-temporal Graph Network Allowing Incomplete Trajectory Input for Pedestrian Trajectory Prediction
by: Long, Juncen, et al.
Published: (2025)
From Pixels to Graphs: using Scene and Knowledge Graphs for HD-EPIC VQA Challenge
by: Taluzzi, Agnese, et al.
Published: (2025)
Polarization-resolved imaging improves eye tracking
by: Žurauskas, Mantas, et al.
Published: (2025)
Eyepiece-free pupil-optimized holographic near-eye displays
by: Zhou, Jie, et al.
Published: (2025)
Swap It Like Its Hot: Segmentation-based spoof attacks on eye-tracking images
by: Narkar, Anish S., et al.
Published: (2024)
Act, Think or Abstain: Complexity-Aware Adaptive Inference for Vision-Language-Action Models
by: Izzo, Riccardo Andrea, et al.
Published: (2026)
Rapidly deploying on-device eye tracking by distilling visual foundation models
by: Jiang, Cheng, et al.
Published: (2026)
A deep learning approach to track eye movements based on events
by: Seth, Chirag, et al.
Published: (2025)
BVMatch: Lidar-based Place Recognition Using Bird's-eye View Images
by: Luo, Lun, et al.
Published: (2021)
Lift-Attend-Splat: Bird's-eye-view camera-lidar fusion using transformers
by: Gunn, James, et al.
Published: (2023)
SCENEFORGE: Enhancing 3D-text alignment with Structured Scene Compositions
by: Sbrolli, Cristian, et al.
Published: (2025)
Few Shot Semantic Segmentation: a review of methodologies, benchmarks, and open challenges
by: Catalano, Nico, et al.
Published: (2023)
No Captions, No Problem: Captionless 3D-CLIP Alignment with Hard Negatives via CLIP Knowledge and LLMs
by: Sbrolli, Cristian, et al.
Published: (2024)
EmMixformer: Mix transformer for eye movement recognition
by: Qin, Huafeng, et al.
Published: (2024)
Dermatologist-like explainable AI enhances melanoma diagnosis accuracy: eye-tracking study
by: Chanda, Tirtha, et al.
Published: (2024)
The Empirical Impact of Forgetting and Transfer in Continual Visual Odometry
by: Cudrano, Paolo, et al.
Published: (2024)
One target to align them all: LiDAR, RGB and event cameras extrinsic calibration for Autonomous Driving
by: Bertogalli, Andrea, et al.
Published: (2025)
DOC-Depth: A novel approach for dense depth ground truth generation
by: de Moreau, Simon, et al.
Published: (2025)
Federated Knowledge Recycling: Privacy-Preserving Synthetic Data Sharing
by: Lomurno, Eugenio, et al.
Published: (2024)
Eye-for-an-eye: Appearance Transfer with Semantic Correspondence in Diffusion Models
by: Go, Sooyeon, et al.
Published: (2024)
VitalVideos-Europe: A dataset of face videos with PPG and blood pressure ground truths
by: Toye, Pieter-Jan
Published: (2023)
MARS: a Multimodal Alignment and Ranking System for Few-Shot Segmentation
by: Catalano, Nico, et al.
Published: (2025)
Detecting and refurbishing ground truth errors during training of deep learning-based echocardiography segmentation models
by: Islam, Iman, et al.
Published: (2026)
Can Shape-Infused Joint Embeddings Improve Image-Conditioned 3D Diffusion?
by: Sbrolli, Cristian, et al.
Published: (2024)
Auto-Comp: An Automated Pipeline for Scalable Compositional Probing of Contrastive Vision-Language Models
by: Sbrolli, Cristian, et al.
Published: (2026)
GazeSCRNN: Event-based Near-eye Gaze Tracking using a Spiking Neural Network
by: Groenen, Stijn, et al.
Published: (2025)
Decentralized LoRA augmented transformer with multi-scale feature learning for secured eye diagnosis
by: Borno, Md. Naimur Asif, et al.
Published: (2025)
Multi-modal Collaborative Optimization and Expansion Network for Event-assisted Single-eye Expression Recognition
by: Han, Runduo, et al.
Published: (2025)
Towards mitigating uncann(eye)ness in face swaps via gaze-centric loss terms
by: Wilson, Ethan, et al.
Published: (2024)
Confidence-aware multi-modality learning for eye disease screening
by: Zou, Ke, et al.
Published: (2024)
Harnessing the power of longitudinal medical imaging for eye disease prognosis using Transformer-based sequence modeling
by: Holste, Gregory, et al.
Published: (2024)
HiERO-StepG @ Ego4D Step Grounding Challenge: hierarchical activity understanding enables zero-shot step grounding
by: Zenotto, Andrea, et al.
Published: (2026)
LiDAR-Event Stereo Fusion with Hallucinations
by: Bartolomei, Luca, et al.
Published: (2024)
No Pose, No Problem in 4D: Feed-Forward Dynamic Gaussians from Unposed Multi-View Videos
by: Balice, Matteo, et al.
Published: (2026)
Dual input stream transformer for vertical drift correction in eye-tracking reading data
by: Mercier, Thomas M., et al.
Published: (2023)
Stable Diffusion Dataset Generation for Downstream Classification Tasks
by: Lomurno, Eugenio, et al.
Published: (2024)
Bird's-eye view safety monitoring for the construction top under the tower crane
by: Wang, Yanke, et al.
Published: (2025)
Improving Robustness of Vision-Language-Action Models by Restoring Corrupted Visual Inputs
by: Orjuela, Daniel Yezid Guarnizo, et al.
Published: (2026)