Saved in:
| Main Authors: | Santos-Villafranca, Maria, Bermudez-cameo, Jesus, Perez-Yus, Alejandro, Farinella, Giovanni Maria, Furnari, Antonino |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2606.02246 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Calisthenics Skills Temporal Video Segmentation
by: Finocchiaro, Antonio, et al.
Published: (2025)
by: Finocchiaro, Antonio, et al.
Published: (2025)
Task Graph Maximum Likelihood Estimation for Procedural Activity Understanding in Egocentric Videos
by: Seminara, Luigi, et al.
Published: (2025)
by: Seminara, Luigi, et al.
Published: (2025)
Multimodal Knowledge Distillation for Egocentric Action Recognition Robust to Missing Modalities
by: Santos-Villafranca, Maria, et al.
Published: (2025)
by: Santos-Villafranca, Maria, et al.
Published: (2025)
Synchronization is All You Need: Exocentric-to-Egocentric Transfer for Temporal Action Segmentation with Unlabeled Synchronized Video Pairs
by: Quattrocchi, Camillo, et al.
Published: (2023)
by: Quattrocchi, Camillo, et al.
Published: (2023)
Exploiting Multimodal Synthetic Data for Egocentric Human-Object Interaction Detection in an Industrial Scenario
by: Leonardi, Rosario, et al.
Published: (2023)
by: Leonardi, Rosario, et al.
Published: (2023)
Differentiable Task Graph Learning: Procedural Activity Representation and Online Mistake Detection from Egocentric Videos
by: Seminara, Luigi, et al.
Published: (2024)
by: Seminara, Luigi, et al.
Published: (2024)
Mamba-OTR: a Mamba-based Solution for Online Take and Release Detection from Untrimmed Egocentric Video
by: Catinello, Alessandro Sebastiano, et al.
Published: (2025)
by: Catinello, Alessandro Sebastiano, et al.
Published: (2025)
Are Synthetic Data Useful for Egocentric Hand-Object Interaction Detection?
by: Leonardi, Rosario, et al.
Published: (2023)
by: Leonardi, Rosario, et al.
Published: (2023)
Leveraging Synthetic Data for Enhancing Egocentric Hand-Object Interaction Detection
by: Leonardi, Rosario, et al.
Published: (2026)
by: Leonardi, Rosario, et al.
Published: (2026)
Gazing Into Missteps: Leveraging Eye-Gaze for Unsupervised Mistake Detection in Egocentric Videos of Skilled Human Activities
by: Mazzamuto, Michele, et al.
Published: (2024)
by: Mazzamuto, Michele, et al.
Published: (2024)
Ego-EXTRA: video-language Egocentric Dataset for EXpert-TRAinee assistance
by: Ragusa, Francesco, et al.
Published: (2025)
by: Ragusa, Francesco, et al.
Published: (2025)
Semantically Guided Action Anticipation
by: Diko, Anxhelo, et al.
Published: (2024)
by: Diko, Anxhelo, et al.
Published: (2024)
Efficient Calisthenics Skills Classification through Foreground Instance Selection and Depth Estimation
by: Finocchiaro, Antonio, et al.
Published: (2025)
by: Finocchiaro, Antonio, et al.
Published: (2025)
StillFast: An End-to-End Approach for Short-Term Object Interaction Anticipation
by: Ragusa, Francesco, et al.
Published: (2023)
by: Ragusa, Francesco, et al.
Published: (2023)
How Far Can Off-the-Shelf Multimodal Large Language Models Go in Online Episodic Memory Question Answering?
by: Lando, Giuseppe, et al.
Published: (2025)
by: Lando, Giuseppe, et al.
Published: (2025)
O-MaMa: Learning Object Mask Matching between Egocentric and Exocentric Views
by: Mur-Labadia, Lorenzo, et al.
Published: (2025)
by: Mur-Labadia, Lorenzo, et al.
Published: (2025)
Learning Egocentric In-Hand Object Segmentation through Weak Supervision from Human Narrations
by: Messina, Nicola, et al.
Published: (2025)
by: Messina, Nicola, et al.
Published: (2025)
EASG-Bench: Video Q&A Benchmark with Egocentric Action Scene Graphs
by: Rodin, Ivan, et al.
Published: (2025)
by: Rodin, Ivan, et al.
Published: (2025)
A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains
by: Finocchiaro, Antonio, et al.
Published: (2025)
by: Finocchiaro, Antonio, et al.
Published: (2025)
An Outlook into the Future of Egocentric Vision
by: Plizzari, Chiara, et al.
Published: (2023)
by: Plizzari, Chiara, et al.
Published: (2023)
ProSkill: Segment-Level Skill Assessment in Procedural Videos
by: Mazzamuto, Michele, et al.
Published: (2026)
by: Mazzamuto, Michele, et al.
Published: (2026)
Convolution kernel adaptation to calibrated fisheye
by: Berenguel-Baeta, Bruno, et al.
Published: (2024)
by: Berenguel-Baeta, Bruno, et al.
Published: (2024)
GlovEgo-HOI: Bridging the Synthetic-to-Real Gap for Industrial Egocentric Human-Object Interaction Detection
by: Spoto, Alfio, et al.
Published: (2026)
by: Spoto, Alfio, et al.
Published: (2026)
Online Episodic Memory Visual Query Localization with Egocentric Streaming Object Memory
by: Manigrasso, Zaira, et al.
Published: (2024)
by: Manigrasso, Zaira, et al.
Published: (2024)
ENIGMA-360: An Ego-Exo Dataset for Human Behavior Understanding in Industrial Scenarios
by: Ragusa, Francesco, et al.
Published: (2026)
by: Ragusa, Francesco, et al.
Published: (2026)
AFF-ttention! Affordances and Attention models for Short-Term Object Interaction Anticipation
by: Mur-Labadia, Lorenzo, et al.
Published: (2024)
by: Mur-Labadia, Lorenzo, et al.
Published: (2024)
EgoInteract: Synthetic Egocentric Videos Generation for Interaction Understanding and Anticipation
by: Leonardi, Rosario, et al.
Published: (2026)
by: Leonardi, Rosario, et al.
Published: (2026)
PREGO: online mistake detection in PRocedural EGOcentric videos
by: Flaborea, Alessandro, et al.
Published: (2024)
by: Flaborea, Alessandro, et al.
Published: (2024)
EGOSTREAM: A Diagnostic Benchmark for Streaming Episodic Memory in Egocentric Vision
by: Forte, Rosario, et al.
Published: (2026)
by: Forte, Rosario, et al.
Published: (2026)
Leveraging Gaze and Set-of-Mark in VLLMs for Human-Object Interaction Anticipation from Egocentric Videos
by: Materia, Daniele, et al.
Published: (2026)
by: Materia, Daniele, et al.
Published: (2026)
Integrating Affordances and Attention models for Short-Term Object Interaction Anticipation
by: Labadia, Lorenzo Mur, et al.
Published: (2026)
by: Labadia, Lorenzo Mur, et al.
Published: (2026)
Exploring Multimodal LMMs for Online Episodic Memory Question Answering on the Edge
by: Lando, Giuseppe, et al.
Published: (2026)
by: Lando, Giuseppe, et al.
Published: (2026)
SignIT: A Comprehensive Dataset and Multimodal Analysis for Italian Sign Language Recognition
by: Micieli, Alessia, et al.
Published: (2025)
by: Micieli, Alessia, et al.
Published: (2025)
ZARRIO @ Ego4D Short Term Object Interaction Anticipation Challenge: Leveraging Affordances and Attention-based models for STA
by: Mur-Labadia, Lorenzo, et al.
Published: (2024)
by: Mur-Labadia, Lorenzo, et al.
Published: (2024)
EgoPrompt: Prompt Learning for Egocentric Action Recognition
by: Lyu, Huaihai, et al.
Published: (2025)
by: Lyu, Huaihai, et al.
Published: (2025)
TI-PREGO: Chain of Thought and In-Context Learning for Online Mistake Detection in PRocedural EGOcentric Videos
by: Plini, Leonardo, et al.
Published: (2024)
by: Plini, Leonardo, et al.
Published: (2024)
EgoAction: Egocentric Action Composition with Reliability-Aware Temporal Fusion for the EPIC-KITCHENS Action Detection Challenge at CVPR 2026
by: Fu, Zhiheng, et al.
Published: (2026)
by: Fu, Zhiheng, et al.
Published: (2026)
Ego-VPA: Egocentric Video Understanding with Parameter-efficient Adaptation
by: Wu, Tz-Ying, et al.
Published: (2024)
by: Wu, Tz-Ying, et al.
Published: (2024)
EgoThinker: Unveiling Egocentric Reasoning with Spatio-Temporal CoT
by: Pei, Baoqi, et al.
Published: (2025)
by: Pei, Baoqi, et al.
Published: (2025)
EgoGraph: Temporal Knowledge Graph for Egocentric Video Understanding
by: Sun, Shitong, et al.
Published: (2026)
by: Sun, Shitong, et al.
Published: (2026)
Similar Items
-
Calisthenics Skills Temporal Video Segmentation
by: Finocchiaro, Antonio, et al.
Published: (2025) -
Task Graph Maximum Likelihood Estimation for Procedural Activity Understanding in Egocentric Videos
by: Seminara, Luigi, et al.
Published: (2025) -
Multimodal Knowledge Distillation for Egocentric Action Recognition Robust to Missing Modalities
by: Santos-Villafranca, Maria, et al.
Published: (2025) -
Synchronization is All You Need: Exocentric-to-Egocentric Transfer for Temporal Action Segmentation with Unlabeled Synchronized Video Pairs
by: Quattrocchi, Camillo, et al.
Published: (2023) -
Exploiting Multimodal Synthetic Data for Egocentric Human-Object Interaction Detection in an Industrial Scenario
by: Leonardi, Rosario, et al.
Published: (2023)