Saved in:
| Main Authors: | Tur, Anil Osman, Conti, Alessandro, Beyan, Cigdem, Boscaini, Davide, Larcher, Roberto, Messelodi, Stefano, Poiesi, Fabio, Ricci, Elisa |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2409.14963 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
AI-driven visual monitoring of industrial assembly tasks
by: Nardon, Mattia, et al.
Published: (2025)
by: Nardon, Mattia, et al.
Published: (2025)
Towards Unconstrained Human-Object Interaction
by: Tonini, Francesco, et al.
Published: (2026)
by: Tonini, Francesco, et al.
Published: (2026)
Accurate and efficient zero-shot 6D pose estimation with frozen foundation models
by: Caraffa, Andrea, et al.
Published: (2025)
by: Caraffa, Andrea, et al.
Published: (2025)
Dynamic Scoring with Enhanced Semantics for Training-Free Human-Object Interaction Detection
by: Tonini, Francesco, et al.
Published: (2025)
by: Tonini, Francesco, et al.
Published: (2025)
Revisiting Fully Convolutional Geometric Features for Object 6D Pose Estimation
by: Jaime Corsetti Davide Boscaini Fabio Poiesi
Published: (2026)
by: Jaime Corsetti Davide Boscaini Fabio Poiesi
Published: (2026)
Leveraging Confident Image Regions for Source-Free Domain-Adaptive Object Detection
by: Mekhalfi, Mohamed Lamine, et al.
Published: (2025)
by: Mekhalfi, Mohamed Lamine, et al.
Published: (2025)
FreeZe: Training-free zero-shot 6D pose estimation with geometric and vision foundation models
by: Caraffa, Andrea, et al.
Published: (2023)
by: Caraffa, Andrea, et al.
Published: (2023)
Distilling 3D distinctive local descriptors for 6D pose estimation
by: Hamza, Amir, et al.
Published: (2025)
by: Hamza, Amir, et al.
Published: (2025)
Open-vocabulary object 6D pose estimation
by: Corsetti, Jaime, et al.
Published: (2023)
by: Corsetti, Jaime, et al.
Published: (2023)
Functionality understanding and segmentation in 3D scenes
by: Corsetti, Jaime, et al.
Published: (2024)
by: Corsetti, Jaime, et al.
Published: (2024)
Generative 6D Pose Estimation via Conditional Flow Matching
by: Hamza, Amir, et al.
Published: (2026)
by: Hamza, Amir, et al.
Published: (2026)
MadCLIP: Few-shot Medical Anomaly Detection with CLIP
by: Shiri, Mahshid, et al.
Published: (2025)
by: Shiri, Mahshid, et al.
Published: (2025)
Discriminator-Guided Adaptive Diffusion for Source-Free Test-Time Adaptation under Image Corruptions
by: Olivato, Francesco, et al.
Published: (2026)
by: Olivato, Francesco, et al.
Published: (2026)
Specificity-aware reinforcement learning for fine-grained open-world classification
by: Angheben, Samuele, et al.
Published: (2026)
by: Angheben, Samuele, et al.
Published: (2026)
AL-GTD: Deep Active Learning for Gaze Target Detection
by: Tonini, Francesco, et al.
Published: (2024)
by: Tonini, Francesco, et al.
Published: (2024)
HAC: Parameter-Efficient Hyperbolic Adaptation of CLIP for Zero-Shot VQA
by: Dibitonto, Francesco, et al.
Published: (2026)
by: Dibitonto, Francesco, et al.
Published: (2026)
3D Part Segmentation via Geometric Aggregation of 2D Visual Features
by: Garosi, Marco, et al.
Published: (2024)
by: Garosi, Marco, et al.
Published: (2024)
High-resolution open-vocabulary object 6D pose estimation
by: Corsetti, Jaime, et al.
Published: (2024)
by: Corsetti, Jaime, et al.
Published: (2024)
CLIP-VAD: Exploiting Vision-Language Models for Voice Activity Detection
by: Appiani, Andrea, et al.
Published: (2024)
by: Appiani, Andrea, et al.
Published: (2024)
Geometrically-driven Aggregation for Zero-shot 3D Point Cloud Understanding
by: Mei, Guofeng, et al.
Published: (2023)
by: Mei, Guofeng, et al.
Published: (2023)
Novel class discovery meets foundation models for 3D semantic segmentation
by: Riz, Luigi, et al.
Published: (2023)
by: Riz, Luigi, et al.
Published: (2023)
Test-Time Zero-Shot Temporal Action Localization
by: Liberatori, Benedetta, et al.
Published: (2024)
by: Liberatori, Benedetta, et al.
Published: (2024)
MADPOT: Medical Anomaly Detection with CLIP Adaptation and Partial Optimal Transport
by: Shiri, Mahshid, et al.
Published: (2025)
by: Shiri, Mahshid, et al.
Published: (2025)
Anticipating Next Active Objects for Egocentric Videos
by: Thakur, Sanket, et al.
Published: (2023)
by: Thakur, Sanket, et al.
Published: (2023)
Geometry-Conditioned Diffusion for Occlusion-Robust In-Bed Pose Estimation
by: Khameneh, Navid Aslankhani, et al.
Published: (2026)
by: Khameneh, Navid Aslankhani, et al.
Published: (2026)
Action-guided generation of 3D functionality segmentation data
by: Corsetti, Jaime, et al.
Published: (2025)
by: Corsetti, Jaime, et al.
Published: (2025)
Vocabulary-free Image Classification
by: Conti, Alessandro, et al.
Published: (2023)
by: Conti, Alessandro, et al.
Published: (2023)
Zero-Shot Temporal Action Localization Through Textual Guidance
by: Liberatori, Benedetta, et al.
Published: (2026)
by: Liberatori, Benedetta, et al.
Published: (2026)
Vocabulary-free Image Classification and Semantic Segmentation
by: Conti, Alessandro, et al.
Published: (2024)
by: Conti, Alessandro, et al.
Published: (2024)
MAMBO: High-Resolution Generative Approach for Mammography Images
by: Škipina, Milica, et al.
Published: (2025)
by: Škipina, Milica, et al.
Published: (2025)
Upper-Body Pose-based Gaze Estimation for Privacy-Preserving 3D Gaze Target Detection
by: Toaiari, Andrea, et al.
Published: (2024)
by: Toaiari, Andrea, et al.
Published: (2024)
Lifelong Imitation Learning with Multimodal Latent Replay and Incremental Adjustment
by: Yu, Fanqi, et al.
Published: (2026)
by: Yu, Fanqi, et al.
Published: (2026)
Large Multimodal Models as General In-Context Classifiers
by: Garosi, Marco, et al.
Published: (2026)
by: Garosi, Marco, et al.
Published: (2026)
Compositional Caching for Training-free Open-vocabulary Attribute Detection
by: Garosi, Marco, et al.
Published: (2025)
by: Garosi, Marco, et al.
Published: (2025)
Fine-grained Preference Optimization Improves Zero-shot Text-to-Speech
by: Yao, Jixun, et al.
Published: (2025)
by: Yao, Jixun, et al.
Published: (2025)
Fine-grained Abnormality Prompt Learning for Zero-shot Anomaly Detection
by: Zhu, Jiawen, et al.
Published: (2024)
by: Zhu, Jiawen, et al.
Published: (2024)
UniFine: A Unified and Fine-grained Approach for Zero-shot Vision-Language Understanding
by: Wang, Zhecan, et al.
Published: (2023)
by: Wang, Zhecan, et al.
Published: (2023)
Democratizing Fine-grained Visual Recognition with Large Language Models
by: Liu, Mingxuan, et al.
Published: (2024)
by: Liu, Mingxuan, et al.
Published: (2024)
AnimeAdapter: Fine-grained and Consistent Zero-shot Anime Character Generation
by: Han, Yixuan
Published: (2026)
by: Han, Yixuan
Published: (2026)
Letter to the editor: Comparing the coagulation and platelet parameters of women with premature ovarian insufficiency with those of age‐matched controls: A case–control study
by: Cengiz Beyan
Published: (2025)
by: Cengiz Beyan
Published: (2025)
Similar Items
-
AI-driven visual monitoring of industrial assembly tasks
by: Nardon, Mattia, et al.
Published: (2025) -
Towards Unconstrained Human-Object Interaction
by: Tonini, Francesco, et al.
Published: (2026) -
Accurate and efficient zero-shot 6D pose estimation with frozen foundation models
by: Caraffa, Andrea, et al.
Published: (2025) -
Dynamic Scoring with Enhanced Semantics for Training-Free Human-Object Interaction Detection
by: Tonini, Francesco, et al.
Published: (2025) -
Revisiting Fully Convolutional Geometric Features for Object 6D Pose Estimation
by: Jaime Corsetti Davide Boscaini Fabio Poiesi
Published: (2026)