:: Library Catalog

Imagen de Portada

Guardado en:

Detalles Bibliográficos
Autores principales:	Kriegler, Andreas, Beleznai, Csaba, Gelautz, Margrit
Formato:	Preprint
Publicado:	2026
Materias:	Computer Vision and Pattern Recognition Geometric Topology 68T45 (Primary), 52B15, 52-08 I.2.10; I.4.8; I.5
Acceso en línea:	https://arxiv.org/abs/2604.18208
Etiquetas:	Agregar Etiqueta Sin Etiquetas, Sea el primero en etiquetar este registro!

Ejemplares similares

Polarization-Based Eye Tracking with Personalized Siamese Architectures
por: Kalkanli, Beyza, et al.
Publicado: (2026)

Key-Scan-Based Mobile Robot Navigation: Integrated Mapping, Planning, and Control using Graphs of Scan Regions
por: Latha, Dharshan Bashkaran, et al.
Publicado: (2024)

Smelly, dense, and spreaded: The Object Detection for Olfactory References (ODOR) dataset
por: Zinnen, Mathias, et al.
Publicado: (2025)

MSPCaps: A Multi-Scale Patchify Capsule Network with Cross-Agreement Routing for Visual Recognition
por: Hu, Yudong, et al.
Publicado: (2025)

Floorplan2Guide: LLM-Guided Floorplan Parsing for BLV Indoor Navigation
por: Ayanzadeh, Aydin, et al.
Publicado: (2025)

AntiGrounding: Lifting Robotic Actions into VLM Representation Space for Decision Making
por: Li, Wenbo, et al.
Publicado: (2025)

T-Rex: Task-Adaptive Spatial Representation Extraction for Robotic Manipulation with Vision-Language Models
por: Chen, Yiteng, et al.
Publicado: (2025)

Extraction Of Cumulative Blobs From Dynamic Gestures
por: Naulakha, Rishabh, et al.
Publicado: (2025)

Spectral Integrated Gradients for Coarse-to-Fine Feature Attribution
por: Kim, Soyeon, et al.
Publicado: (2026)

Manifold-Aligned Guided Integrated Gradients for Reliable Feature Attribution
por: Kim, Soyeon, et al.
Publicado: (2026)

On the convex hull of integer points above the hyperbola
por: Alcántara, David, et al.
Publicado: (2025)

VLM-NCD:Novel Class Discovery with Vision-Based Large Language Models
por: Su, Yuetong, et al.
Publicado: (2025)

SoccerChat: Integrating Multimodal Data for Enhanced Soccer Game Understanding
por: Gautam, Sushant, et al.
Publicado: (2025)

AI-Powered Augmented Reality for Satellite Assembly, Integration and Test
por: Patricio, Alvaro, et al.
Publicado: (2024)

DSER: Spectral Epipolar Representation for Efficient Light Field Depth Estimation
por: Mohammad, Noor Islam S., et al.
Publicado: (2025)

Topology-Aware 3D Gaussian Splatting: Leveraging Persistent Homology for Optimized Structural Integrity
por: Shen, Tianqi, et al.
Publicado: (2024)

Deep Learning-Based Multi-Object Tracking: A Comprehensive Survey from Foundations to State-of-the-Art
por: Adžemović, Momir
Publicado: (2025)

A large-scale, physically-based synthetic dataset for satellite pose estimation
por: Velkei, Szabolcs, et al.
Publicado: (2025)

NeuroWrite: Predictive Handwritten Digit Classification using Deep Neural Networks
por: Asish, Kottakota, et al.
Publicado: (2023)

MEGA-GUI: Multi-stage Enhanced Grounding Agents for GUI Elements
por: Kwak, SeokJoo, et al.
Publicado: (2025)

VDPP: Video Depth Post-Processing for Speed and Scalability
por: Yoon, Daewon, et al.
Publicado: (2026)

From Gaze to Insight: Bridging Human Visual Attention and Vision Language Model Explanation for Weakly-Supervised Medical Image Segmentation
por: Chen, Jingkun, et al.
Publicado: (2025)

CLIP-Joint-Detect: End-to-End Joint Training of Object Detectors with Contrastive Vision-Language Supervision
por: Raoufi, Behnam, et al.
Publicado: (2025)

3DreamBooth: High-Fidelity 3D Subject-Driven Video Generation Model
por: Ko, Hyun-kyu, et al.
Publicado: (2026)

Enhancing Maritime Object Detection in Real-Time with RT-DETR and Data Augmentation
por: Nemati, Nader
Publicado: (2025)

Position: Age Estimation Models Do Not Process Biometric Data
por: Marshalkin, Nikita
Publicado: (2026)

HEDGE: Hallucination Estimation via Dense Geometric Entropy for VQA with Vision-Language Models
por: Gautam, Sushant, et al.
Publicado: (2025)

Fast Normalized Cross-Correlation for Template Matching with Rotations
por: Almira, José María, et al.
Publicado: (2023)

Semantic2Graph: Graph-based Multi-modal Feature Fusion for Action Segmentation in Videos
por: Zhang, Junbin, et al.
Publicado: (2022)

Predictive Modeling of Maritime Radar Data Using Transformer Architecture
por: Qesaraku, Bjorna, et al.
Publicado: (2025)

BG-YOLO: A Bidirectional-Guided Method for Underwater Object Detection
por: Zhang, Jian, et al.
Publicado: (2024)

Transforming faces into video stories -- VideoFace2.0
por: Brkljač, Branko, et al.
Publicado: (2025)

OpenFusion++: An Open-vocabulary Real-time Scene Understanding System
por: Jin, Xiaofeng, et al.
Publicado: (2025)

Hierarchical Spatial Algorithms for High-Resolution Image Quantization and Feature Extraction
por: Mohammad, Noor Islam S.
Publicado: (2025)

Spiking Neural Networks for event-based action recognition: A new task to understand their advantage
por: Vicente-Sola, Alex, et al.
Publicado: (2022)

Structured Basis Function Networks: Loss-Centric Multi-Hypothesis Ensembles with Controllable Diversity
por: Dominguez, Alejandro Rodriguez, et al.
Publicado: (2025)

Corn Ear Detection and Orientation Estimation Using Deep Learning
por: Sprague, Nathan, et al.
Publicado: (2024)

Gr-IoU: Ground-Intersection over Union for Robust Multi-Object Tracking with 3D Geometric Constraints
por: Toida, Keisuke, et al.
Publicado: (2024)

Interpretable Machine Learning-Derived Spectral Indices for Vegetation Monitoring
por: Lotfi, Ali, et al.
Publicado: (2025)

Yanyun-3: Enabling Cross-Platform Strategy Game Operation with Vision-Language Models
por: Wang, Guoyan, et al.
Publicado: (2025)