Guardado en:
| Autores principales: | Kriegler, Andreas, Beleznai, Csaba, Gelautz, Margrit |
|---|---|
| Formato: | Preprint |
| Publicado: |
2026
|
| Materias: | |
| Acceso en línea: | https://arxiv.org/abs/2604.18208 |
| Etiquetas: |
Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
|
Ejemplares similares
Polarization-Based Eye Tracking with Personalized Siamese Architectures
por: Kalkanli, Beyza, et al.
Publicado: (2026)
por: Kalkanli, Beyza, et al.
Publicado: (2026)
Key-Scan-Based Mobile Robot Navigation: Integrated Mapping, Planning, and Control using Graphs of Scan Regions
por: Latha, Dharshan Bashkaran, et al.
Publicado: (2024)
por: Latha, Dharshan Bashkaran, et al.
Publicado: (2024)
Smelly, dense, and spreaded: The Object Detection for Olfactory References (ODOR) dataset
por: Zinnen, Mathias, et al.
Publicado: (2025)
por: Zinnen, Mathias, et al.
Publicado: (2025)
MSPCaps: A Multi-Scale Patchify Capsule Network with Cross-Agreement Routing for Visual Recognition
por: Hu, Yudong, et al.
Publicado: (2025)
por: Hu, Yudong, et al.
Publicado: (2025)
Floorplan2Guide: LLM-Guided Floorplan Parsing for BLV Indoor Navigation
por: Ayanzadeh, Aydin, et al.
Publicado: (2025)
por: Ayanzadeh, Aydin, et al.
Publicado: (2025)
AntiGrounding: Lifting Robotic Actions into VLM Representation Space for Decision Making
por: Li, Wenbo, et al.
Publicado: (2025)
por: Li, Wenbo, et al.
Publicado: (2025)
T-Rex: Task-Adaptive Spatial Representation Extraction for Robotic Manipulation with Vision-Language Models
por: Chen, Yiteng, et al.
Publicado: (2025)
por: Chen, Yiteng, et al.
Publicado: (2025)
Extraction Of Cumulative Blobs From Dynamic Gestures
por: Naulakha, Rishabh, et al.
Publicado: (2025)
por: Naulakha, Rishabh, et al.
Publicado: (2025)
Spectral Integrated Gradients for Coarse-to-Fine Feature Attribution
por: Kim, Soyeon, et al.
Publicado: (2026)
por: Kim, Soyeon, et al.
Publicado: (2026)
Manifold-Aligned Guided Integrated Gradients for Reliable Feature Attribution
por: Kim, Soyeon, et al.
Publicado: (2026)
por: Kim, Soyeon, et al.
Publicado: (2026)
On the convex hull of integer points above the hyperbola
por: Alcántara, David, et al.
Publicado: (2025)
por: Alcántara, David, et al.
Publicado: (2025)
VLM-NCD:Novel Class Discovery with Vision-Based Large Language Models
por: Su, Yuetong, et al.
Publicado: (2025)
por: Su, Yuetong, et al.
Publicado: (2025)
SoccerChat: Integrating Multimodal Data for Enhanced Soccer Game Understanding
por: Gautam, Sushant, et al.
Publicado: (2025)
por: Gautam, Sushant, et al.
Publicado: (2025)
AI-Powered Augmented Reality for Satellite Assembly, Integration and Test
por: Patricio, Alvaro, et al.
Publicado: (2024)
por: Patricio, Alvaro, et al.
Publicado: (2024)
DSER: Spectral Epipolar Representation for Efficient Light Field Depth Estimation
por: Mohammad, Noor Islam S., et al.
Publicado: (2025)
por: Mohammad, Noor Islam S., et al.
Publicado: (2025)
Topology-Aware 3D Gaussian Splatting: Leveraging Persistent Homology for Optimized Structural Integrity
por: Shen, Tianqi, et al.
Publicado: (2024)
por: Shen, Tianqi, et al.
Publicado: (2024)
Deep Learning-Based Multi-Object Tracking: A Comprehensive Survey from Foundations to State-of-the-Art
por: Adžemović, Momir
Publicado: (2025)
por: Adžemović, Momir
Publicado: (2025)
A large-scale, physically-based synthetic dataset for satellite pose estimation
por: Velkei, Szabolcs, et al.
Publicado: (2025)
por: Velkei, Szabolcs, et al.
Publicado: (2025)
NeuroWrite: Predictive Handwritten Digit Classification using Deep Neural Networks
por: Asish, Kottakota, et al.
Publicado: (2023)
por: Asish, Kottakota, et al.
Publicado: (2023)
MEGA-GUI: Multi-stage Enhanced Grounding Agents for GUI Elements
por: Kwak, SeokJoo, et al.
Publicado: (2025)
por: Kwak, SeokJoo, et al.
Publicado: (2025)
VDPP: Video Depth Post-Processing for Speed and Scalability
por: Yoon, Daewon, et al.
Publicado: (2026)
por: Yoon, Daewon, et al.
Publicado: (2026)
From Gaze to Insight: Bridging Human Visual Attention and Vision Language Model Explanation for Weakly-Supervised Medical Image Segmentation
por: Chen, Jingkun, et al.
Publicado: (2025)
por: Chen, Jingkun, et al.
Publicado: (2025)
CLIP-Joint-Detect: End-to-End Joint Training of Object Detectors with Contrastive Vision-Language Supervision
por: Raoufi, Behnam, et al.
Publicado: (2025)
por: Raoufi, Behnam, et al.
Publicado: (2025)
3DreamBooth: High-Fidelity 3D Subject-Driven Video Generation Model
por: Ko, Hyun-kyu, et al.
Publicado: (2026)
por: Ko, Hyun-kyu, et al.
Publicado: (2026)
Enhancing Maritime Object Detection in Real-Time with RT-DETR and Data Augmentation
por: Nemati, Nader
Publicado: (2025)
por: Nemati, Nader
Publicado: (2025)
Position: Age Estimation Models Do Not Process Biometric Data
por: Marshalkin, Nikita
Publicado: (2026)
por: Marshalkin, Nikita
Publicado: (2026)
HEDGE: Hallucination Estimation via Dense Geometric Entropy for VQA with Vision-Language Models
por: Gautam, Sushant, et al.
Publicado: (2025)
por: Gautam, Sushant, et al.
Publicado: (2025)
Fast Normalized Cross-Correlation for Template Matching with Rotations
por: Almira, José María, et al.
Publicado: (2023)
por: Almira, José María, et al.
Publicado: (2023)
Semantic2Graph: Graph-based Multi-modal Feature Fusion for Action Segmentation in Videos
por: Zhang, Junbin, et al.
Publicado: (2022)
por: Zhang, Junbin, et al.
Publicado: (2022)
Predictive Modeling of Maritime Radar Data Using Transformer Architecture
por: Qesaraku, Bjorna, et al.
Publicado: (2025)
por: Qesaraku, Bjorna, et al.
Publicado: (2025)
BG-YOLO: A Bidirectional-Guided Method for Underwater Object Detection
por: Zhang, Jian, et al.
Publicado: (2024)
por: Zhang, Jian, et al.
Publicado: (2024)
Transforming faces into video stories -- VideoFace2.0
por: Brkljač, Branko, et al.
Publicado: (2025)
por: Brkljač, Branko, et al.
Publicado: (2025)
OpenFusion++: An Open-vocabulary Real-time Scene Understanding System
por: Jin, Xiaofeng, et al.
Publicado: (2025)
por: Jin, Xiaofeng, et al.
Publicado: (2025)
Hierarchical Spatial Algorithms for High-Resolution Image Quantization and Feature Extraction
por: Mohammad, Noor Islam S.
Publicado: (2025)
por: Mohammad, Noor Islam S.
Publicado: (2025)
Spiking Neural Networks for event-based action recognition: A new task to understand their advantage
por: Vicente-Sola, Alex, et al.
Publicado: (2022)
por: Vicente-Sola, Alex, et al.
Publicado: (2022)
Structured Basis Function Networks: Loss-Centric Multi-Hypothesis Ensembles with Controllable Diversity
por: Dominguez, Alejandro Rodriguez, et al.
Publicado: (2025)
por: Dominguez, Alejandro Rodriguez, et al.
Publicado: (2025)
Corn Ear Detection and Orientation Estimation Using Deep Learning
por: Sprague, Nathan, et al.
Publicado: (2024)
por: Sprague, Nathan, et al.
Publicado: (2024)
Gr-IoU: Ground-Intersection over Union for Robust Multi-Object Tracking with 3D Geometric Constraints
por: Toida, Keisuke, et al.
Publicado: (2024)
por: Toida, Keisuke, et al.
Publicado: (2024)
Interpretable Machine Learning-Derived Spectral Indices for Vegetation Monitoring
por: Lotfi, Ali, et al.
Publicado: (2025)
por: Lotfi, Ali, et al.
Publicado: (2025)
Yanyun-3: Enabling Cross-Platform Strategy Game Operation with Vision-Language Models
por: Wang, Guoyan, et al.
Publicado: (2025)
por: Wang, Guoyan, et al.
Publicado: (2025)
Ejemplares similares
-
Polarization-Based Eye Tracking with Personalized Siamese Architectures
por: Kalkanli, Beyza, et al.
Publicado: (2026) -
Key-Scan-Based Mobile Robot Navigation: Integrated Mapping, Planning, and Control using Graphs of Scan Regions
por: Latha, Dharshan Bashkaran, et al.
Publicado: (2024) -
Smelly, dense, and spreaded: The Object Detection for Olfactory References (ODOR) dataset
por: Zinnen, Mathias, et al.
Publicado: (2025) -
MSPCaps: A Multi-Scale Patchify Capsule Network with Cross-Agreement Routing for Visual Recognition
por: Hu, Yudong, et al.
Publicado: (2025) -
Floorplan2Guide: LLM-Guided Floorplan Parsing for BLV Indoor Navigation
por: Ayanzadeh, Aydin, et al.
Publicado: (2025)