:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Çetinkaya, Evren, Lee, Sangmin, Kim, Jung Uk, Lee, Hong Joo, Navab, Nassir
Format:	Preprint
Published:	2026
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2604.17455
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Object-aware Sound Source Localization via Audio-Visual Scene Understanding
by: Um, Sung Jin, et al.
Published: (2025)

Adversarial Wear and Tear: Exploiting Natural Damage for Generating Physical-World Adversarial Examples
by: Irshad, Samra, et al.
Published: (2025)

Learning Trimodal Relation for Audio-Visual Question Answering with Missing Modality
by: Park, Kyu Ri, et al.
Published: (2024)

Learning to Visually Localize Sound Sources from Mixtures without Prior Source Knowledge
by: Kim, Dongjin, et al.
Published: (2024)

Watch Video, Catch Keyword: Context-aware Keyword Attention for Moment Retrieval and Highlight Detection
by: Um, Sung Jin, et al.
Published: (2025)

Do We Need Perfect Data? Leveraging Noise for Domain Generalized Segmentation
by: Kim, Taeyeong, et al.
Published: (2025)

EP-SAM: Weakly Supervised Histopathology Segmentation via Enhanced Prompt with Segment Anything
by: Song, Joonhyeon, et al.
Published: (2024)

APT: Improving Diffusion Models for High Resolution Image Generation with Adaptive Path Tracing
by: Han, Sangmin, et al.
Published: (2025)

Question-Aware Gaussian Experts for Audio-Visual Question Answering
by: Kim, Hongyeob, et al.
Published: (2025)

Language-Guided Open-World Anomaly Segmentation
by: Reichard, Klara, et al.
Published: (2025)

Latent Space Super-Resolution for Higher-Resolution Image Generation with Diffusion Models
by: Jeong, Jinho, et al.
Published: (2025)

HieraSurg: Hierarchy-Aware Diffusion Model for Surgical Video Generation
by: Biagini, Diego, et al.
Published: (2025)

See, Rank, and Filter: Important Word-Aware Clip Filtering via Scene Understanding for Moment Retrieval and Highlight Detection
by: Lee, YuEun, et al.
Published: (2025)

Where It Moves, It Matters: Referring Surgical Instrument Segmentation via Motion
by: Wei, Meng, et al.
Published: (2026)

A Taxonomy and Library for Visualizing Learned Features in Convolutional Neural Networks
by: Grün, Felix, et al.
Published: (2016)

PromptVFX: Text-Driven Fields for Open-World 3D Gaussian Animation
by: Kiray, Mert, et al.
Published: (2025)

Leveraging Textual Compositional Reasoning for Robust Change Captioning
by: Park, Kyu Ri, et al.
Published: (2025)

StarryGazer: Leveraging Monocular Depth Estimation Models for Domain-Agnostic Single Depth Image Completion
by: Hong, Sangmin, et al.
Published: (2025)

PanORama: Multiview Consistent Panoptic Segmentation in Operating Rooms
by: Gürbüz, Tuna, et al.
Published: (2026)

From Open-Vocabulary to Vocabulary-Free Semantic Segmentation
by: Reichard, Klara, et al.
Published: (2025)

Deep Spectral Methods for Unsupervised Ultrasound Image Interpretation
by: Tmenova, Oleksandra, et al.
Published: (2024)

Towards Comprehensive Real-Time Scene Understanding in Ophthalmic Surgery through Multimodal Image Fusion
by: Rohrmoser, Nikolo, et al.
Published: (2026)

Class-Aware Cartilage Segmentation for Autonomous US-CT Registration in Robotic Intercostal Ultrasound Imaging
by: Jiang, Zhongliang, et al.
Published: (2024)

SURGIVID: Annotation-Efficient Surgical Video Object Discovery
by: Köksal, Çağhan, et al.
Published: (2024)

Self-Prompt SAM: Medical Image Segmentation via Automatic Prompt SAM Adaptation
by: Xie, Bin, et al.
Published: (2025)

Generate, Analyze, and Refine: Training-Free Sound Source Localization via MLLM Meta-Reasoning
by: Park, Subin, et al.
Published: (2026)

Visual Autoregressive Modelling for Monocular Depth Estimation
by: El-Ghoussani, Amir, et al.
Published: (2025)

CLAPS: A CLIP-Unified Auto-Prompt Segmentation for Multi-Modal Retinal Imaging
by: Zhao, Zhihao, et al.
Published: (2025)

Diffusion as Sound Propagation: Physics-inspired Model for Ultrasound Image Generation
by: Domínguez, Marina, et al.
Published: (2024)

SpecstatOR: Speckle statistics-based iOCT Segmentation Network for Ophthalmic Surgery
by: Mach, Kristina, et al.
Published: (2024)

Text-driven Adaptation of Foundation Models for Few-shot Surgical Workflow Analysis
by: Chen, Tingxuan, et al.
Published: (2025)

Neural Semantic Map-Learning for Autonomous Vehicles
by: Herb, Markus, et al.
Published: (2024)

UltraAD: Fine-Grained Ultrasound Anomaly Classification via Few-Shot CLIP Adaptation
by: Zhou, Yue, et al.
Published: (2025)

Test-Time Modality Generalization for Medical Image Segmentation
by: Nam, Ju-Hyeon, et al.
Published: (2025)

CAD: Memory Efficient Convolutional Adapter for Segment Anything
by: Kim, Joohyeok, et al.
Published: (2024)

Neural Cellular Automata for Weakly Supervised Segmentation of White Blood Cells
by: Deutges, Michael, et al.
Published: (2025)

Temporal Differential Fields for 4D Motion Modeling via Image-to-Video Synthesis
by: You, Xin, et al.
Published: (2025)

FB-Diff: Fourier Basis-guided Diffusion for Temporal Interpolation of 4D Medical Imaging
by: You, Xin, et al.
Published: (2025)

Prompting Segment Anything Model with Domain-Adaptive Prototype for Generalizable Medical Image Segmentation
by: Wei, Zhikai, et al.
Published: (2024)

GenMix: Combining Generative and Mixture Data Augmentation for Medical Image Classification
by: Lee, Hansang, et al.
Published: (2024)