| Main Authors: | Çetinkaya, Evren, Lee, Sangmin, Kim, Jung Uk, Lee, Hong Joo, Navab, Nassir |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2604.17455 |
Similar Items
Object-aware Sound Source Localization via Audio-Visual Scene Understanding
by: Um, Sung Jin, et al.
Published: (2025)
Adversarial Wear and Tear: Exploiting Natural Damage for Generating Physical-World Adversarial Examples
by: Irshad, Samra, et al.
Published: (2025)
Learning Trimodal Relation for Audio-Visual Question Answering with Missing Modality
by: Park, Kyu Ri, et al.
Published: (2024)
Learning to Visually Localize Sound Sources from Mixtures without Prior Source Knowledge
by: Kim, Dongjin, et al.
Published: (2024)
Watch Video, Catch Keyword: Context-aware Keyword Attention for Moment Retrieval and Highlight Detection
by: Um, Sung Jin, et al.
Published: (2025)
Do We Need Perfect Data? Leveraging Noise for Domain Generalized Segmentation
by: Kim, Taeyeong, et al.
Published: (2025)
EP-SAM: Weakly Supervised Histopathology Segmentation via Enhanced Prompt with Segment Anything
by: Song, Joonhyeon, et al.
Published: (2024)
APT: Improving Diffusion Models for High Resolution Image Generation with Adaptive Path Tracing
by: Han, Sangmin, et al.
Published: (2025)
Question-Aware Gaussian Experts for Audio-Visual Question Answering
by: Kim, Hongyeob, et al.
Published: (2025)
Language-Guided Open-World Anomaly Segmentation
by: Reichard, Klara, et al.
Published: (2025)
Latent Space Super-Resolution for Higher-Resolution Image Generation with Diffusion Models
by: Jeong, Jinho, et al.
Published: (2025)
HieraSurg: Hierarchy-Aware Diffusion Model for Surgical Video Generation
by: Biagini, Diego, et al.
Published: (2025)
See, Rank, and Filter: Important Word-Aware Clip Filtering via Scene Understanding for Moment Retrieval and Highlight Detection
by: Lee, YuEun, et al.
Published: (2025)
Where It Moves, It Matters: Referring Surgical Instrument Segmentation via Motion
by: Wei, Meng, et al.
Published: (2026)
A Taxonomy and Library for Visualizing Learned Features in Convolutional Neural Networks
by: Grün, Felix, et al.
Published: (2016)
PromptVFX: Text-Driven Fields for Open-World 3D Gaussian Animation
by: Kiray, Mert, et al.
Published: (2025)
Leveraging Textual Compositional Reasoning for Robust Change Captioning
by: Park, Kyu Ri, et al.
Published: (2025)
StarryGazer: Leveraging Monocular Depth Estimation Models for Domain-Agnostic Single Depth Image Completion
by: Hong, Sangmin, et al.
Published: (2025)
PanORama: Multiview Consistent Panoptic Segmentation in Operating Rooms
by: Gürbüz, Tuna, et al.
Published: (2026)
From Open-Vocabulary to Vocabulary-Free Semantic Segmentation
by: Reichard, Klara, et al.
Published: (2025)
Deep Spectral Methods for Unsupervised Ultrasound Image Interpretation
by: Tmenova, Oleksandra, et al.
Published: (2024)
Towards Comprehensive Real-Time Scene Understanding in Ophthalmic Surgery through Multimodal Image Fusion
by: Rohrmoser, Nikolo, et al.
Published: (2026)
Class-Aware Cartilage Segmentation for Autonomous US-CT Registration in Robotic Intercostal Ultrasound Imaging
by: Jiang, Zhongliang, et al.
Published: (2024)
SURGIVID: Annotation-Efficient Surgical Video Object Discovery
by: Köksal, Çağhan, et al.
Published: (2024)
Self-Prompt SAM: Medical Image Segmentation via Automatic Prompt SAM Adaptation
by: Xie, Bin, et al.
Published: (2025)
Generate, Analyze, and Refine: Training-Free Sound Source Localization via MLLM Meta-Reasoning
by: Park, Subin, et al.
Published: (2026)
Visual Autoregressive Modelling for Monocular Depth Estimation
by: El-Ghoussani, Amir, et al.
Published: (2025)
CLAPS: A CLIP-Unified Auto-Prompt Segmentation for Multi-Modal Retinal Imaging
by: Zhao, Zhihao, et al.
Published: (2025)
Diffusion as Sound Propagation: Physics-inspired Model for Ultrasound Image Generation
by: Domínguez, Marina, et al.
Published: (2024)
SpecstatOR: Speckle statistics-based iOCT Segmentation Network for Ophthalmic Surgery
by: Mach, Kristina, et al.
Published: (2024)
Text-driven Adaptation of Foundation Models for Few-shot Surgical Workflow Analysis
by: Chen, Tingxuan, et al.
Published: (2025)
Neural Semantic Map-Learning for Autonomous Vehicles
by: Herb, Markus, et al.
Published: (2024)
UltraAD: Fine-Grained Ultrasound Anomaly Classification via Few-Shot CLIP Adaptation
by: Zhou, Yue, et al.
Published: (2025)
Test-Time Modality Generalization for Medical Image Segmentation
by: Nam, Ju-Hyeon, et al.
Published: (2025)
CAD: Memory Efficient Convolutional Adapter for Segment Anything
by: Kim, Joohyeok, et al.
Published: (2024)
Neural Cellular Automata for Weakly Supervised Segmentation of White Blood Cells
by: Deutges, Michael, et al.
Published: (2025)
Temporal Differential Fields for 4D Motion Modeling via Image-to-Video Synthesis
by: You, Xin, et al.
Published: (2025)
FB-Diff: Fourier Basis-guided Diffusion for Temporal Interpolation of 4D Medical Imaging
by: You, Xin, et al.
Published: (2025)
Prompting Segment Anything Model with Domain-Adaptive Prototype for Generalizable Medical Image Segmentation
by: Wei, Zhikai, et al.
Published: (2024)
GenMix: Combining Generative and Mixture Data Augmentation for Medical Image Classification
by: Lee, Hansang, et al.
Published: (2024)