Saved in:
| Main Authors: | Negrini, Alessio, Reiß, Simon |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2506.15200 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
From Static to Interactive: Adapting Visual in-Context Learners for User-Driven Tasks
by: Schmidt, Carlos, et al.
Published: (2026)
by: Schmidt, Carlos, et al.
Published: (2026)
SD-RetinaNet: Topologically Constrained Semi-Supervised Retinal Lesion and Layer Segmentation in OCT
by: Fazekas, Botond, et al.
Published: (2025)
by: Fazekas, Botond, et al.
Published: (2025)
Is Visual in-Context Learning for Compositional Medical Tasks within Reach?
by: Reiß, Simon, et al.
Published: (2025)
by: Reiß, Simon, et al.
Published: (2025)
Retina Vision Transformer (RetinaViT): Introducing Scaled Patches into Vision Transformers
by: Shu, Yuyang, et al.
Published: (2024)
by: Shu, Yuyang, et al.
Published: (2024)
Probing Intrinsic Medical Task Relationships: A Contrastive Learning Perspective
by: Muth, Jonas, et al.
Published: (2026)
by: Muth, Jonas, et al.
Published: (2026)
Bringing the Context Back into Object Recognition, Robustly
by: Janouskova, Klara, et al.
Published: (2024)
by: Janouskova, Klara, et al.
Published: (2024)
Applications of No-Collision Transportation Maps in Manifold Learning
by: Negrini, Elisa, et al.
Published: (2023)
by: Negrini, Elisa, et al.
Published: (2023)
Divide-and-Conquer Approach to Holistic Cognition in High-Similarity Contexts with Limited Data
by: Wang, Shijie, et al.
Published: (2026)
by: Wang, Shijie, et al.
Published: (2026)
Bringing Multimodality to Amazon Visual Search System
by: Zhu, Xinliang, et al.
Published: (2024)
by: Zhu, Xinliang, et al.
Published: (2024)
Visual Para-Thinker: Divide-and-Conquer Reasoning for Visual Comprehension
by: Xu, Haoran, et al.
Published: (2026)
by: Xu, Haoran, et al.
Published: (2026)
RMK RetinaNet: Rotated Multi-Kernel RetinaNet for Robust Oriented Object Detection in Remote Sensing Imagery
by: Sun, Huiran
Published: (2026)
by: Sun, Huiran
Published: (2026)
Learning to Segment Corneal Tissue Interfaces in OCT Images
by: Mathai, Tejas Sudharshan, et al.
Published: (2018)
by: Mathai, Tejas Sudharshan, et al.
Published: (2018)
StateSpaceDiffuser: Bringing Long Context to Diffusion World Models
by: Savov, Nedko, et al.
Published: (2025)
by: Savov, Nedko, et al.
Published: (2025)
Test-Time Visual In-Context Tuning
by: Xie, Jiahao, et al.
Published: (2025)
by: Xie, Jiahao, et al.
Published: (2025)
From Web to Pixels: Bringing Agentic Search into Visual Perception
by: Yang, Bokang, et al.
Published: (2026)
by: Yang, Bokang, et al.
Published: (2026)
DCText: Scheduled Attention Masking for Visual Text Generation via Divide-and-Conquer Strategy
by: Song, Jaewoo, et al.
Published: (2025)
by: Song, Jaewoo, et al.
Published: (2025)
CLIP Brings Better Features to Visual Aesthetics Learners
by: Xu, Liwu, et al.
Published: (2023)
by: Xu, Liwu, et al.
Published: (2023)
Divide-and-Conquer: Tree-structured Strategy with Answer Distribution Estimator for Goal-Oriented Visual Dialogue
by: Cai, Shuo, et al.
Published: (2025)
by: Cai, Shuo, et al.
Published: (2025)
MERG3R: A Divide-and-Conquer Approach to Large-Scale Neural Visual Geometry
by: Cheng, Leo Kaixuan, et al.
Published: (2026)
by: Cheng, Leo Kaixuan, et al.
Published: (2026)
Divide and Conquer: Reliable Multi-View Evidential Learning for Deepfake Detection
by: Kang, Xiaolu, et al.
Published: (2026)
by: Kang, Xiaolu, et al.
Published: (2026)
Memory Consistency Guided Divide-and-Conquer Learning for Generalized Category Discovery
by: Tu, Yuanpeng, et al.
Published: (2024)
by: Tu, Yuanpeng, et al.
Published: (2024)
Learning Privacy from Visual Entities
by: Xompero, Alessio, et al.
Published: (2025)
by: Xompero, Alessio, et al.
Published: (2025)
Learning Temporally Equivariance for Degenerative Disease Progression in OCT by Predicting Future Representations
by: Emre, Taha, et al.
Published: (2024)
by: Emre, Taha, et al.
Published: (2024)
RetinaGuard: Obfuscating Retinal Age in Fundus Images for Biometric Privacy Preserving
by: Luo, Zhengquan, et al.
Published: (2025)
by: Luo, Zhengquan, et al.
Published: (2025)
Masked Image Modelling for retinal OCT understanding
by: Pissas, Theodoros, et al.
Published: (2024)
by: Pissas, Theodoros, et al.
Published: (2024)
NeuralOCT: Airway OCT Analysis via Neural Fields
by: Jiao, Yining, et al.
Published: (2024)
by: Jiao, Yining, et al.
Published: (2024)
Bring Event into RGB and LiDAR: Hierarchical Visual-Motion Fusion for Scene Flow
by: Zhou, Hanyu, et al.
Published: (2024)
by: Zhou, Hanyu, et al.
Published: (2024)
DCA: Dividing and Conquering Amnesia in Incremental Object Detection
by: Zhang, Aoting, et al.
Published: (2025)
by: Zhang, Aoting, et al.
Published: (2025)
DC-Net: Divide-and-Conquer for Salient Object Detection
by: Zhu, Jiayi, et al.
Published: (2023)
by: Zhu, Jiayi, et al.
Published: (2023)
Towards Better Visualizing the Decision Basis of Networks via Unfold and Conquer Attribution Guidance
by: Hong, Jung-Ho, et al.
Published: (2023)
by: Hong, Jung-Ho, et al.
Published: (2023)
Divide-and-Conquer Inference for Large-Scale Visual Recognition with Multimodal Large Language Models
by: Ye, Zhipeng, et al.
Published: (2026)
by: Ye, Zhipeng, et al.
Published: (2026)
Comparative Analysis of YOLOv5, Faster R-CNN, SSD, and RetinaNet for Motorbike Detection in Kigali Autonomous Driving Context
by: Yinkfu, Ngeyen, et al.
Published: (2025)
by: Yinkfu, Ngeyen, et al.
Published: (2025)
VAMAE: Vessel-Aware Masked Autoencoders for OCT Angiography
by: Abolade, Ilerioluwakiiye, et al.
Published: (2026)
by: Abolade, Ilerioluwakiiye, et al.
Published: (2026)
LUMOS: Universal Semi-Supervised OCT Retinal Layer Segmentation with Hierarchical Reliable Mutual Learning
by: Fang, Yizhou, et al.
Published: (2026)
by: Fang, Yizhou, et al.
Published: (2026)
Foreign object segmentation in chest x-rays through anatomy-guided shape insertion
by: Seibold, Constantin, et al.
Published: (2025)
by: Seibold, Constantin, et al.
Published: (2025)
A Foundation Language-Image Model of the Retina (FLAIR): Encoding Expert Knowledge in Text Supervision
by: Silva-Rodríguez, Julio, et al.
Published: (2023)
by: Silva-Rodríguez, Julio, et al.
Published: (2023)
Personal Visual Context Learning in Large Multimodal Models
by: Xue, Zihui, et al.
Published: (2026)
by: Xue, Zihui, et al.
Published: (2026)
Exploring Effective Factors for Improving Visual In-Context Learning
by: Sun, Yanpeng, et al.
Published: (2023)
by: Sun, Yanpeng, et al.
Published: (2023)
Enhancing Visual In-Context Learning by Multi-Faceted Fusion
by: Liao, Wenwen, et al.
Published: (2026)
by: Liao, Wenwen, et al.
Published: (2026)
Divide and Conquer: Decoupled Representation Alignment for Multimodal World Models
by: Xiao, Junyuan, et al.
Published: (2026)
by: Xiao, Junyuan, et al.
Published: (2026)
Similar Items
-
From Static to Interactive: Adapting Visual in-Context Learners for User-Driven Tasks
by: Schmidt, Carlos, et al.
Published: (2026) -
SD-RetinaNet: Topologically Constrained Semi-Supervised Retinal Lesion and Layer Segmentation in OCT
by: Fazekas, Botond, et al.
Published: (2025) -
Is Visual in-Context Learning for Compositional Medical Tasks within Reach?
by: Reiß, Simon, et al.
Published: (2025) -
Retina Vision Transformer (RetinaViT): Introducing Scaled Patches into Vision Transformers
by: Shu, Yuyang, et al.
Published: (2024) -
Probing Intrinsic Medical Task Relationships: A Contrastive Learning Perspective
by: Muth, Jonas, et al.
Published: (2026)