Saved in:
| Main Authors: | Liao, Zehui, Hu, Shishuai, Zou, Ke, Jin, Mengyuan, Zhang, Yanning, Fu, Huazhu, Zhen, Liangli, Xia, Yong |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2503.20504 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Cycle Context Verification for In-Context Medical Image Segmentation
by: Hu, Shishuai, et al.
Published: (2025)
by: Hu, Shishuai, et al.
Published: (2025)
Unleashing the Potential of Open-set Noisy Samples Against Label Noise for Medical Image Classification
by: Liao, Zehui, et al.
Published: (2024)
by: Liao, Zehui, et al.
Published: (2024)
V-Loop: Visual Logical Loop Verification for Hallucination Detection in Medical Visual Question Answering
by: Jin, Mengyuan, et al.
Published: (2026)
by: Jin, Mengyuan, et al.
Published: (2026)
Towards Clinician-Preferred Segmentation: Leveraging Human-in-the-Loop for Test Time Adaptation in Medical Image Segmentation
by: Hu, Shishuai, et al.
Published: (2024)
by: Hu, Shishuai, et al.
Published: (2024)
Instance-dependent Label Distribution Estimation for Learning with Label Noise
by: Liao, Zehui, et al.
Published: (2022)
by: Liao, Zehui, et al.
Published: (2022)
UniMotion: A Unified Framework for Motion-Text-Vision Understanding and Generation
by: Wang, Ziyi, et al.
Published: (2026)
by: Wang, Ziyi, et al.
Published: (2026)
FRCNet Frequency and Region Consistency for Semi-supervised Medical Image Segmentation
by: He, Along, et al.
Published: (2024)
by: He, Along, et al.
Published: (2024)
Vivim: a Video Vision Mamba for Medical Video Segmentation
by: Yang, Yijun, et al.
Published: (2024)
by: Yang, Yijun, et al.
Published: (2024)
UniVLR: Unifying Text and Vision in Visual Latent Reasoning for Multimodal LLMs
by: Jiang, Houcheng, et al.
Published: (2026)
by: Jiang, Houcheng, et al.
Published: (2026)
UniVision: A Unified Framework for Vision-Centric 3D Perception
by: Hong, Yu, et al.
Published: (2024)
by: Hong, Yu, et al.
Published: (2024)
STLLaVA-Med: Self-Training Large Language and Vision Assistant for Medical Question-Answering
by: Sun, Guohao, et al.
Published: (2024)
by: Sun, Guohao, et al.
Published: (2024)
EH-Benchmark Ophthalmic Hallucination Benchmark and Agent-Driven Top-Down Traceable Reasoning Workflow
by: Pan, Xiaoyu, et al.
Published: (2025)
by: Pan, Xiaoyu, et al.
Published: (2025)
Uni-Mlip: Unified Self-supervision for Medical Vision Language Pre-training
by: Bawazir, Ameera, et al.
Published: (2024)
by: Bawazir, Ameera, et al.
Published: (2024)
Med-UniC: Unifying Cross-Lingual Medical Vision-Language Pre-Training by Diminishing Bias
by: Wan, Zhongwei, et al.
Published: (2023)
by: Wan, Zhongwei, et al.
Published: (2023)
Enhancing Medical Visual Grounding via Knowledge-guided Spatial Prompts
by: Gao, Yifan, et al.
Published: (2026)
by: Gao, Yifan, et al.
Published: (2026)
Hallucination Filtering in Radiology Vision-Language Models Using Discrete Semantic Entropy
by: Wienholt, Patrick, et al.
Published: (2025)
by: Wienholt, Patrick, et al.
Published: (2025)
Few-Shot Learning from Gigapixel Images via Hierarchical Vision-Language Alignment and Modeling
by: Wong, Bryan, et al.
Published: (2025)
by: Wong, Bryan, et al.
Published: (2025)
Cross-Modal Obfuscation for Jailbreak Attacks on Large Vision-Language Models
by: Jiang, Lei, et al.
Published: (2025)
by: Jiang, Lei, et al.
Published: (2025)
Vision-Language Model IP Protection via Prompt-based Learning
by: Wang, Lianyu, et al.
Published: (2025)
by: Wang, Lianyu, et al.
Published: (2025)
Detecting and Evaluating Medical Hallucinations in Large Vision Language Models
by: Chen, Jiawei, et al.
Published: (2024)
by: Chen, Jiawei, et al.
Published: (2024)
Rethinking KV Cache Eviction via a Unified Information-Theoretic Objective
by: Yang, Jiaming, et al.
Published: (2026)
by: Yang, Jiaming, et al.
Published: (2026)
VIHD: Visual Intervention-based Hallucination Detection for Medical Visual Question Answering
by: Chen, Jiayi, et al.
Published: (2026)
by: Chen, Jiayi, et al.
Published: (2026)
Noise-Adaptive Diffusion Sampling for Inverse Problems Without Task-Specific Tuning
by: Xia, Yingzhi, et al.
Published: (2026)
by: Xia, Yingzhi, et al.
Published: (2026)
UniViTAR: Unified Vision Transformer with Native Resolution
by: Qiao, Limeng, et al.
Published: (2025)
by: Qiao, Limeng, et al.
Published: (2025)
UniStitch: Unifying Semantic and Geometric Features for Image Stitching
by: Mei, Yuan, et al.
Published: (2026)
by: Mei, Yuan, et al.
Published: (2026)
Towards Reliable Medical Image Segmentation by Modeling Evidential Calibrated Uncertainty
by: Zou, Ke, et al.
Published: (2023)
by: Zou, Ke, et al.
Published: (2023)
UniFine: A Unified and Fine-grained Approach for Zero-shot Vision-Language Understanding
by: Wang, Zhecan, et al.
Published: (2023)
by: Wang, Zhecan, et al.
Published: (2023)
Reliable Source Approximation: Source-Free Unsupervised Domain Adaptation for Vestibular Schwannoma MRI Segmentation
by: Zeng, Hongye, et al.
Published: (2024)
by: Zeng, Hongye, et al.
Published: (2024)
Ming-UniVision: Joint Image Understanding and Generation with a Unified Continuous Tokenizer
by: Huang, Ziyuan, et al.
Published: (2025)
by: Huang, Ziyuan, et al.
Published: (2025)
UMIT: Unifying Medical Imaging Tasks via Vision-Language Models
by: Yu, Haiyang, et al.
Published: (2025)
by: Yu, Haiyang, et al.
Published: (2025)
UniFusion: Vision-Language Model as Unified Encoder in Image Generation
by: Li, Kevin, et al.
Published: (2025)
by: Li, Kevin, et al.
Published: (2025)
UniHM: Unified Dexterous Hand Manipulation with Vision Language Model
by: Zhang, Zhenhao, et al.
Published: (2026)
by: Zhang, Zhenhao, et al.
Published: (2026)
UniEDU: A Unified Language and Vision Assistant for Education Applications
by: Chu, Zhendong, et al.
Published: (2025)
by: Chu, Zhendong, et al.
Published: (2025)
UniCompress: Token Compression for Unified Vision-Language Understanding and Generation
by: Wang, Ziyao, et al.
Published: (2026)
by: Wang, Ziyao, et al.
Published: (2026)
Asymmetric Visual Semantic Embedding Framework for Efficient Vision-Language Alignment
by: Liu, Yang, et al.
Published: (2025)
by: Liu, Yang, et al.
Published: (2025)
Conjugated Semantic Pool Improves OOD Detection with Pre-trained Vision-Language Models
by: Chen, Mengyuan, et al.
Published: (2024)
by: Chen, Mengyuan, et al.
Published: (2024)
Polyp-PVT: Polyp Segmentation with Pyramid Vision Transformers
by: Dong, Bo, et al.
Published: (2021)
by: Dong, Bo, et al.
Published: (2021)
ViLa-MIL: Dual-scale Vision-Language Multiple Instance Learning for Whole Slide Image Classification
by: Shi, Jiangbo, et al.
Published: (2025)
by: Shi, Jiangbo, et al.
Published: (2025)
Structured Semantic Cloaking for Jailbreak Attacks on Large Language Models
by: Sun, Xiaobing, et al.
Published: (2026)
by: Sun, Xiaobing, et al.
Published: (2026)
UniISP: A Unified ISP Framework for Both Human and Machine Vision
by: Li, Hanxi, et al.
Published: (2026)
by: Li, Hanxi, et al.
Published: (2026)
Similar Items
-
Cycle Context Verification for In-Context Medical Image Segmentation
by: Hu, Shishuai, et al.
Published: (2025) -
Unleashing the Potential of Open-set Noisy Samples Against Label Noise for Medical Image Classification
by: Liao, Zehui, et al.
Published: (2024) -
V-Loop: Visual Logical Loop Verification for Hallucination Detection in Medical Visual Question Answering
by: Jin, Mengyuan, et al.
Published: (2026) -
Towards Clinician-Preferred Segmentation: Leveraging Human-in-the-Loop for Test Time Adaptation in Medical Image Segmentation
by: Hu, Shishuai, et al.
Published: (2024) -
Instance-dependent Label Distribution Estimation for Learning with Label Noise
by: Liao, Zehui, et al.
Published: (2022)