Saved in:
| Main Authors: | Yao, Yue, Wen, Zelin, Tong, Yan, Tian, Xinyu, Li, Xuqing, Ma, Xiao, Xu, Dongliang, Gedeon, Tom |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2506.11989 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Visual and Textual Prompts in VLLMs for Enhancing Emotion Recognition
by: Wang, Zhifeng, et al.
Published: (2025)
by: Wang, Zhifeng, et al.
Published: (2025)
A Utility-preserving De-identification Pipeline for Cross-hospital Radiology Data Sharing
by: Liu, Chenhao, et al.
Published: (2026)
by: Liu, Chenhao, et al.
Published: (2026)
Bipartite Mode Matching for Vision Training Set Search from a Hierarchical Data Server
by: Yao, Yue, et al.
Published: (2026)
by: Yao, Yue, et al.
Published: (2026)
MMRad-22K: A Structured Multimodal Evidence Dataset for Chest X-ray Report Generation
by: Zhao, Yichen, et al.
Published: (2026)
by: Zhao, Yichen, et al.
Published: (2026)
Line of Sight: On Linear Representations in VLLMs
by: Rajaram, Achyuta, et al.
Published: (2025)
by: Rajaram, Achyuta, et al.
Published: (2025)
Enhancing Chest X-ray Classification through Knowledge Injection in Cross-Modality Learning
by: Yan, Yang, et al.
Published: (2025)
by: Yan, Yang, et al.
Published: (2025)
What Does Softmax Probability Tell Us about Classifiers Ranking Across Diverse Test Conditions?
by: Tu, Weijie, et al.
Published: (2024)
by: Tu, Weijie, et al.
Published: (2024)
UniT: Unified Multimodal Chain-of-Thought Test-time Scaling
by: Chen, Leon Liangyu, et al.
Published: (2026)
by: Chen, Leon Liangyu, et al.
Published: (2026)
Chest X-ray Foundation Model with Global and Local Representations Integration
by: Yang, Zefan, et al.
Published: (2025)
by: Yang, Zefan, et al.
Published: (2025)
X-Ray-CoT: Interpretable Chest X-ray Diagnosis with Vision-Language Models via Chain-of-Thought Reasoning
by: Ng, Chee, et al.
Published: (2025)
by: Ng, Chee, et al.
Published: (2025)
CheXPO: Preference Optimization for Chest X-ray VLMs with Counterfactual Rationale
by: Liang, Xiao, et al.
Published: (2025)
by: Liang, Xiao, et al.
Published: (2025)
GEMeX: A Large-Scale, Groundable, and Explainable Medical VQA Benchmark for Chest X-ray Diagnosis
by: Liu, Bo, et al.
Published: (2024)
by: Liu, Bo, et al.
Published: (2024)
EVA-X: A Foundation Model for General Chest X-ray Analysis with Self-supervised Learning
by: Yao, Jingfeng, et al.
Published: (2024)
by: Yao, Jingfeng, et al.
Published: (2024)
Benchmarking Chest X-ray Diagnosis Models Across Multinational Datasets
by: Xu, Qinmei, et al.
Published: (2025)
by: Xu, Qinmei, et al.
Published: (2025)
MedRAX: Medical Reasoning Agent for Chest X-ray
by: Fallahpour, Adibvafa, et al.
Published: (2025)
by: Fallahpour, Adibvafa, et al.
Published: (2025)
CheXPO-v2: Preference Optimization for Chest X-ray VLMs with Knowledge Graph Consistency
by: Liang, Xiao, et al.
Published: (2025)
by: Liang, Xiao, et al.
Published: (2025)
CyberV: Cybernetics for Test-time Scaling in Video Understanding
by: Meng, Jiahao, et al.
Published: (2025)
by: Meng, Jiahao, et al.
Published: (2025)
Explaining Chest X-ray Pathology Models using Textual Concepts
by: Sadashivaiah, Vijay, et al.
Published: (2024)
by: Sadashivaiah, Vijay, et al.
Published: (2024)
Learnable Expansion of Graph Operators for Multi-Modal Feature Fusion
by: Ding, Dexuan, et al.
Published: (2024)
by: Ding, Dexuan, et al.
Published: (2024)
AT-CXR: Uncertainty-Aware Agentic Triage for Chest X-rays
by: Li, Xueyang, et al.
Published: (2025)
by: Li, Xueyang, et al.
Published: (2025)
Unsupervised Search for Ethnic Minorities' Medical Segmentation Training Set
by: Chen, Yixiao, et al.
Published: (2025)
by: Chen, Yixiao, et al.
Published: (2025)
DUCX: Decomposing Unfairness in Tool-Using Chest X-ray Agents
by: Xu, Zikang, et al.
Published: (2026)
by: Xu, Zikang, et al.
Published: (2026)
PolyG: Adaptive Graph Traversal for Diverse GraphRAG Questions
by: Liu, Renjie, et al.
Published: (2025)
by: Liu, Renjie, et al.
Published: (2025)
Xray2Xray: World Model from Chest X-rays with Volumetric Context
by: Yang, Zefan, et al.
Published: (2025)
by: Yang, Zefan, et al.
Published: (2025)
When Token Pruning is Worse than Random: Understanding Visual Token Information in VLLMs
by: Wang, Yahong, et al.
Published: (2025)
by: Wang, Yahong, et al.
Published: (2025)
DAXA: Traversing the X-ray desert by Democratising Archival X-ray Astronomy
by: Turner, David J., et al.
Published: (2024)
by: Turner, David J., et al.
Published: (2024)
GTR-CoT: Graph Traversal as Visual Chain of Thought for Molecular Structure Recognition
by: Wang, Jingchao, et al.
Published: (2025)
by: Wang, Jingchao, et al.
Published: (2025)
A Closer Look at the Robustness of Contrastive Language-Image Pre-Training (CLIP)
by: Tu, Weijie, et al.
Published: (2024)
by: Tu, Weijie, et al.
Published: (2024)
TrackNetV4: Enhancing Fast Sports Object Tracking with Motion Attention Maps
by: Raj, Arjun, et al.
Published: (2024)
by: Raj, Arjun, et al.
Published: (2024)
Deep transfer learning for image classification: a survey
by: Plested, Jo, et al.
Published: (2022)
by: Plested, Jo, et al.
Published: (2022)
Toward a Holistic Evaluation of Robustness in CLIP Models
by: Tu, Weijie, et al.
Published: (2024)
by: Tu, Weijie, et al.
Published: (2024)
Policy of Thoughts: Scaling LLM Reasoning via Test-time Policy Evolution
by: Jiao, Zhengbo, et al.
Published: (2026)
by: Jiao, Zhengbo, et al.
Published: (2026)
Structural Graph Neural Networks with Anatomical Priors for Explainable Chest X-ray Diagnosis
by: Berkani, Khaled
Published: (2026)
by: Berkani, Khaled
Published: (2026)
Hierarchical structure understanding in complex tables with VLLMs: a benchmark and experiments
by: Bindini, Luca, et al.
Published: (2025)
by: Bindini, Luca, et al.
Published: (2025)
Waste-Bench: A Comprehensive Benchmark for Evaluating VLLMs in Cluttered Environments
by: Ali, Muhammad, et al.
Published: (2025)
by: Ali, Muhammad, et al.
Published: (2025)
CAD-Assistant: Tool-Augmented VLLMs as Generic CAD Task Solvers
by: Mallis, Dimitrios, et al.
Published: (2024)
by: Mallis, Dimitrios, et al.
Published: (2024)
ECHO: Efficient Chest X-ray Report Generation with One-step Block Diffusion
by: Chen, Lifeng, et al.
Published: (2026)
by: Chen, Lifeng, et al.
Published: (2026)
Addressing Asynchronicity in Clinical Multimodal Fusion via Individualized Chest X-ray Generation
by: Yao, Wenfang, et al.
Published: (2024)
by: Yao, Wenfang, et al.
Published: (2024)
Instruction-Guided Lesion Segmentation for Chest X-rays with Automatically Generated Large-Scale Dataset
by: Choi, Geon, et al.
Published: (2025)
by: Choi, Geon, et al.
Published: (2025)
UniX: Unifying Autoregression and Diffusion for Chest X-Ray Understanding and Generation
by: Zhang, Ruiheng, et al.
Published: (2026)
by: Zhang, Ruiheng, et al.
Published: (2026)
Similar Items
-
Visual and Textual Prompts in VLLMs for Enhancing Emotion Recognition
by: Wang, Zhifeng, et al.
Published: (2025) -
A Utility-preserving De-identification Pipeline for Cross-hospital Radiology Data Sharing
by: Liu, Chenhao, et al.
Published: (2026) -
Bipartite Mode Matching for Vision Training Set Search from a Hierarchical Data Server
by: Yao, Yue, et al.
Published: (2026) -
MMRad-22K: A Structured Multimodal Evidence Dataset for Chest X-ray Report Generation
by: Zhao, Yichen, et al.
Published: (2026) -
Line of Sight: On Linear Representations in VLLMs
by: Rajaram, Achyuta, et al.
Published: (2025)