Enregistré dans:
| Auteurs principaux: | Lee, Kinhei, Jing, Peiyuan, Zhang, Zhenxuan, Yang, Yue, Wang, Tao, Marshall, Dominic C, Fang, Yingying, Yang, Guang |
|---|---|
| Format: | Preprint |
| Publié: |
2026
|
| Sujets: | |
| Accès en ligne: | https://arxiv.org/abs/2604.14316 |
| Tags: |
Ajouter un tag
Pas de tags, Soyez le premier à ajouter un tag!
|
Documents similaires
Reason Like a Radiologist: Chain-of-Thought and Reinforcement Learning for Verifiable Report Generation
par: Jing, Peiyuan, et autres
Publié: (2025)
par: Jing, Peiyuan, et autres
Publié: (2025)
GEMA-Score: Granular Explainable Multi-Agent Scoring Framework for Radiology Report Evaluation
par: Zhang, Zhenxuan, et autres
Publié: (2025)
par: Zhang, Zhenxuan, et autres
Publié: (2025)
Seeing Like Radiologists: Context- and Gaze-Guided Vision-Language Pretraining for Chest X-rays
par: Liu, Kang, et autres
Publié: (2026)
par: Liu, Kang, et autres
Publié: (2026)
See Where You Read with Eye Gaze Tracking and Large Language Model
par: Yang, Sikai, et autres
Publié: (2024)
par: Yang, Sikai, et autres
Publié: (2024)
Unleashing Video Language Models for Fine-grained HRCT Report Generation
par: Fang, Yingying, et autres
Publié: (2026)
par: Fang, Yingying, et autres
Publié: (2026)
Through the Expert's Eyes: Exploring Asynchronous Expert Perspectives and Gaze Visualizations in XR
par: Sayffaerth, Clara, et autres
Publié: (2025)
par: Sayffaerth, Clara, et autres
Publié: (2025)
Pretext Task Adversarial Learning for Unpaired Low-field to Ultra High-field MRI Synthesis
par: Zhang, Zhenxuan, et autres
Publié: (2025)
par: Zhang, Zhenxuan, et autres
Publié: (2025)
Enhancing Gaze Reasoning in Vision Foundation Models for Gaze Following
par: Wang, Shijing, et autres
Publié: (2026)
par: Wang, Shijing, et autres
Publié: (2026)
Seeing Through Their Eyes: Evaluating Visual Perspective Taking in Vision Language Models
par: Góral, Gracjan, et autres
Publié: (2024)
par: Góral, Gracjan, et autres
Publié: (2024)
Eyes on VLM: Benchmarking Gaze Following and Social Gaze Prediction in Vision Language Models
par: Wang, Hengfei, et autres
Publié: (2026)
par: Wang, Hengfei, et autres
Publié: (2026)
More Expert-like Eye Gaze Movement Patterns are Related to Better X-ray Reading
par: Yang, Pingjing, et autres
Publié: (2025)
par: Yang, Pingjing, et autres
Publié: (2025)
More Than Meets the Eye? Uncovering the Reasoning-Planning Disconnect in Training Vision-Language Driving Models
par: Song, Xurui, et autres
Publié: (2025)
par: Song, Xurui, et autres
Publié: (2025)
Unpaired Translation of Chest X-ray Images for Lung Opacity Diagnosis via Adaptive Activation Masks and Cross-Domain Alignment
par: Ning, Junzhi, et autres
Publié: (2025)
par: Ning, Junzhi, et autres
Publié: (2025)
Seeing Eye to AI: Comparing Human Gaze and Model Attention in Video Memorability
par: Kumar, Prajneya, et autres
Publié: (2023)
par: Kumar, Prajneya, et autres
Publié: (2023)
Seeing Eye to AI: Human Alignment via Gaze-Based Response Rewards for Large Language Models
par: Lopez-Cardona, Angela, et autres
Publié: (2024)
par: Lopez-Cardona, Angela, et autres
Publié: (2024)
MoE-dqINR: A Unified Mixture-of-Experts Implicit Neural Representation Framework for Scan-Specific Dynamic and Quantitative MRI Reconstruction
par: Wu, Yinzhe, et autres
Publié: (2026)
par: Wu, Yinzhe, et autres
Publié: (2026)
Musical Score Understanding Benchmark: Evaluating Large Language Models' Comprehension of Complete Musical Scores
par: Dai, Congren, et autres
Publié: (2025)
par: Dai, Congren, et autres
Publié: (2025)
Cross-Stage Attention Multi-Expert Network for Radiologist-Inspired Breast Ultrasound Diagnosis
par: Zhai, Xinyang, et autres
Publié: (2026)
par: Zhai, Xinyang, et autres
Publié: (2026)
HiPath: Hierarchical Vision-Language Alignment for Structured Pathology Report Prediction
par: Yuan, Ruicheng, et autres
Publié: (2026)
par: Yuan, Ruicheng, et autres
Publié: (2026)
Seeing Sarcasm Through Different Eyes: Analyzing Multimodal Sarcasm Perception in Large Vision-Language Models
par: Chen, Junjie, et autres
Publié: (2025)
par: Chen, Junjie, et autres
Publié: (2025)
3D Wavelet-Based Structural Priors for Controlled Diffusion in Whole-Body Low-Dose PET Denoising
par: Jing, Peiyuan, et autres
Publié: (2026)
par: Jing, Peiyuan, et autres
Publié: (2026)
Interpreting Radiologist's Intention from Eye Movements in Chest X-ray Diagnosis
par: Pham, Trong-Thang, et autres
Publié: (2025)
par: Pham, Trong-Thang, et autres
Publié: (2025)
Enhancing Human-Computer Interaction in Chest X-ray Analysis using Vision and Language Model with Eye Gaze Patterns
par: Kim, Yunsoo, et autres
Publié: (2024)
par: Kim, Yunsoo, et autres
Publié: (2024)
Supporting Mitosis Detection AI Training with Inter-Observer Eye-Gaze Consistencies
par: Gu, Hongyan, et autres
Publié: (2024)
par: Gu, Hongyan, et autres
Publié: (2024)
See Through the Noise: Improving Domain Generalization in Gaze Estimation
par: Peng, Yanming, et autres
Publié: (2026)
par: Peng, Yanming, et autres
Publié: (2026)
Decoding Decision Reasoning: A Counterfactual-Powered Model for Knowledge Discovery
par: Fang, Yingying, et autres
Publié: (2024)
par: Fang, Yingying, et autres
Publié: (2024)
GazeInterpreter: Parsing Eye Gaze to Generate Eye-Body-Coordinated Narrations
par: Chang, Qing, et autres
Publié: (2025)
par: Chang, Qing, et autres
Publié: (2025)
Jigsaw-Puzzles: From Seeing to Understanding to Reasoning in Vision-Language Models
par: Lyu, Zesen, et autres
Publié: (2025)
par: Lyu, Zesen, et autres
Publié: (2025)
TalkingEyes: Pluralistic Speech-Driven 3D Eye Gaze Animation
par: Zhuang, Yixiang, et autres
Publié: (2025)
par: Zhuang, Yixiang, et autres
Publié: (2025)
Effect of Reporting Mode and Clinical Experience on Radiologists' Gaze and Image Analysis Behavior in Chest Radiography
par: Khoobi, Mahta, et autres
Publié: (2025)
par: Khoobi, Mahta, et autres
Publié: (2025)
Gaze-Regularized Vision-Language-Action Models for Robotic Manipulation
par: Pani, Anupam, et autres
Publié: (2026)
par: Pani, Anupam, et autres
Publié: (2026)
Read Like a Radiologist: Efficient Vision-Language Model for 3D Medical Imaging Interpretation
par: Lee, Changsun, et autres
Publié: (2024)
par: Lee, Changsun, et autres
Publié: (2024)
Tri-Cam: Practical Eye Gaze Tracking via Camera Network
par: Yang, Sikai, et autres
Publié: (2024)
par: Yang, Sikai, et autres
Publié: (2024)
GazeTrack: High-Precision Eye Tracking Based on Regularization and Spatial Computing
par: Yang, Xiaoyin
Publié: (2025)
par: Yang, Xiaoyin
Publié: (2025)
Radiologist-in-the-Loop Self-Training for Generalizable CT Metal Artifact Reduction
par: Ma, Chenglong, et autres
Publié: (2025)
par: Ma, Chenglong, et autres
Publié: (2025)
Deep Generative Models Unveil Patterns in Medical Images Through Vision-Language Conditioning
par: Xing, Xiaodan, et autres
Publié: (2024)
par: Xing, Xiaodan, et autres
Publié: (2024)
MAP-Diff: Multi-Anchor Guided Diffusion for Progressive 3D Whole-Body Low-Dose PET Denoising
par: Jing, Peiyuan, et autres
Publié: (2026)
par: Jing, Peiyuan, et autres
Publié: (2026)
A Progressive Training Strategy for Vision-Language Models to Counteract Spatio-Temporal Hallucinations in Embodied Reasoning
par: Yang, Xiaoda, et autres
Publié: (2026)
par: Yang, Xiaoda, et autres
Publié: (2026)
Enhancing Weakly Supervised Semantic Segmentation for Fibrosis via Controllable Image Generation
par: Yue, Zhiling, et autres
Publié: (2024)
par: Yue, Zhiling, et autres
Publié: (2024)
FG-CXR: A Radiologist-Aligned Gaze Dataset for Enhancing Interpretability in Chest X-Ray Report Generation
par: Pham, Trong Thang, et autres
Publié: (2024)
par: Pham, Trong Thang, et autres
Publié: (2024)
Documents similaires
-
Reason Like a Radiologist: Chain-of-Thought and Reinforcement Learning for Verifiable Report Generation
par: Jing, Peiyuan, et autres
Publié: (2025) -
GEMA-Score: Granular Explainable Multi-Agent Scoring Framework for Radiology Report Evaluation
par: Zhang, Zhenxuan, et autres
Publié: (2025) -
Seeing Like Radiologists: Context- and Gaze-Guided Vision-Language Pretraining for Chest X-rays
par: Liu, Kang, et autres
Publié: (2026) -
See Where You Read with Eye Gaze Tracking and Large Language Model
par: Yang, Sikai, et autres
Publié: (2024) -
Unleashing Video Language Models for Fine-grained HRCT Report Generation
par: Fang, Yingying, et autres
Publié: (2026)