Saved in:
| Main Authors: | Shi, Danli, Zhang, Weiyi, Chen, Xiaolan, Liu, Yexin, Yang, Jiancheng, Huang, Siyu, Tham, Yih Chung, Zheng, Yingfeng, He, Mingguang |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2405.11338 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Benchmarking Large Multimodal Models for Ophthalmic Visual Question Answering with OphthalWeChat
by: Xu, Pusheng, et al.
Published: (2025)
by: Xu, Pusheng, et al.
Published: (2025)
EyeGPT: Ophthalmic Assistant with Large Language Models
by: Chen, Xiaolan, et al.
Published: (2024)
by: Chen, Xiaolan, et al.
Published: (2024)
Fundus2Video: Cross-Modal Angiography Video Generation from Static Fundus Photography with Clinical Knowledge Guidance
by: Zhang, Weiyi, et al.
Published: (2024)
by: Zhang, Weiyi, et al.
Published: (2024)
Fundus to Fluorescein Angiography Video Generation as a Retinal Generative Foundation Model
by: Zhang, Weiyi, et al.
Published: (2024)
by: Zhang, Weiyi, et al.
Published: (2024)
EyeWorld: A Generative World Model of Ocular State and Dynamics
by: Gao, Ziyu, et al.
Published: (2026)
by: Gao, Ziyu, et al.
Published: (2026)
EyeCLIP: A visual-language foundation model for multi-modal ophthalmic image analysis
by: Shi, Danli, et al.
Published: (2024)
by: Shi, Danli, et al.
Published: (2024)
Evaluating large language models in medical applications: a survey
by: Chen, Xiaolan, et al.
Published: (2024)
by: Chen, Xiaolan, et al.
Published: (2024)
EyeDiff: text-to-image diffusion model improves rare eye disease diagnosis
by: Chen, Ruoyu, et al.
Published: (2024)
by: Chen, Ruoyu, et al.
Published: (2024)
EyeAgent: An Agentic AI System for Multimodal Clinical Decision Support in Ophthalmology
by: Shi, Danli, et al.
Published: (2025)
by: Shi, Danli, et al.
Published: (2025)
Visual Question Answering in Ophthalmology: A Progressive and Practical Perspective
by: Chen, Xiaolan, et al.
Published: (2024)
by: Chen, Xiaolan, et al.
Published: (2024)
FusionFM: Fusing Eye-specific Foundational Models for Optimized Ophthalmic Diagnosis
by: Zou, Ke, et al.
Published: (2025)
by: Zou, Ke, et al.
Published: (2025)
ChatMyopia: An AI Agent for Pre-consultation Education in Primary Eye Care Settings
by: Wu, Yue, et al.
Published: (2025)
by: Wu, Yue, et al.
Published: (2025)
UWF-RI2FA: Generating Multi-frame Ultrawide-field Fluorescein Angiography from Ultrawide-field Retinal Imaging Improves Diabetic Retinopathy Stratification
by: Chen, Ruoyu, et al.
Published: (2024)
by: Chen, Ruoyu, et al.
Published: (2024)
DeepSeek-R1 Outperforms Gemini 2.0 Pro, OpenAI o1, and o3-mini in Bilingual Complex Ophthalmology Reasoning
by: Xu, Pusheng, et al.
Published: (2025)
by: Xu, Pusheng, et al.
Published: (2025)
VisionFM: a Multi-Modal Multi-Task Vision Foundation Model for Generalist Ophthalmic Artificial Intelligence
by: Qiu, Jianing, et al.
Published: (2023)
by: Qiu, Jianing, et al.
Published: (2023)
EH-Benchmark Ophthalmic Hallucination Benchmark and Agent-Driven Top-Down Traceable Reasoning Workflow
by: Pan, Xiaoyu, et al.
Published: (2025)
by: Pan, Xiaoyu, et al.
Published: (2025)
AI-powered virtual eye: perspective, challenges and opportunities
by: Wu, Yue, et al.
Published: (2025)
by: Wu, Yue, et al.
Published: (2025)
OBUSight: Clinically Aligned Generative AI for Ophthalmic Ultrasound Interpretation and Diagnosis
by: Xiaocong Liu, et al.
Published: (2026)
by: Xiaocong Liu, et al.
Published: (2026)
Choroidal Vessel Segmentation on Indocyanine Green Angiography Images via Human-in-the-Loop Labeling
by: Chen, Ruoyu, et al.
Published: (2024)
by: Chen, Ruoyu, et al.
Published: (2024)
Fundus2Globe: Generative AI-Driven 3D Digital Twins for Personalized Myopia Management
by: Shi, Danli, et al.
Published: (2025)
by: Shi, Danli, et al.
Published: (2025)
FFA Sora, video generation as fundus fluorescein angiography simulator
by: Wu, Xinyuan, et al.
Published: (2024)
by: Wu, Xinyuan, et al.
Published: (2024)
A Survey of Multimodal Ophthalmic Diagnostics: From Task-Specific Approaches to Foundational Models
by: Luo, Xiaoling, et al.
Published: (2025)
by: Luo, Xiaoling, et al.
Published: (2025)
Generalist Reward Models: Found Inside Large Language Models
by: Li, Yi-Chen, et al.
Published: (2025)
by: Li, Yi-Chen, et al.
Published: (2025)
Generalist versus Specialist Vision Foundation Models for Ocular Disease and Oculomics
by: Zhou, Yukun, et al.
Published: (2025)
by: Zhou, Yukun, et al.
Published: (2025)
Empowering Locally Deployable Medical Agent via State Enhanced Logical Skills for FHIR-based Clinical Tasks
by: Yang, Wanrong, et al.
Published: (2026)
by: Yang, Wanrong, et al.
Published: (2026)
Vision Foundation Models as Generalist Tokenizers for Image Generation
by: Zheng, Anlin, et al.
Published: (2026)
by: Zheng, Anlin, et al.
Published: (2026)
Stable Tracking of Eye Gaze Direction During Ophthalmic Surgery
by: Hong, Tinghe, et al.
Published: (2025)
by: Hong, Tinghe, et al.
Published: (2025)
A Clinical-oriented Multi-level Contrastive Learning Method for Disease Diagnosis in Low-quality Medical Images
by: Hou, Qingshan, et al.
Published: (2024)
by: Hou, Qingshan, et al.
Published: (2024)
LMOD: A Large Multimodal Ophthalmology Dataset and Benchmark for Large Vision-Language Models
by: Qin, Zhenyue, et al.
Published: (2024)
by: Qin, Zhenyue, et al.
Published: (2024)
A Clinician-Friendly Platform for Ophthalmic Image Analysis Without Technical Barriers
by: Wang, Meng, et al.
Published: (2025)
by: Wang, Meng, et al.
Published: (2025)
AI in ophthalmology: From invisible to visible
by: Mingguang He
Published: (2024)
by: Mingguang He
Published: (2024)
Retinal microvasculature alterations are associated with mild behavioral impairment in a memory clinic population
by: Yingqi Liao, et al.
Published: (2025)
by: Yingqi Liao, et al.
Published: (2025)
Game-TARS: Pretrained Foundation Models for Scalable Generalist Multimodal Game Agents
by: Wang, Zihao, et al.
Published: (2025)
by: Wang, Zihao, et al.
Published: (2025)
Enabling Ultra-Fast Cardiovascular Imaging Across Heterogeneous Clinical Environments with A Generalist Foundation Model and Multimodal Database
by: Wang, Zi, et al.
Published: (2025)
by: Wang, Zi, et al.
Published: (2025)
Predicting Diabetic Macular Edema Treatment Responses Using OCT: Dataset and Methods of APTOS Competition
by: Zhang, Weiyi, et al.
Published: (2025)
by: Zhang, Weiyi, et al.
Published: (2025)
A Novel Ophthalmic Benchmark for Evaluating Multimodal Large Language Models with Fundus Photographs and OCT Images
by: Liang, Xiaoyi, et al.
Published: (2025)
by: Liang, Xiaoyi, et al.
Published: (2025)
Lingshu: A Generalist Foundation Model for Unified Multimodal Medical Understanding and Reasoning
by: LASA Team, et al.
Published: (2025)
by: LASA Team, et al.
Published: (2025)
A Multimodal Foundation Agent for Financial Trading: Tool-Augmented, Diversified, and Generalist
by: Zhang, Wentao, et al.
Published: (2024)
by: Zhang, Wentao, et al.
Published: (2024)
MedVersa: A Generalist Foundation Model for Medical Image Interpretation
by: Zhou, Hong-Yu, et al.
Published: (2024)
by: Zhou, Hong-Yu, et al.
Published: (2024)
Association of metabolomic aging acceleration and body mass index phenotypes with mortality and obesity‐related morbidities
by: Xiaomin Zeng, et al.
Published: (2024)
by: Xiaomin Zeng, et al.
Published: (2024)
Similar Items
-
Benchmarking Large Multimodal Models for Ophthalmic Visual Question Answering with OphthalWeChat
by: Xu, Pusheng, et al.
Published: (2025) -
EyeGPT: Ophthalmic Assistant with Large Language Models
by: Chen, Xiaolan, et al.
Published: (2024) -
Fundus2Video: Cross-Modal Angiography Video Generation from Static Fundus Photography with Clinical Knowledge Guidance
by: Zhang, Weiyi, et al.
Published: (2024) -
Fundus to Fluorescein Angiography Video Generation as a Retinal Generative Foundation Model
by: Zhang, Weiyi, et al.
Published: (2024) -
EyeWorld: A Generative World Model of Ocular State and Dynamics
by: Gao, Ziyu, et al.
Published: (2026)