:: Library Catalog

Bibliographic Details
Main Authors:	Shi, Danli, Zhang, Weiyi, Chen, Xiaolan, Liu, Yexin, Yang, Jiancheng, Huang, Siyu, Tham, Yih Chung, Zheng, Yingfeng, He, Mingguang
Format:	Preprint
Published:	2024
Subjects:	Computer Vision and Pattern Recognition; Artificial Intelligence
Online Access:	https://arxiv.org/abs/2405.11338

Similar Items

Benchmarking Large Multimodal Models for Ophthalmic Visual Question Answering with OphthalWeChat
by: Xu, Pusheng, et al.
Published: (2025)

EyeGPT: Ophthalmic Assistant with Large Language Models
by: Chen, Xiaolan, et al.
Published: (2024)

Fundus2Video: Cross-Modal Angiography Video Generation from Static Fundus Photography with Clinical Knowledge Guidance
by: Zhang, Weiyi, et al.
Published: (2024)

Fundus to Fluorescein Angiography Video Generation as a Retinal Generative Foundation Model
by: Zhang, Weiyi, et al.
Published: (2024)

EyeWorld: A Generative World Model of Ocular State and Dynamics
by: Gao, Ziyu, et al.
Published: (2026)

EyeCLIP: A visual-language foundation model for multi-modal ophthalmic image analysis
by: Shi, Danli, et al.
Published: (2024)

Evaluating large language models in medical applications: a survey
by: Chen, Xiaolan, et al.
Published: (2024)

EyeDiff: text-to-image diffusion model improves rare eye disease diagnosis
by: Chen, Ruoyu, et al.
Published: (2024)

EyeAgent: An Agentic AI System for Multimodal Clinical Decision Support in Ophthalmology
by: Shi, Danli, et al.
Published: (2025)

Visual Question Answering in Ophthalmology: A Progressive and Practical Perspective
by: Chen, Xiaolan, et al.
Published: (2024)

FusionFM: Fusing Eye-specific Foundational Models for Optimized Ophthalmic Diagnosis
by: Zou, Ke, et al.
Published: (2025)

ChatMyopia: An AI Agent for Pre-consultation Education in Primary Eye Care Settings
by: Wu, Yue, et al.
Published: (2025)

UWF-RI2FA: Generating Multi-frame Ultrawide-field Fluorescein Angiography from Ultrawide-field Retinal Imaging Improves Diabetic Retinopathy Stratification
by: Chen, Ruoyu, et al.
Published: (2024)

DeepSeek-R1 Outperforms Gemini 2.0 Pro, OpenAI o1, and o3-mini in Bilingual Complex Ophthalmology Reasoning
by: Xu, Pusheng, et al.
Published: (2025)

VisionFM: a Multi-Modal Multi-Task Vision Foundation Model for Generalist Ophthalmic Artificial Intelligence
by: Qiu, Jianing, et al.
Published: (2023)

EH-Benchmark Ophthalmic Hallucination Benchmark and Agent-Driven Top-Down Traceable Reasoning Workflow
by: Pan, Xiaoyu, et al.
Published: (2025)

AI-powered virtual eye: perspective, challenges and opportunities
by: Wu, Yue, et al.
Published: (2025)

OBUSight: Clinically Aligned Generative AI for Ophthalmic Ultrasound Interpretation and Diagnosis
by: Xiaocong Liu, et al.
Published: (2026)

Choroidal Vessel Segmentation on Indocyanine Green Angiography Images via Human-in-the-Loop Labeling
by: Chen, Ruoyu, et al.
Published: (2024)

Fundus2Globe: Generative AI-Driven 3D Digital Twins for Personalized Myopia Management
by: Shi, Danli, et al.
Published: (2025)

FFA Sora, video generation as fundus fluorescein angiography simulator
by: Wu, Xinyuan, et al.
Published: (2024)

A Survey of Multimodal Ophthalmic Diagnostics: From Task-Specific Approaches to Foundational Models
by: Luo, Xiaoling, et al.
Published: (2025)

Generalist Reward Models: Found Inside Large Language Models
by: Li, Yi-Chen, et al.
Published: (2025)

Generalist versus Specialist Vision Foundation Models for Ocular Disease and Oculomics
by: Zhou, Yukun, et al.
Published: (2025)

Empowering Locally Deployable Medical Agent via State Enhanced Logical Skills for FHIR-based Clinical Tasks
by: Yang, Wanrong, et al.
Published: (2026)

Vision Foundation Models as Generalist Tokenizers for Image Generation
by: Zheng, Anlin, et al.
Published: (2026)

Stable Tracking of Eye Gaze Direction During Ophthalmic Surgery
by: Hong, Tinghe, et al.
Published: (2025)

A Clinical-oriented Multi-level Contrastive Learning Method for Disease Diagnosis in Low-quality Medical Images
by: Hou, Qingshan, et al.
Published: (2024)

LMOD: A Large Multimodal Ophthalmology Dataset and Benchmark for Large Vision-Language Models
by: Qin, Zhenyue, et al.
Published: (2024)

A Clinician-Friendly Platform for Ophthalmic Image Analysis Without Technical Barriers
by: Wang, Meng, et al.
Published: (2025)

AI in ophthalmology: From invisible to visible
by: Mingguang He
Published: (2024)

Retinal microvasculature alterations are associated with mild behavioral impairment in a memory clinic population
by: Yingqi Liao, et al.
Published: (2025)

Game-TARS: Pretrained Foundation Models for Scalable Generalist Multimodal Game Agents
by: Wang, Zihao, et al.
Published: (2025)

Enabling Ultra-Fast Cardiovascular Imaging Across Heterogeneous Clinical Environments with A Generalist Foundation Model and Multimodal Database
by: Wang, Zi, et al.
Published: (2025)

Predicting Diabetic Macular Edema Treatment Responses Using OCT: Dataset and Methods of APTOS Competition
by: Zhang, Weiyi, et al.
Published: (2025)

A Novel Ophthalmic Benchmark for Evaluating Multimodal Large Language Models with Fundus Photographs and OCT Images
by: Liang, Xiaoyi, et al.
Published: (2025)

Lingshu: A Generalist Foundation Model for Unified Multimodal Medical Understanding and Reasoning
by: LASA Team, et al.
Published: (2025)

A Multimodal Foundation Agent for Financial Trading: Tool-Augmented, Diversified, and Generalist
by: Zhang, Wentao, et al.
Published: (2024)

MedVersa: A Generalist Foundation Model for Medical Image Interpretation
by: Zhou, Hong-Yu, et al.
Published: (2024)

Association of metabolomic aging acceleration and body mass index phenotypes with mortality and obesity‐related morbidities
by: Xiaomin Zeng, et al.
Published: (2024)