Library Catalog

Bibliographic Details
Main authors:	Gao, Ziyu; Wu, Xinyuan; Chen, Xiaolan; Liu, Zhuoran; Chen, Ruoyu; Liu, Bowen; Yan, Bingjie; Wang, Zhenhan; Jin, Kai; Yang, Jiancheng; Tham, Yih Chung; He, Mingguang; Shi, Danli
Format:	Preprint
Published:	2026
Subjects:	Computer Vision and Pattern Recognition
Online access:	https://arxiv.org/abs/2603.14039

Similar Items

EyeFound: A Multimodal Generalist Foundation Model for Ophthalmic Imaging
by: Shi, Danli, et al.
Published: (2024)

EyeDiff: text-to-image diffusion model improves rare eye disease diagnosis
by: Chen, Ruoyu, et al.
Published: (2024)

EyeAgent: An Agentic AI System for Multimodal Clinical Decision Support in Ophthalmology
by: Shi, Danli, et al.
Published: (2025)

EyeCLIP: A visual-language foundation model for multi-modal ophthalmic image analysis
by: Shi, Danli, et al.
Published: (2024)

DeepSeek-R1 Outperforms Gemini 2.0 Pro, OpenAI o1, and o3-mini in Bilingual Complex Ophthalmology Reasoning
by: Xu, Pusheng, et al.
Published: (2025)

Visual Question Answering in Ophthalmology: A Progressive and Practical Perspective
by: Chen, Xiaolan, et al.
Published: (2024)

Fundus2Globe: Generative AI-Driven 3D Digital Twins for Personalized Myopia Management
by: Shi, Danli, et al.
Published: (2025)

Fundus to Fluorescein Angiography Video Generation as a Retinal Generative Foundation Model
by: Zhang, Weiyi, et al.
Published: (2024)

Benchmarking Large Multimodal Models for Ophthalmic Visual Question Answering with OphthalWeChat
by: Xu, Pusheng, et al.
Published: (2025)

FFA Sora, video generation as fundus fluorescein angiography simulator
by: Wu, Xinyuan, et al.
Published: (2024)

Evaluating large language models in medical applications: a survey
by: Chen, Xiaolan, et al.
Published: (2024)

ChatMyopia: An AI Agent for Pre-consultation Education in Primary Eye Care Settings
by: Wu, Yue, et al.
Published: (2025)

EyeGPT: Ophthalmic Assistant with Large Language Models
by: Chen, Xiaolan, et al.
Published: (2024)

Fundus2Video: Cross-Modal Angiography Video Generation from Static Fundus Photography with Clinical Knowledge Guidance
by: Zhang, Weiyi, et al.
Published: (2024)

Choroidal Vessel Segmentation on Indocyanine Green Angiography Images via Human-in-the-Loop Labeling
by: Chen, Ruoyu, et al.
Published: (2024)

UWF-RI2FA: Generating Multi-frame Ultrawide-field Fluorescein Angiography from Ultrawide-field Retinal Imaging Improves Diabetic Retinopathy Stratification
by: Chen, Ruoyu, et al.
Published: (2024)

Empowering Locally Deployable Medical Agent via State Enhanced Logical Skills for FHIR-based Clinical Tasks
by: Yang, Wanrong, et al.
Published: (2026)

AI-powered virtual eye: perspective, challenges and opportunities
by: Wu, Yue, et al.
Published: (2025)

Retinal microvasculature alterations are associated with mild behavioral impairment in a memory clinic population
by: Yingqi Liao, et al.
Published: (2025)

Bird's Eye View Based Pretrained World model for Visual Navigation
by: Lekkala, Kiran, et al.
Published: (2023)

Images of safe tourism destinations in the United States held by African Americans
by: Bingjie Liu
Published: (2013)

Agent-RewardBench: Towards a Unified Benchmark for Reward Modeling across Perception, Planning, and Safety in Real-World Multimodal Agents
by: Men, Tianyi, et al.
Published: (2025)

Evaluating the Real‐World Applicability of Eye‐Tracking Cognitive Assessment
by: Xinxin Liu, et al.
Published: (2026)

A Clinical-oriented Multi-level Contrastive Learning Method for Disease Diagnosis in Low-quality Medical Images
by: Hou, Qingshan, et al.
Published: (2024)

CineTrans: Learning to Generate Videos with Cinematic Transitions via Masked Diffusion Models
by: Wu, Xiaoxue, et al.
Published: (2025)

LMOD: A Large Multimodal Ophthalmology Dataset and Benchmark for Large Vision-Language Models
by: Qin, Zhenyue, et al.
Published: (2024)

Efficacy of Platelet-Rich Plasma Combined with Wen Shen Formula in the Treatment of Thin Endometrium-Related Infertility
by: Liu, Xiaofang, et al.
Published: (2025)

Do Blue Light Filters Reduce Visual Fatigue When Using Digital Maps? An Eye Tracking Experiment to Promote Vision Health
by: Sizhuo Gao, et al.
Published: (2025)

EH-Benchmark Ophthalmic Hallucination Benchmark and Agent-Driven Top-Down Traceable Reasoning Workflow
by: Pan, Xiaoyu, et al.
Published: (2025)

RWKU: Benchmarking Real-World Knowledge Unlearning for Large Language Models
by: Jin, Zhuoran, et al.
Published: (2024)

World-Env: Leveraging World Model as a Virtual Environment for VLA Post-Training
by: Xiao, Junjin, et al.
Published: (2025)

Imitation game: Ocular tuberculosis camouflaged as acute retinal necrosis
by: Xin Yee Chong, et al.
Published: (2025)

GenMed: A Pairwise Generative Reformulation of Medical Diagnostic Tasks
by: Zhang, Hantao, et al.
Published: (2026)

Association of metabolomic aging acceleration and body mass index phenotypes with mortality and obesity‐related morbidities
by: Xiaomin Zeng, et al.
Published: (2024)

Dry Eye Disease: Oxidative Stress on Ocular Surface and Cutting‐Edge Antioxidants
by: Rong Hu, et al.
Published: (2025)

APTOS-2024 challenge report: Generation of synthetic 3D OCT images from fundus photographs
by: Liu, Bowen, et al.
Published: (2025)

Humanoid Factors: Design Principles for AI Humanoids in Human Worlds
by: Liu, Xinyuan, et al.
Published: (2026)

FusionFM: Fusing Eye-specific Foundational Models for Optimized Ophthalmic Diagnosis
by: Zou, Ke, et al.
Published: (2025)

Reward Prediction with Factorized World States
by: Shen, Yijun, et al.
Published: (2026)

STARRY: Spatial-Temporal Action-Centric World Modeling for Robotic Manipulation
by: Tian, Yuxuan, et al.
Published: (2026)