Enregistré dans:
Détails bibliographiques
Auteurs principaux: Wan, Midi, Li, Pengfei, Liang, Yizhuo, Wu, Di, Pan, Yushan, Zhu, Guangzhen, Wang, Hao
Format: Preprint
Publié: 2025
Sujets:
Accès en ligne:https://arxiv.org/abs/2505.08247
Tags: Ajouter un tag
Pas de tags, Soyez le premier à ajouter un tag!
_version_ 1866909608632647680
author Wan, Midi
Li, Pengfei
Liang, Yizhuo
Wu, Di
Pan, Yushan
Zhu, Guangzhen
Wang, Hao
author_facet Wan, Midi
Li, Pengfei
Liang, Yizhuo
Wu, Di
Pan, Yushan
Zhu, Guangzhen
Wang, Hao
contents Medical image synthesis plays a crucial role in providing anatomically accurate images for diagnosis and treatment. Hallux valgus, which affects approximately 19% of the global population, requires frequent weight-bearing X-rays for assessment, placing additional strain on both patients and healthcare providers. Existing X-ray models often struggle to balance image fidelity, skeletal consistency, and physical constraints, particularly in diffusion-based methods that lack skeletal guidance. We propose the Skeletal-Constrained Conditional Diffusion Model (SCCDM) and introduce KCC, a foot evaluation method utilizing skeletal landmarks. SCCDM incorporates multi-scale feature extraction and attention mechanisms, improving the Structural Similarity Index (SSIM) by 5.72% (0.794) and Peak Signal-to-Noise Ratio (PSNR) by 18.34% (21.40 dB). When combined with KCC, the model achieves an average score of 0.85, demonstrating strong clinical applicability. The code is available at https://github.com/midisec/SCCDM.
format Preprint
id arxiv_https___arxiv_org_abs_2505_08247
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle Skeleton-Guided Diffusion Model for Accurate Foot X-ray Synthesis in Hallux Valgus Diagnosis
Wan, Midi
Li, Pengfei
Liang, Yizhuo
Wu, Di
Pan, Yushan
Zhu, Guangzhen
Wang, Hao
Image and Video Processing
Computer Vision and Pattern Recognition
Medical image synthesis plays a crucial role in providing anatomically accurate images for diagnosis and treatment. Hallux valgus, which affects approximately 19% of the global population, requires frequent weight-bearing X-rays for assessment, placing additional strain on both patients and healthcare providers. Existing X-ray models often struggle to balance image fidelity, skeletal consistency, and physical constraints, particularly in diffusion-based methods that lack skeletal guidance. We propose the Skeletal-Constrained Conditional Diffusion Model (SCCDM) and introduce KCC, a foot evaluation method utilizing skeletal landmarks. SCCDM incorporates multi-scale feature extraction and attention mechanisms, improving the Structural Similarity Index (SSIM) by 5.72% (0.794) and Peak Signal-to-Noise Ratio (PSNR) by 18.34% (21.40 dB). When combined with KCC, the model achieves an average score of 0.85, demonstrating strong clinical applicability. The code is available at https://github.com/midisec/SCCDM.
title Skeleton-Guided Diffusion Model for Accurate Foot X-ray Synthesis in Hallux Valgus Diagnosis
topic Image and Video Processing
Computer Vision and Pattern Recognition
url https://arxiv.org/abs/2505.08247