Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Zaman, Fahim Ahmed, Jacob, Mathews, Chang, Amanda, Liu, Kan, Sonka, Milan, Wu, Xiaodong
Format:	Preprint
Published:	2024
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2407.12952
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866915108894015488
author	Zaman, Fahim Ahmed Jacob, Mathews Chang, Amanda Liu, Kan Sonka, Milan Wu, Xiaodong
author_facet	Zaman, Fahim Ahmed Jacob, Mathews Chang, Amanda Liu, Kan Sonka, Milan Wu, Xiaodong
contents	Diffusion Probabilistic Models (DPMs) suffer from inefficient inference due to their slow sampling and high memory consumption, which limits their applicability to various medical imaging applications. In this work, we propose a novel conditional diffusion modeling framework (LDSeg) for medical image segmentation, utilizing the learned inherent low-dimensional latent shape manifolds of the target objects and the embeddings of the source image with an end-to-end framework. Conditional diffusion in latent space not only ensures accurate image segmentation for multiple interacting objects, but also tackles the fundamental issues of traditional DPM-based segmentation methods: (1) high memory consumption, (2) time-consuming sampling process, and (3) unnatural noise injection in the forward and reverse processes. The end-to-end training strategy enables robust representation learning in the latent space related to segmentation features, ensuring significantly faster sampling from the posterior distribution for segmentation generation in the inference phase. Our experiments demonstrate that LDSeg achieved state-of-the-art segmentation accuracy on three medical image datasets with different imaging modalities. In addition, we showed that our proposed model was significantly more robust to noise compared to traditional deterministic segmentation models. The code is available at https://github.com/FahimZaman/LDSeg.git.
format	Preprint
id	arxiv_https___arxiv_org_abs_2407_12952
institution	arXiv
publishDate	2024
record_format	arxiv
spellingShingle	Latent Diffusion for Medical Image Segmentation: End to end learning for fast sampling and accuracy Zaman, Fahim Ahmed Jacob, Mathews Chang, Amanda Liu, Kan Sonka, Milan Wu, Xiaodong Computer Vision and Pattern Recognition Diffusion Probabilistic Models (DPMs) suffer from inefficient inference due to their slow sampling and high memory consumption, which limits their applicability to various medical imaging applications. In this work, we propose a novel conditional diffusion modeling framework (LDSeg) for medical image segmentation, utilizing the learned inherent low-dimensional latent shape manifolds of the target objects and the embeddings of the source image with an end-to-end framework. Conditional diffusion in latent space not only ensures accurate image segmentation for multiple interacting objects, but also tackles the fundamental issues of traditional DPM-based segmentation methods: (1) high memory consumption, (2) time-consuming sampling process, and (3) unnatural noise injection in the forward and reverse processes. The end-to-end training strategy enables robust representation learning in the latent space related to segmentation features, ensuring significantly faster sampling from the posterior distribution for segmentation generation in the inference phase. Our experiments demonstrate that LDSeg achieved state-of-the-art segmentation accuracy on three medical image datasets with different imaging modalities. In addition, we showed that our proposed model was significantly more robust to noise compared to traditional deterministic segmentation models. The code is available at https://github.com/FahimZaman/LDSeg.git.
title	Latent Diffusion for Medical Image Segmentation: End to end learning for fast sampling and accuracy
topic	Computer Vision and Pattern Recognition
url	https://arxiv.org/abs/2407.12952

Similar Items