Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Tang, Yixuan, Lin, Zhenghong, Sun, Yandong, Hsu, Wynne, Lee, Mong Li, Tung, Anthony K. H.
Format:	Preprint
Published:	2026
Subjects:	Computation and Language Artificial Intelligence
Online Access:	https://arxiv.org/abs/2603.01690
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866910038767960064
author	Tang, Yixuan Lin, Zhenghong Sun, Yandong Hsu, Wynne Lee, Mong Li Tung, Anthony K. H.
author_facet	Tang, Yixuan Lin, Zhenghong Sun, Yandong Hsu, Wynne Lee, Mong Li Tung, Anthony K. H.
contents	While dense biomedical embeddings achieve strong performance, their black-box nature limits their utility in clinical decision-making. Recent question-based interpretable embeddings represent text as binary answers to natural-language questions, but these approaches often rely on heuristic or surface-level contrastive signals and overlook specialized domain knowledge. We propose QIME, an ontology-grounded framework for constructing interpretable medical text embeddings in which each dimension corresponds to a clinically meaningful yes/no question. By conditioning on cluster-specific medical concept signatures, QIME generates semantically atomic questions that capture fine-grained distinctions in biomedical text. Furthermore, QIME supports a training-free embedding construction strategy that eliminates per-question classifier training while further improving performance. Experiments across biomedical semantic similarity, clustering, and retrieval benchmarks show that QIME consistently outperforms prior interpretable embedding methods and substantially narrows the gap to strong black-box biomedical encoders, while providing concise and clinically informative explanations.
format	Preprint
id	arxiv_https___arxiv_org_abs_2603_01690
institution	arXiv
publishDate	2026
record_format	arxiv
spellingShingle	QIME: Constructing Interpretable Medical Text Embeddings via Ontology-Grounded Questions Tang, Yixuan Lin, Zhenghong Sun, Yandong Hsu, Wynne Lee, Mong Li Tung, Anthony K. H. Computation and Language Artificial Intelligence While dense biomedical embeddings achieve strong performance, their black-box nature limits their utility in clinical decision-making. Recent question-based interpretable embeddings represent text as binary answers to natural-language questions, but these approaches often rely on heuristic or surface-level contrastive signals and overlook specialized domain knowledge. We propose QIME, an ontology-grounded framework for constructing interpretable medical text embeddings in which each dimension corresponds to a clinically meaningful yes/no question. By conditioning on cluster-specific medical concept signatures, QIME generates semantically atomic questions that capture fine-grained distinctions in biomedical text. Furthermore, QIME supports a training-free embedding construction strategy that eliminates per-question classifier training while further improving performance. Experiments across biomedical semantic similarity, clustering, and retrieval benchmarks show that QIME consistently outperforms prior interpretable embedding methods and substantially narrows the gap to strong black-box biomedical encoders, while providing concise and clinically informative explanations.
title	QIME: Constructing Interpretable Medical Text Embeddings via Ontology-Grounded Questions
topic	Computation and Language Artificial Intelligence
url	https://arxiv.org/abs/2603.01690

Similar Items