Saved in:
Bibliographic Details
Main Authors: Prasad, Apoorv, McRoy, Susan
Format: Preprint
Published: 2026
Subjects:
Online Access:https://arxiv.org/abs/2604.14356
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866913167997665280
author Prasad, Apoorv
McRoy, Susan
author_facet Prasad, Apoorv
McRoy, Susan
contents Women with polycystic ovary syndrome (PCOS) face substantially elevated risks of body image distress, disordered eating, and metabolic challenges, yet existing natural language processing approaches for detecting these conditions lack transparency and cannot identify co-occurring presentations. We developed small, open-source language models to automatically detect this triple burden in social media posts with grounded explainability. We collected 1,000 PCOS-related posts from six subreddits, with two trained annotators labeling posts using guidelines operationalizing Lee et al. (2017) clinical framework. Three models (Gemma-2-2B, Qwen3-1.7B, DeepSeek-R1-Distill-Qwen-1.5B) were fine-tuned using Low-Rank Adaptation to generate structured explanations with textual evidence. The best model achieved 75.3 percent exact match accuracy on 150 held-out posts, with robust comorbidity detection and strong explainability. Performance declined with diagnostic complexity, indicating their best use is for screening rather than autonomous diagnosis.
format Preprint
id arxiv_https___arxiv_org_abs_2604_14356
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle When PCOS Meets Eating Disorders: An Explainable AI Approach to Detecting the Hidden Triple Burden
Prasad, Apoorv
McRoy, Susan
Computation and Language
Artificial Intelligence
Women with polycystic ovary syndrome (PCOS) face substantially elevated risks of body image distress, disordered eating, and metabolic challenges, yet existing natural language processing approaches for detecting these conditions lack transparency and cannot identify co-occurring presentations. We developed small, open-source language models to automatically detect this triple burden in social media posts with grounded explainability. We collected 1,000 PCOS-related posts from six subreddits, with two trained annotators labeling posts using guidelines operationalizing Lee et al. (2017) clinical framework. Three models (Gemma-2-2B, Qwen3-1.7B, DeepSeek-R1-Distill-Qwen-1.5B) were fine-tuned using Low-Rank Adaptation to generate structured explanations with textual evidence. The best model achieved 75.3 percent exact match accuracy on 150 held-out posts, with robust comorbidity detection and strong explainability. Performance declined with diagnostic complexity, indicating their best use is for screening rather than autonomous diagnosis.
title When PCOS Meets Eating Disorders: An Explainable AI Approach to Detecting the Hidden Triple Burden
topic Computation and Language
Artificial Intelligence
url https://arxiv.org/abs/2604.14356