Saved in:
Bibliographic Details
Main Authors: Feng, Kaidong, Sun, Zhu, Lee, Roy Ka-Wei, Jiang, Xun, Theng, Yin-Leng, Ding, Yi
Format: Preprint
Published: 2026
Subjects:
Online Access:https://arxiv.org/abs/2601.03603
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866912819341950976
author Feng, Kaidong
Sun, Zhu
Lee, Roy Ka-Wei
Jiang, Xun
Theng, Yin-Leng
Ding, Yi
author_facet Feng, Kaidong
Sun, Zhu
Lee, Roy Ka-Wei
Jiang, Xun
Theng, Yin-Leng
Ding, Yi
contents Smartphone sensing offers an unobtrusive and scalable way to track daily behaviors linked to mental health, capturing changes in sleep, mobility, and phone use that often precede symptoms of stress, anxiety, or depression. While most prior studies focus on detection that responds to existing conditions, forecasting mental health enables proactive support through Just-in-Time Adaptive Interventions. In this paper, we present the first comprehensive benchmarking study comparing traditional machine learning (ML), deep learning (DL), and large language model (LLM) approaches for mental health forecasting using the College Experience Sensing (CES) dataset, the most extensive longitudinal dataset of college student mental health to date. We systematically evaluate models across temporal windows, feature granularities, personalization strategies, and class imbalance handling. Our results show that DL models, particularly Transformer (Macro-F1 = 0.58), achieve the best overall performance, while LLMs show strength in contextual reasoning but weaker temporal modeling. Personalization substantially improves forecasts of severe mental health states. By revealing how different modeling approaches interpret phone sensing behavioral data over time, this work lays the groundwork for next-generation, adaptive, and human-centered mental health technologies that can advance both research and real-world well-being.
format Preprint
id arxiv_https___arxiv_org_abs_2601_03603
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle A Comparative Study of Traditional Machine Learning, Deep Learning, and Large Language Models for Mental Health Forecasting using Smartphone Sensing Data
Feng, Kaidong
Sun, Zhu
Lee, Roy Ka-Wei
Jiang, Xun
Theng, Yin-Leng
Ding, Yi
Machine Learning
Smartphone sensing offers an unobtrusive and scalable way to track daily behaviors linked to mental health, capturing changes in sleep, mobility, and phone use that often precede symptoms of stress, anxiety, or depression. While most prior studies focus on detection that responds to existing conditions, forecasting mental health enables proactive support through Just-in-Time Adaptive Interventions. In this paper, we present the first comprehensive benchmarking study comparing traditional machine learning (ML), deep learning (DL), and large language model (LLM) approaches for mental health forecasting using the College Experience Sensing (CES) dataset, the most extensive longitudinal dataset of college student mental health to date. We systematically evaluate models across temporal windows, feature granularities, personalization strategies, and class imbalance handling. Our results show that DL models, particularly Transformer (Macro-F1 = 0.58), achieve the best overall performance, while LLMs show strength in contextual reasoning but weaker temporal modeling. Personalization substantially improves forecasts of severe mental health states. By revealing how different modeling approaches interpret phone sensing behavioral data over time, this work lays the groundwork for next-generation, adaptive, and human-centered mental health technologies that can advance both research and real-world well-being.
title A Comparative Study of Traditional Machine Learning, Deep Learning, and Large Language Models for Mental Health Forecasting using Smartphone Sensing Data
topic Machine Learning
url https://arxiv.org/abs/2601.03603