Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Alamgeer, Sana, Souissi, Yasine, Ngu, Anne H. H.
Format:	Preprint
Published:	2025
Subjects:	Computation and Language Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2505.04660
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866908353554284544
author	Alamgeer, Sana Souissi, Yasine Ngu, Anne H. H.
author_facet	Alamgeer, Sana Souissi, Yasine Ngu, Anne H. H.
contents	Training fall detection systems is challenging due to the scarcity of real-world fall data, particularly from elderly individuals. To address this, we explore the potential of Large Language Models (LLMs) for generating synthetic fall data. This study evaluates text-to-motion (T2M, SATO, ParCo) and text-to-text models (GPT4o, GPT4, Gemini) in simulating realistic fall scenarios. We generate synthetic datasets and integrate them with four real-world baseline datasets to assess their impact on fall detection performance using a Long Short-Term Memory (LSTM) model. Additionally, we compare LLM-generated synthetic data with a diffusion-based method to evaluate their alignment with real accelerometer distributions. Results indicate that dataset characteristics significantly influence the effectiveness of synthetic data, with LLM-generated data performing best in low-frequency settings (e.g., 20Hz) while showing instability in high-frequency datasets (e.g., 200Hz). While text-to-motion models produce more realistic biomechanical data than text-to-text models, their impact on fall detection varies. Diffusion-based synthetic data demonstrates the closest alignment to real data but does not consistently enhance model performance. An ablation study further confirms that the effectiveness of synthetic data depends on sensor placement and fall representation. These findings provide insights into optimizing synthetic data generation for fall detection models.
format	Preprint
id	arxiv_https___arxiv_org_abs_2505_04660
institution	arXiv
publishDate	2025
record_format	arxiv
spellingShingle	AI-Generated Fall Data: Assessing LLMs and Diffusion Model for Wearable Fall Detection Alamgeer, Sana Souissi, Yasine Ngu, Anne H. H. Computation and Language Computer Vision and Pattern Recognition Training fall detection systems is challenging due to the scarcity of real-world fall data, particularly from elderly individuals. To address this, we explore the potential of Large Language Models (LLMs) for generating synthetic fall data. This study evaluates text-to-motion (T2M, SATO, ParCo) and text-to-text models (GPT4o, GPT4, Gemini) in simulating realistic fall scenarios. We generate synthetic datasets and integrate them with four real-world baseline datasets to assess their impact on fall detection performance using a Long Short-Term Memory (LSTM) model. Additionally, we compare LLM-generated synthetic data with a diffusion-based method to evaluate their alignment with real accelerometer distributions. Results indicate that dataset characteristics significantly influence the effectiveness of synthetic data, with LLM-generated data performing best in low-frequency settings (e.g., 20Hz) while showing instability in high-frequency datasets (e.g., 200Hz). While text-to-motion models produce more realistic biomechanical data than text-to-text models, their impact on fall detection varies. Diffusion-based synthetic data demonstrates the closest alignment to real data but does not consistently enhance model performance. An ablation study further confirms that the effectiveness of synthetic data depends on sensor placement and fall representation. These findings provide insights into optimizing synthetic data generation for fall detection models.
title	AI-Generated Fall Data: Assessing LLMs and Diffusion Model for Wearable Fall Detection
topic	Computation and Language Computer Vision and Pattern Recognition
url	https://arxiv.org/abs/2505.04660

Similar Items