Saved in:
Bibliographic Details
Main Author: K, Mithra D
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2512.02489
Tags: Add Tag
No Tags, Be the first to tag this record!
Table of Contents:
  • I present an application of established machine learning techniques to NHANES health survey data for predicting diabetes status. I compare baseline models (logistic regression, random forest, XGBoost) with a hybrid approach that uses an XGBoost feature encoder and a lightweight multilayer perceptron (MLP) head. Experiments show the hybrid model attains improved AUC and balanced accuracy compared to baselines on the processed NHANES subset. I release code and reproducible scripts to encourage replication.