Bibliographic Details
Main Authors: Shen, Yujie, Wang, Zihan, Qian, Jian, Lei, Qi
Format: Preprint
Published: 2026
Subjects: Machine Learning; Cryptography and Security
Online Access: https://arxiv.org/abs/2602.08723
_version_ 1866908822913679360
author Shen, Yujie
Wang, Zihan
Qian, Jian
Lei, Qi
contents Training data reconstruction from Karush-Kuhn-Tucker (KKT) conditions has shown striking empirical success, yet it remains unclear when the resulting KKT equations have unique solutions and, even in identifiable regimes, how to reliably recover those solutions by optimization. This work addresses these two complementary questions: identifiability and optimization. On the identifiability side, we discuss sufficient conditions for the KKT system of two-layer networks with polynomial activations to uniquely determine the training data, providing a theoretical explanation of when and why reconstruction is possible. On the optimization side, we introduce sample splitting, a curvature-aware refinement step applicable to general reconstruction objectives (not limited to KKT-based formulations): it creates additional descent directions that help escape poor stationary points and refine solutions. Experiments demonstrate that augmenting several existing reconstruction methods with sample splitting consistently improves reconstruction performance.
format Preprint
id arxiv_https___arxiv_org_abs_2602_08723
institution arXiv
publishDate 2026
record_format arxiv
title Data Reconstruction: Identifiability and Optimization with Sample Splitting
topic Machine Learning
Cryptography and Security
url https://arxiv.org/abs/2602.08723