Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Shen, Yujie, Wang, Zihan, Qian, Jian, Lei, Qi
Format:	Preprint
Published:	2026
Subjects:	Machine Learning Cryptography and Security
Online Access:	https://arxiv.org/abs/2602.08723
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866908822913679360
author	Shen, Yujie Wang, Zihan Qian, Jian Lei, Qi
author_facet	Shen, Yujie Wang, Zihan Qian, Jian Lei, Qi
contents	Training data reconstruction from KKT conditions has shown striking empirical success, yet it remains unclear when the resulting KKT equations have unique solutions and, even in identifiable regimes, how to reliably recover solutions by optimization. This work hereby focuses on these two complementary questions: identifiability and optimization. On the identifiability side, we discuss the sufficient conditions for KKT system of two-layer networks with polynomial activations to uniquely determine the training data, providing a theoretical explanation of when and why reconstruction is possible. On the optimization side, we introduce sample splitting, a curvature-aware refinement step applicable to general reconstruction objectives (not limited to KKT-based formulations): it creates additional descent directions to escape poor stationary points and refine solutions. Experiments demonstrate that augmenting several existing reconstruction methods with sample splitting consistently improves reconstruction performance.
format	Preprint
id	arxiv_https___arxiv_org_abs_2602_08723
institution	arXiv
publishDate	2026
record_format	arxiv
spellingShingle	Data Reconstruction: Identifiability and Optimization with Sample Splitting Shen, Yujie Wang, Zihan Qian, Jian Lei, Qi Machine Learning Cryptography and Security Training data reconstruction from KKT conditions has shown striking empirical success, yet it remains unclear when the resulting KKT equations have unique solutions and, even in identifiable regimes, how to reliably recover solutions by optimization. This work hereby focuses on these two complementary questions: identifiability and optimization. On the identifiability side, we discuss the sufficient conditions for KKT system of two-layer networks with polynomial activations to uniquely determine the training data, providing a theoretical explanation of when and why reconstruction is possible. On the optimization side, we introduce sample splitting, a curvature-aware refinement step applicable to general reconstruction objectives (not limited to KKT-based formulations): it creates additional descent directions to escape poor stationary points and refine solutions. Experiments demonstrate that augmenting several existing reconstruction methods with sample splitting consistently improves reconstruction performance.
title	Data Reconstruction: Identifiability and Optimization with Sample Splitting
topic	Machine Learning Cryptography and Security
url	https://arxiv.org/abs/2602.08723

Similar Items