Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Rathod, Ameya, Belsare, Sujay, Nautiyal, Salvik Krishna, Laad, Dhruv, Kumaraguru, Ponnurangam
Format: Preprint
Veröffentlicht: 2026
Schlagworte:
Online-Zugang:https://arxiv.org/abs/2602.06899
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
_version_ 1866912885004828672
author Rathod, Ameya
Belsare, Sujay
Nautiyal, Salvik Krishna
Laad, Dhruv
Kumaraguru, Ponnurangam
author_facet Rathod, Ameya
Belsare, Sujay
Nautiyal, Salvik Krishna
Laad, Dhruv
Kumaraguru, Ponnurangam
contents Recovering a unique causal graph from observational data is an ill-posed problem because multiple generating mechanisms can lead to the same observational distribution. This problem becomes solvable only by exploiting specific structural or distributional assumptions. While recent work has separately utilized time-series dynamics or multi-environment heterogeneity to constrain this problem, we integrate both as complementary sources of heterogeneity. This integration yields unified necessary identifiability conditions and enables a rigorous analysis of the statistical limits of recovery under thin versus heavy-tailed noise. In particular, temporal structure is shown to effectively substitute for missing environmental diversity, possibly achieving identifiability even under insufficient heterogeneity. Extending this analysis to heavy-tailed (Student's t) distributions, we demonstrate that while geometric identifiability conditions remain invariant, the sample complexity diverges significantly from the Gaussian baseline. Explicit information-theoretic bounds quantify this cost of robustness, establishing the fundamental limits of covariance-based causal graph recovery methods in realistic non-stationary systems. This work shifts the focus from whether causal structure is identifiable to whether it is statistically recoverable in practice.
format Preprint
id arxiv_https___arxiv_org_abs_2602_06899
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle Sample Complexity of Causal Identification with Temporal Heterogeneity
Rathod, Ameya
Belsare, Sujay
Nautiyal, Salvik Krishna
Laad, Dhruv
Kumaraguru, Ponnurangam
Machine Learning
Recovering a unique causal graph from observational data is an ill-posed problem because multiple generating mechanisms can lead to the same observational distribution. This problem becomes solvable only by exploiting specific structural or distributional assumptions. While recent work has separately utilized time-series dynamics or multi-environment heterogeneity to constrain this problem, we integrate both as complementary sources of heterogeneity. This integration yields unified necessary identifiability conditions and enables a rigorous analysis of the statistical limits of recovery under thin versus heavy-tailed noise. In particular, temporal structure is shown to effectively substitute for missing environmental diversity, possibly achieving identifiability even under insufficient heterogeneity. Extending this analysis to heavy-tailed (Student's t) distributions, we demonstrate that while geometric identifiability conditions remain invariant, the sample complexity diverges significantly from the Gaussian baseline. Explicit information-theoretic bounds quantify this cost of robustness, establishing the fundamental limits of covariance-based causal graph recovery methods in realistic non-stationary systems. This work shifts the focus from whether causal structure is identifiable to whether it is statistically recoverable in practice.
title Sample Complexity of Causal Identification with Temporal Heterogeneity
topic Machine Learning
url https://arxiv.org/abs/2602.06899