Saved in:
Bibliographic Details
Main Authors: Kampani, Shiv, Hidary, David, van der Poel, Constantijn, Ganahl, Martin, Miao, Brenda
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2410.21141
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866912090047905792
author Kampani, Shiv
Hidary, David
van der Poel, Constantijn
Ganahl, Martin
Miao, Brenda
author_facet Kampani, Shiv
Hidary, David
van der Poel, Constantijn
Ganahl, Martin
Miao, Brenda
contents The discovery of causal relationships between random variables is an important yet challenging problem that has applications across many scientific domains. Differentiable causal discovery (DCD) methods are effective in uncovering causal relationships from observational data; however, these approaches often suffer from limited interpretability and face challenges in incorporating domain-specific prior knowledge. In contrast, Large Language Models (LLMs)-based causal discovery approaches have recently been shown capable of providing useful priors for causal discovery but struggle with formal causal reasoning. In this paper, we propose LLM-DCD, which uses an LLM to initialize the optimization of the maximum likelihood objective function of DCD approaches, thereby incorporating strong priors into the discovery method. To achieve this initialization, we design our objective function to depend on an explicitly defined adjacency matrix of the causal graph as its only variational parameter. Directly optimizing the explicitly defined adjacency matrix provides a more interpretable approach to causal discovery. Additionally, we demonstrate higher accuracy on key benchmarking datasets of our approach compared to state-of-the-art alternatives, and provide empirical evidence that the quality of the initialization directly impacts the quality of the final output of our DCD approach. LLM-DCD opens up new opportunities for traditional causal discovery methods like DCD to benefit from future improvements in the causal reasoning capabilities of LLMs.
format Preprint
id arxiv_https___arxiv_org_abs_2410_21141
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle LLM-initialized Differentiable Causal Discovery
Kampani, Shiv
Hidary, David
van der Poel, Constantijn
Ganahl, Martin
Miao, Brenda
Machine Learning
The discovery of causal relationships between random variables is an important yet challenging problem that has applications across many scientific domains. Differentiable causal discovery (DCD) methods are effective in uncovering causal relationships from observational data; however, these approaches often suffer from limited interpretability and face challenges in incorporating domain-specific prior knowledge. In contrast, Large Language Models (LLMs)-based causal discovery approaches have recently been shown capable of providing useful priors for causal discovery but struggle with formal causal reasoning. In this paper, we propose LLM-DCD, which uses an LLM to initialize the optimization of the maximum likelihood objective function of DCD approaches, thereby incorporating strong priors into the discovery method. To achieve this initialization, we design our objective function to depend on an explicitly defined adjacency matrix of the causal graph as its only variational parameter. Directly optimizing the explicitly defined adjacency matrix provides a more interpretable approach to causal discovery. Additionally, we demonstrate higher accuracy on key benchmarking datasets of our approach compared to state-of-the-art alternatives, and provide empirical evidence that the quality of the initialization directly impacts the quality of the final output of our DCD approach. LLM-DCD opens up new opportunities for traditional causal discovery methods like DCD to benefit from future improvements in the causal reasoning capabilities of LLMs.
title LLM-initialized Differentiable Causal Discovery
topic Machine Learning
url https://arxiv.org/abs/2410.21141