Saved in:
Bibliographic Details
Main Authors: Sourav, Shashwat, Baibakova, Viktoriia, Das, Sanjay, Elgedawy, Ran, Mahbub, Maria, Herron, Emily, Ghosal, Tirthankar
Format: Preprint
Published: 2026
Subjects:
Online Access:https://arxiv.org/abs/2605.27176
Tags: Add Tag
No Tags, Be the first to tag this record!
Table of Contents:
  • Knowledge graphs (KGs) can provide structured scientific context to language models, but it remains unclear which graph facts actually shape the generated hypotheses. We study KG-guided hypothesis generation for battery materials across Mistral-7B, Llama-3.1-70B, and Gemini 2.5 Flash. We perturb local KGs by varying density, ontology richness, topology, and control structure, and evaluate outputs with both provided-graph and fixed-reference metrics. Across models, KG utility is selective and model-dependent: graph context changes outputs, but no-KG outputs also recover substantial graph content from model priors. Compact top-k subgraphs often approximate full-KG behavior, including when claimed-outcome triples are held out. At the same time, compression is not unique to one semantic ranking rule, random and topology-based subsets can also recover much of the signal. These results support a redundancy-aware Compressive KG hypothesis: useful KG signal is often recoverable from compact, scientifically structured subgraphs rather than requiring the full local graph.