Saved in:
Bibliographic Details
Main Author: Podstawski, Michal
Format: Preprint
Published: 2026
Subjects:
Online Access:https://arxiv.org/abs/2604.18092
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866917422544453632
author Podstawski, Michal
author_facet Podstawski, Michal
contents Small language models fine-tuned for graph property estimation have demonstrated strong in-distribution performance, yet their generalization capabilities beyond training conditions remain poorly understood. In this work, we systematically investigate the boundaries of structural inference in fine-tuned small language models along two generalization axes - graph size and graph family distribution - and assess domain-learning capability on real-world graph benchmarks. Using a controlled experimental setup with three instruction-tuned models in the 3-4B parameter class and two graph serialization formats, we evaluate performance on graphs substantially larger than the training range and across held-out random graph families. Our results show that fine-tuned models maintain strong ordinal consistency across structurally distinct graph families and continue to rank graphs by structural properties on inputs substantially larger than those seen during training, with distinct architecture-specific degradation profiles. These findings delineate where fine-tuned small language models generalize reliably, providing empirical grounding for their use in graph-based reasoning tasks.
format Preprint
id arxiv_https___arxiv_org_abs_2604_18092
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle Generalization Boundaries of Fine-Tuned Small Language Models for Graph Structural Inference
Podstawski, Michal
Machine Learning
Small language models fine-tuned for graph property estimation have demonstrated strong in-distribution performance, yet their generalization capabilities beyond training conditions remain poorly understood. In this work, we systematically investigate the boundaries of structural inference in fine-tuned small language models along two generalization axes - graph size and graph family distribution - and assess domain-learning capability on real-world graph benchmarks. Using a controlled experimental setup with three instruction-tuned models in the 3-4B parameter class and two graph serialization formats, we evaluate performance on graphs substantially larger than the training range and across held-out random graph families. Our results show that fine-tuned models maintain strong ordinal consistency across structurally distinct graph families and continue to rank graphs by structural properties on inputs substantially larger than those seen during training, with distinct architecture-specific degradation profiles. These findings delineate where fine-tuned small language models generalize reliably, providing empirical grounding for their use in graph-based reasoning tasks.
title Generalization Boundaries of Fine-Tuned Small Language Models for Graph Structural Inference
topic Machine Learning
url https://arxiv.org/abs/2604.18092