Saved in:
Bibliographic Details
Main Authors: Qiu, Mufan, Hu, Xinyu, Zhan, Fengwei, Yun, Sukwon, Peng, Jie, Zhang, Ruichen, Kailkhura, Bhavya, Yang, Jiekun, Chen, Tianlong
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2503.01682
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866915180494979072
author Qiu, Mufan
Hu, Xinyu
Zhan, Fengwei
Yun, Sukwon
Peng, Jie
Zhang, Ruichen
Kailkhura, Bhavya
Yang, Jiekun
Chen, Tianlong
author_facet Qiu, Mufan
Hu, Xinyu
Zhan, Fengwei
Yun, Sukwon
Peng, Jie
Zhang, Ruichen
Kailkhura, Bhavya
Yang, Jiekun
Chen, Tianlong
contents Foundation models for single-cell RNA sequencing (scRNA-seq) have shown promising capabilities in capturing gene expression patterns. However, current approaches face critical limitations: they ignore biological prior knowledge encoded in gene regulatory relationships and fail to leverage multi-omics signals that could provide complementary regulatory insights. In this paper, we propose GRNFormer, a new framework that systematically integrates multi-scale Gene Regulatory Networks (GRNs) inferred from multi-omics data into RNA foundation model training. Our framework introduces two key innovations. First, we introduce a pipeline for constructing hierarchical GRNs that capture regulatory relationships at both cell-type-specific and cell-specific resolutions. Second, we design a structure-aware integration framework that addresses the information asymmetry in GRNs through two technical advances: (1) A graph topological adapter using multi-head cross-attention to weight regulatory relationships dynamically, and (2) a novel edge perturbation strategy that perturb GRNs with biologically-informed co-expression links to augment graph neural network training. Comprehensive experiments have been conducted on three representative downstream tasks across multiple model architectures to demonstrate the effectiveness of GRNFormer. It achieves consistent improvements over state-of-the-art (SoTA) baselines: $3.6\%$ increase in drug response prediction correlation, $9.6\%$ improvement in single-cell drug classification AUC, and $1.1\%$ average gain in gene perturbation prediction accuracy.
format Preprint
id arxiv_https___arxiv_org_abs_2503_01682
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle GRNFormer: A Biologically-Guided Framework for Integrating Gene Regulatory Networks into RNA Foundation Models
Qiu, Mufan
Hu, Xinyu
Zhan, Fengwei
Yun, Sukwon
Peng, Jie
Zhang, Ruichen
Kailkhura, Bhavya
Yang, Jiekun
Chen, Tianlong
Machine Learning
Foundation models for single-cell RNA sequencing (scRNA-seq) have shown promising capabilities in capturing gene expression patterns. However, current approaches face critical limitations: they ignore biological prior knowledge encoded in gene regulatory relationships and fail to leverage multi-omics signals that could provide complementary regulatory insights. In this paper, we propose GRNFormer, a new framework that systematically integrates multi-scale Gene Regulatory Networks (GRNs) inferred from multi-omics data into RNA foundation model training. Our framework introduces two key innovations. First, we introduce a pipeline for constructing hierarchical GRNs that capture regulatory relationships at both cell-type-specific and cell-specific resolutions. Second, we design a structure-aware integration framework that addresses the information asymmetry in GRNs through two technical advances: (1) A graph topological adapter using multi-head cross-attention to weight regulatory relationships dynamically, and (2) a novel edge perturbation strategy that perturb GRNs with biologically-informed co-expression links to augment graph neural network training. Comprehensive experiments have been conducted on three representative downstream tasks across multiple model architectures to demonstrate the effectiveness of GRNFormer. It achieves consistent improvements over state-of-the-art (SoTA) baselines: $3.6\%$ increase in drug response prediction correlation, $9.6\%$ improvement in single-cell drug classification AUC, and $1.1\%$ average gain in gene perturbation prediction accuracy.
title GRNFormer: A Biologically-Guided Framework for Integrating Gene Regulatory Networks into RNA Foundation Models
topic Machine Learning
url https://arxiv.org/abs/2503.01682