Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Lin, Yuxi, Fang, Yaxue, Zhang, Zehong, Liu, Zhouwu, Zhong, Siyun, Wang, Zhongfang, Yu, Fulong
Format:	Preprint
Published:	2025
Subjects:	Quantitative Methods Artificial Intelligence
Online Access:	https://arxiv.org/abs/2507.16801
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866915816927133696
author	Lin, Yuxi Fang, Yaxue Zhang, Zehong Liu, Zhouwu Zhong, Siyun Wang, Zhongfang Yu, Fulong
author_facet	Lin, Yuxi Fang, Yaxue Zhang, Zehong Liu, Zhouwu Zhong, Siyun Wang, Zhongfang Yu, Fulong
contents	Understanding how 5' untranslated regions (5'UTRs) regulate mRNA translation is critical for controlling protein expression and designing effective therapeutic mRNAs. While recent deep learning models have shown promise in predicting translational efficiency from 5'UTR sequences, most are constrained by fixed input lengths and limited interpretability. We introduce UTR-STCNet, a Transformer-based architecture for flexible and biologically grounded modeling of variable-length 5'UTRs. UTR-STCNet integrates a Saliency-Aware Token Clustering (SATC) module that iteratively aggregates nucleotide tokens into multi-scale, semantically meaningful units based on saliency scores. A Saliency-Guided Transformer (SGT) block then captures both local and distal regulatory dependencies using a lightweight attention mechanism. This combined architecture achieves efficient and interpretable modeling without input truncation or increased computational cost. Evaluated across three benchmark datasets, UTR-STCNet consistently outperforms state-of-the-art baselines in predicting mean ribosome load (MRL), a key proxy for translational efficiency. Moreover, the model recovers known functional elements such as upstream AUGs and Kozak motifs, highlighting its potential for mechanistic insight into translation regulation.
format	Preprint
id	arxiv_https___arxiv_org_abs_2507_16801
institution	arXiv
publishDate	2025
record_format	arxiv
spellingShingle	Decoding Translation-Related Functional Sequences in 5'UTRs Using Interpretable Deep Learning Models Lin, Yuxi Fang, Yaxue Zhang, Zehong Liu, Zhouwu Zhong, Siyun Wang, Zhongfang Yu, Fulong Quantitative Methods Artificial Intelligence Understanding how 5' untranslated regions (5'UTRs) regulate mRNA translation is critical for controlling protein expression and designing effective therapeutic mRNAs. While recent deep learning models have shown promise in predicting translational efficiency from 5'UTR sequences, most are constrained by fixed input lengths and limited interpretability. We introduce UTR-STCNet, a Transformer-based architecture for flexible and biologically grounded modeling of variable-length 5'UTRs. UTR-STCNet integrates a Saliency-Aware Token Clustering (SATC) module that iteratively aggregates nucleotide tokens into multi-scale, semantically meaningful units based on saliency scores. A Saliency-Guided Transformer (SGT) block then captures both local and distal regulatory dependencies using a lightweight attention mechanism. This combined architecture achieves efficient and interpretable modeling without input truncation or increased computational cost. Evaluated across three benchmark datasets, UTR-STCNet consistently outperforms state-of-the-art baselines in predicting mean ribosome load (MRL), a key proxy for translational efficiency. Moreover, the model recovers known functional elements such as upstream AUGs and Kozak motifs, highlighting its potential for mechanistic insight into translation regulation.
title	Decoding Translation-Related Functional Sequences in 5'UTRs Using Interpretable Deep Learning Models
topic	Quantitative Methods Artificial Intelligence
url	https://arxiv.org/abs/2507.16801

Similar Items