Saved in:
Bibliographic Details
Main Authors: Wang, Zhe, Liu, Zijing, Xu, Chencheng, Yao, Yuan
Format: Preprint
Published: 2026
Subjects:
Online Access:https://arxiv.org/abs/2602.05479
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866912881014996992
author Wang, Zhe
Liu, Zijing
Xu, Chencheng
Yao, Yuan
author_facet Wang, Zhe
Liu, Zijing
Xu, Chencheng
Yao, Yuan
contents Drug discovery remains time-consuming, labor-intensive, and expensive, often requiring years and substantial investment per drug candidate. Predicting compound-protein interactions (CPIs) is a critical component in this process, enabling the identification of molecular interactions between drug candidates and target proteins. Recent deep learning methods have successfully modeled CPIs at the atomic level, achieving improved efficiency and accuracy over traditional energy-based approaches. However, these models do not always align with chemical realities, as molecular fragments (motifs or functional groups) typically serve as the primary units of biological recognition and binding. In this paper, we propose Phi-former, a pairwise hierarchical interaction representation learning method that addresses this gap by incorporating the biological role of motifs in CPIs. Phi-former represents compounds and proteins hierarchically and employs a pairwise pre-training framework to model interactions systematically across atom-atom, motif-motif, and atom-motif levels, reflecting how biological systems recognize molecular partners. We design intra-level and inter-level learning pipelines that make different interaction levels mutually beneficial. Experimental results demonstrate that Phi-former achieves superior performance on CPI-related tasks. A case study shows that our method accurately identifies specific atoms or motifs activated in CPIs, providing interpretable model explanations. These insights may guide rational drug design and support precision medicine applications.
format Preprint
id arxiv_https___arxiv_org_abs_2602_05479
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle Phi-Former: A Pairwise Hierarchical Approach for Compound-Protein Interactions Prediction
Wang, Zhe
Liu, Zijing
Xu, Chencheng
Yao, Yuan
Artificial Intelligence
Drug discovery remains time-consuming, labor-intensive, and expensive, often requiring years and substantial investment per drug candidate. Predicting compound-protein interactions (CPIs) is a critical component in this process, enabling the identification of molecular interactions between drug candidates and target proteins. Recent deep learning methods have successfully modeled CPIs at the atomic level, achieving improved efficiency and accuracy over traditional energy-based approaches. However, these models do not always align with chemical realities, as molecular fragments (motifs or functional groups) typically serve as the primary units of biological recognition and binding. In this paper, we propose Phi-former, a pairwise hierarchical interaction representation learning method that addresses this gap by incorporating the biological role of motifs in CPIs. Phi-former represents compounds and proteins hierarchically and employs a pairwise pre-training framework to model interactions systematically across atom-atom, motif-motif, and atom-motif levels, reflecting how biological systems recognize molecular partners. We design intra-level and inter-level learning pipelines that make different interaction levels mutually beneficial. Experimental results demonstrate that Phi-former achieves superior performance on CPI-related tasks. A case study shows that our method accurately identifies specific atoms or motifs activated in CPIs, providing interpretable model explanations. These insights may guide rational drug design and support precision medicine applications.
title Phi-Former: A Pairwise Hierarchical Approach for Compound-Protein Interactions Prediction
topic Artificial Intelligence
url https://arxiv.org/abs/2602.05479