Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Navarro, Carles, Tholke, Philipp, de Fabritiis, Gianni
Format:	Preprint
Published:	2026
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2604.19562
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866914495365906432
author	Navarro, Carles Tholke, Philipp de Fabritiis, Gianni
author_facet	Navarro, Carles Tholke, Philipp de Fabritiis, Gianni
contents	Structure-based drug discovery faces the dual challenge of accurately capturing 3D protein-ligand interactions while navigating ultra-large chemical spaces to identify synthetically accessible candidates. In this work, we present a unified framework that addresses these challenges by combining contrastive 3D structure encoding with autoregressive molecular generation conditioned on commercial compound spaces. First, we introduce an SE(3)-equivariant transformer that encodes ligand and pocket structures into a shared embedding space via contrastive learning, achieving competitive results in zero-shot virtual screening. Second, we integrate these embeddings into a multimodal Chemical Language Model (MCLM). The model generates target-specific molecules conditioned on either pocket or ligand structures, with a learned dataset token that steers the output toward targeted chemical spaces, yielding candidates with favorable predicted binding properties across diverse targets.
format	Preprint
id	arxiv_https___arxiv_org_abs_2604_19562
institution	arXiv
publishDate	2026
record_format	arxiv
spellingShingle	Structure-guided molecular design with contrastive 3D protein-ligand learning Navarro, Carles Tholke, Philipp de Fabritiis, Gianni Machine Learning Structure-based drug discovery faces the dual challenge of accurately capturing 3D protein-ligand interactions while navigating ultra-large chemical spaces to identify synthetically accessible candidates. In this work, we present a unified framework that addresses these challenges by combining contrastive 3D structure encoding with autoregressive molecular generation conditioned on commercial compound spaces. First, we introduce an SE(3)-equivariant transformer that encodes ligand and pocket structures into a shared embedding space via contrastive learning, achieving competitive results in zero-shot virtual screening. Second, we integrate these embeddings into a multimodal Chemical Language Model (MCLM). The model generates target-specific molecules conditioned on either pocket or ligand structures, with a learned dataset token that steers the output toward targeted chemical spaces, yielding candidates with favorable predicted binding properties across diverse targets.
title	Structure-guided molecular design with contrastive 3D protein-ligand learning
topic	Machine Learning
url	https://arxiv.org/abs/2604.19562

Similar Items