Saved in:
Bibliographic Details
Main Authors: Możejko, Marcin, Bielecki, Adam, Prądzyński, Jurand, Traskowski, Marcin, Janowski, Antoni, Lee, Hyun-Su, Torres, Marcelo Der Torossian, Kmicikiewicz, Michał, Szymczak, Paulina, Jurasz, Karol, Kucharczyk, Michał, de la Fuente-Nunez, Cesar, Szczurek, Ewa
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2510.01988
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866915815073251328
author Możejko, Marcin
Bielecki, Adam
Prądzyński, Jurand
Traskowski, Marcin
Janowski, Antoni
Lee, Hyun-Su
Torres, Marcelo Der Torossian
Kmicikiewicz, Michał
Szymczak, Paulina
Jurasz, Karol
Kucharczyk, Michał
de la Fuente-Nunez, Cesar
Szczurek, Ewa
author_facet Możejko, Marcin
Bielecki, Adam
Prądzyński, Jurand
Traskowski, Marcin
Janowski, Antoni
Lee, Hyun-Su
Torres, Marcelo Der Torossian
Kmicikiewicz, Michał
Szymczak, Paulina
Jurasz, Karol
Kucharczyk, Michał
de la Fuente-Nunez, Cesar
Szczurek, Ewa
contents Antimicrobial peptide discovery is challenged by the astronomical size of peptide space and the relative scarcity of active peptides. Generative models provide continuous latent "maps" of peptide space, but conventionally ignore decoder-induced geometry and rely on flat Euclidean metrics, rendering exploration and optimization distorted and inefficient. Prior manifold-based remedies assume fixed intrinsic dimensionality, which critically fails in practice for peptide data. Here, we introduce PepCompass, a geometry-aware framework for peptide exploration and optimization. At its core, we define a Union of $κ$-Stable Riemannian Manifolds $\mathbb{M}^κ$, a family of decoder-induced manifolds that captures local geometry while ensuring computational stability. We propose two local exploration methods: Second-Order Riemannian Brownian Efficient Sampling, which provides a convergent second-order approximation to Riemannian Brownian motion, and Mutation Enumeration in Tangent Space, which reinterprets tangent directions as discrete amino-acid substitutions. Combining these yields Local Enumeration Bayesian Optimization (LE-BO), an efficient algorithm for local activity optimization. Finally, we introduce Potential-minimizing Geodesic Search (PoGS), which interpolates between prototype embeddings along property-enriched geodesics, biasing discovery toward seeds, i.e. peptides with favorable activity. In-vitro validation confirms the effectiveness of PepCompass: PoGS yields four novel seeds, and subsequent optimization with LE-BO discovers 25 highly active peptides with broad-spectrum activity, including against resistant bacterial strains. These results demonstrate that geometry-informed exploration provides a powerful new paradigm for antimicrobial peptide design.
format Preprint
id arxiv_https___arxiv_org_abs_2510_01988
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle PepCompass: Navigating peptide embedding spaces using Riemannian Geometry
Możejko, Marcin
Bielecki, Adam
Prądzyński, Jurand
Traskowski, Marcin
Janowski, Antoni
Lee, Hyun-Su
Torres, Marcelo Der Torossian
Kmicikiewicz, Michał
Szymczak, Paulina
Jurasz, Karol
Kucharczyk, Michał
de la Fuente-Nunez, Cesar
Szczurek, Ewa
Machine Learning
Antimicrobial peptide discovery is challenged by the astronomical size of peptide space and the relative scarcity of active peptides. Generative models provide continuous latent "maps" of peptide space, but conventionally ignore decoder-induced geometry and rely on flat Euclidean metrics, rendering exploration and optimization distorted and inefficient. Prior manifold-based remedies assume fixed intrinsic dimensionality, which critically fails in practice for peptide data. Here, we introduce PepCompass, a geometry-aware framework for peptide exploration and optimization. At its core, we define a Union of $κ$-Stable Riemannian Manifolds $\mathbb{M}^κ$, a family of decoder-induced manifolds that captures local geometry while ensuring computational stability. We propose two local exploration methods: Second-Order Riemannian Brownian Efficient Sampling, which provides a convergent second-order approximation to Riemannian Brownian motion, and Mutation Enumeration in Tangent Space, which reinterprets tangent directions as discrete amino-acid substitutions. Combining these yields Local Enumeration Bayesian Optimization (LE-BO), an efficient algorithm for local activity optimization. Finally, we introduce Potential-minimizing Geodesic Search (PoGS), which interpolates between prototype embeddings along property-enriched geodesics, biasing discovery toward seeds, i.e. peptides with favorable activity. In-vitro validation confirms the effectiveness of PepCompass: PoGS yields four novel seeds, and subsequent optimization with LE-BO discovers 25 highly active peptides with broad-spectrum activity, including against resistant bacterial strains. These results demonstrate that geometry-informed exploration provides a powerful new paradigm for antimicrobial peptide design.
title PepCompass: Navigating peptide embedding spaces using Riemannian Geometry
topic Machine Learning
url https://arxiv.org/abs/2510.01988