Bibliographic Details
Main Authors: Hsu, Wei-Tse; Grevtsev, Savva; Douglas, Thomas; Magarkar, Aniket; Biggin, Philip C.
Format: Preprint
Published: 2025
Online Access: https://arxiv.org/abs/2507.07882
Table of Contents:
  • We evaluate the feasibility of using co-folding models for synthetic data augmentation in training machine learning-based scoring functions (MLSFs) for binding affinity prediction. Our results show that performance gains depend critically on the structural quality of the augmented data. In light of this, we establish simple heuristics for identifying high-quality co-folding predictions without reference structures, enabling these predictions to substitute for experimental structures in MLSF training. Our study informs future data augmentation strategies based on co-folding models.