Saved in:
Bibliographic Details
Main Authors: Ahmed, Mansoor, Taj, Nadeem, Khan, Imdad Ullah, Venkateswara, Hemanth, Patterson, Murray
Format: Preprint
Published: 2026
Subjects:
Online Access:https://arxiv.org/abs/2603.13431
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866908927467192320
author Ahmed, Mansoor
Taj, Nadeem
Khan, Imdad Ullah
Venkateswara, Hemanth
Patterson, Murray
author_facet Ahmed, Mansoor
Taj, Nadeem
Khan, Imdad Ullah
Venkateswara, Hemanth
Patterson, Murray
contents Computational antibody design has seen rapid methodological progress, with dozens of deep generative methods proposed in the past three years, yet the field lacks a standardized benchmark for fair comparison and model development. These methods are evaluated on different SAbDab snapshots, non-overlapping test sets, and incompatible metrics, and the literature fragments the design problem into numerous sub-tasks with no common definition. We introduce \textsc{Chimera-Bench} (\textbf{C}DR \textbf{M}odeling with \textbf{E}pitope-guided \textbf{R}edesign), a unified benchmark built around a single canonical task: \emph{epitope-conditioned CDR sequence-structure co-design}. \textsc{Chimera-Bench} provides (1) a curated, deduplicated dataset of \textbf{2,922} antibody-antigen complexes with epitope and paratope annotations; (2) three biologically motivated splits testing generalization to unseen epitopes, unseen antigen folds, and prospective temporal targets; and (3) a comprehensive evaluation protocol with five metric groups including novel epitope-specificity measures. We benchmark representative methods spanning different generative paradigms and report results across all splits. \textsc{Chimera-Bench} is the largest dataset of its kind for the antibody design problem, allowing the community to develop and test novel methods and evaluate their generalizability. The source code and data are available at: https://github.com/mansoor181/chimera-bench.git
format Preprint
id arxiv_https___arxiv_org_abs_2603_13431
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle CHIMERA-Bench: A Benchmark Dataset for Epitope-Specific Antibody Design
Ahmed, Mansoor
Taj, Nadeem
Khan, Imdad Ullah
Venkateswara, Hemanth
Patterson, Murray
Machine Learning
Artificial Intelligence
Computational antibody design has seen rapid methodological progress, with dozens of deep generative methods proposed in the past three years, yet the field lacks a standardized benchmark for fair comparison and model development. These methods are evaluated on different SAbDab snapshots, non-overlapping test sets, and incompatible metrics, and the literature fragments the design problem into numerous sub-tasks with no common definition. We introduce \textsc{Chimera-Bench} (\textbf{C}DR \textbf{M}odeling with \textbf{E}pitope-guided \textbf{R}edesign), a unified benchmark built around a single canonical task: \emph{epitope-conditioned CDR sequence-structure co-design}. \textsc{Chimera-Bench} provides (1) a curated, deduplicated dataset of \textbf{2,922} antibody-antigen complexes with epitope and paratope annotations; (2) three biologically motivated splits testing generalization to unseen epitopes, unseen antigen folds, and prospective temporal targets; and (3) a comprehensive evaluation protocol with five metric groups including novel epitope-specificity measures. We benchmark representative methods spanning different generative paradigms and report results across all splits. \textsc{Chimera-Bench} is the largest dataset of its kind for the antibody design problem, allowing the community to develop and test novel methods and evaluate their generalizability. The source code and data are available at: https://github.com/mansoor181/chimera-bench.git
title CHIMERA-Bench: A Benchmark Dataset for Epitope-Specific Antibody Design
topic Machine Learning
Artificial Intelligence
url https://arxiv.org/abs/2603.13431