AlphaBind, a domain-specific model to predict and optimize antibody–antigen binding affinity

Antibodies are versatile therapeutic molecules that use combinatorial sequence diversity to cover a vast fitness landscape. Designing optimal antibody sequences, however, remains a major challenge. Recent advances in deep learning provide opportunities to address this challenge by learning sequence–...

Full description

Saved in:
Bibliographic Details
Main Authors: Aditya A. Agarwal, James Harrang, David Noble, Kerry L. McGowan, Adrian W. Lange, Emily Engelhart, Miranda C. Lahman, Jeffrey Adamo, Xin Yu, Oliver Serang, Kyle J. Minch, Kimberly Y. Wellman, David A. Younger, Randolph M. Lopez, Ryan O. Emerson
Format: Article
Language:English
Published: Taylor & Francis Group 2025-12-01
Series:mAbs
Subjects:
Online Access:https://www.tandfonline.com/doi/10.1080/19420862.2025.2534626
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Antibodies are versatile therapeutic molecules that use combinatorial sequence diversity to cover a vast fitness landscape. Designing optimal antibody sequences, however, remains a major challenge. Recent advances in deep learning provide opportunities to address this challenge by learning sequence–function relationships to accurately predict fitness landscapes. These models enable efficient in silico prescreening and optimization of antibody candidates. By focusing experimental efforts on the most promising candidates guided by deep learning predictions, antibodies with optimal properties can be designed more quickly and effectively. Here we present AlphaBind, a domain-specific model that uses protein language model embeddings and pre-training on millions of quantitative laboratory measurements of antibody–antigen binding strength to achieve state-of-the-art performance for guided affinity optimization of parental antibodies. We demonstrate that an AlphaBind-powered antibody optimization pipeline can deliver candidates with substantially improved binding affinity across four parental antibodies (some of which were already affinity-matured) and using two different types of training data. The resulting candidates, which include up to 11 mutations from parental sequence, yield a sequence diversity that allows optimization of other biophysical characteristics, all while using only a single round of data generation for each parental antibody. AlphaBind weights and code are publicly available at: https://github.com/A-Alpha-Bio/alphabind.
ISSN:1942-0862
1942-0870