Computational Antibody Papers

Filter by tags
All
Filter by published year
All
TitleKey points
    • Protein design method based on Boltz-1.
    • Boltz-1 is an open-source reproduction of AlphaFold3, which uses a diffusion module to co-fold molecular structures (proteins, ligands, etc.).
    • For design purposes, BoltzDesign1 sidesteps the full structure generation step and instead uses only the Pairformer (which outputs a distogram — a probabilistic representation of all pairwise residue distances). This allows broader exploration of sequence space, as it optimizes over the distribution of possible structures rather than committing to a single conformation.
    • Given a target (such as a small molecule or protein), they weakly initialize a binder sequence using random logits. This sequence is then iteratively refined by backpropagating loss through the Pairformer (and optionally through the Confidence module) to increase the predicted quality of the binder–target interaction.
    • A full 3D structure can be generated at the end using the Boltz-1 structure module, but this is not part of the optimization loop.
    • They benchmarked their method in silico on small molecule targets and a set of protein–protein interactions from the BindCraft benchmark, comparing performance to RfDiffusion All-Atom.
  • 2025-04-28

    BindCraft: one-shot design of functional protein binders

    • protein design
    • non-antibody stuff
    • BindCraft is an easy-to-use pipeline for computational protein binder design.
    • It employs AlphaFold2-Multimer to hallucinate binders via backpropagation.
    • Given a target structure and binder parameters (e.g., sequence length), the binder sequence is initialized with random logits and iteratively optimized via gradient descent through the AF2-Multimer network.
    • After binder hallucination, the sequence and surface residues are further optimized using MPNNsol, and AF2-Monomer is used to repredict and filter high-confidence designs.
    • Binder designs were validated experimentally through in vitro assays, X-ray crystallography, and cryo-EM.
    • Reported success rates ranged from 25% to 100%, with most binders in the nanomolar affinity range, a few in the micromolar range, and backbone RMSDs of ~1.7 Å to 3.1 Å between design models and solved structures.
    • CNN surrogate for costly SCM calculations to correlate with viscosity.
    • Developed a shallow CNN (tens of thousands of params) to correlate with calculation of SCM using MD simulations - that are inherently slow.
    • Correlation between CNN and MD-derived is ca. 0.8.
    • The CNN’s output, when translated into a viscosity prediction (via a correlation with SCM score), achieves a reasonably high correlation with experimentally measured viscosity values—again, with correlation coefficients in the range of 0.7 to 0.8.
    • Viscosity prediction of high-concentration antibodies.
    • Antibodies at low volume high concentration are needed for subcutaneous injections, which poses issues when antibodies are high-viscosity.
    • They measured 229 mabs in 20mm histidine H-CL formulation, ph 6.0 at 150 mg/mL
    • 162 were low viscosity (<20 cP) and 67 high viscosity (>20 cP)
    • They employed the dataset to perform binary classification of the high and low viscosity data points
    • They used structural features obtained from DeepSP for featurazition as well as one-hot encoded sequence.
    • They used a set of predictive models, including simple neural networks, random forests, logistic regression and others.
    • Their method performs well on independent test sets achieving >80% accuracy.
    • In comparison to other methods that can be used as proxy (DeepSCM, SHARMA, TAP), their method does the best.
    • Neural network that provides structural features for an antibody sequence that can be used to predict developability.
    • Models spacial charge map and spatial aggregation propensity - properties that are normally obtained by MD simulations.
    • They model ca. 20,000 paired sequences from OAS and perform MD simulations to calculate the properties the canonical way.
    • They use these features as labels to train a small neural network that achieves correlation 0.87 on average for each of the 30 features predicted.
    • Introduced Spatial Charge Map - simple structural descriptor that correlates with viscosity measurements.
    • Prior studies have correlated pronounced negative surface patches on the antibody’s Fv domain with elevated solution viscosity.
    • The SCM score aggregates partial charges from the Fv and correlates them to viscosity readouts.
    • Pfizer Medi and Novartis contributed antibodies to benchmark.
    • A panel of IgG1 antibodies was selected and their viscosities were measured experimentally at high concentrations (around 150 mg/mL) under nearly identical formulation conditions (e.g., pH 5.8, temperature at 25°C).
    • In each of the cases the high viscosity abs had the highest SCM scores showing that this is a good way to go about predicting viscosity.
    • Benchmarking of a proprietary antibody design algorithm.
    • The method generates novel antibodies against a target, for a specific epitopic constraint & can be used to re-design antibodies.
    • Altogether they find good affinity scfv binders for six targets for which they found a complex in the PDB, like PD1 and Her2.
    • The de novo antibody design methods were computationally benchmarked against a curated set of 32 experimentally resolved antibody–antigen complexes using metrics like the G-pass rate and orientation recovery (measured by Fw RMSD). This allowed the authors to compare their method (across different versions) against other approaches.
    • They compare against RFAntibody and dyMEAN but in the computational tasks - reproducing the orientation of an existing antibody.
    • Several rounds of biopanning are employed to enrich for high-affinity, target-specific binders from a pre-designed library and do not involve the introduction of new mutations.
    • They benchmark the developability properties such as monomericity, yield and polyreactivity to show that their antibodies have good properties.
    • They demonstrate that most of their designed binders have less than 50% H3 sequence identity to antibodies in the PDB.
    • Computational details and binders are not given.
    • AbMAP - Language model transfer learning framework with applications to antibody engineering.
    • Authors address the process of dichotomy of language models in antibodies - either one uses a bare-bones protein model like ESM or only antibody model like Antiberty/IgLM. Normal protein models will not capture hypervariability of CDRs whereas antibody models would focus too much on the framework. They focus solely on CDRs + flanking regions as a solution.
    • They show their applicability to three off the shelf models with structure template finding as well as low-n generative modeling.
    • Novel generative model for antibodies that allows one to fill in, inpaint inverse fold etc.
    • The model employs Bayesian Flow Networks which is somewhat similar to diffusion.
    • The model is trained on data from OAS - unpaired data as a first pass and paired data as a second pass.
    • Models are benchmarked on a range of computational metrics, chiefly sequence recovery (for infiling/inverse folding).
    • Developability is checked by computational prediction of solubility (CamSol) and humanness (AbNativ)
  • 2025-03-31

    AI-Augmented Physics-Based Docking for Antibody-Antigen Complex Prediction

    • epitope prediction
    • docking
    • structure prediction
    • Benchmarking of the structure prediction/docking and co-folding methods for antibody design
    • Authors measure the impact of antibody-antigen model quality on the success rate of epitope prediction and antibody design.
    • For epitope prediction and antibody design they use a proxy measure of DockQ score - they call success when DockQ is better than 0.23, for antibody design they use a stricter threshold of 0.49.
    • Using these measures, AlphaFold3 comes out on top, and it would be successful roughly ~47% times.
    • THey introduce an approach where ProPOSE and ZDOCK decoys are refined using AlphaFold. With this combined protocol they reach success rates of 35% for epitope mapping and 30% for antibody design.