Blog

How to Interpret AlphaFold Confidence Scores for Multi-Chain Complexes

Jan 21, 2026

You ran AlphaFold-Multimer on your protein complex. The structure looks beautiful—two chains wrapped around each other in an intricate dance. The pLDDT scores are deep blue (>90) across both proteins. You're ready to publish. But wait: those chains might not actually interact. That beautiful interface might be a computational artifact.


Welcome to the most misunderstood aspect of AlphaFold: confidence scores for complexes are fundamentally different from monomers, and misreading them leads to wrong biological conclusions.

Key Takeaways

  • pLDDT measures local confidence, not whether an interface is real

  • PAE (Predicted Aligned Error) is the critical metric for complex interactions

  • Low inter-chain PAE (< 10 Å) indicates confident interaction; high PAE (> 20 Å) means AlphaFold is guessing

  • Common trap: Two well-folded chains placed near each other ≠ biological complex

  • Validation: Always check PAE matrices before trusting complex predictions

The Two Confidence Metrics: pLDDT vs PAE

AlphaFold reports two confidence metrics. For monomers, pLDDT is usually sufficient. For complexes, PAE becomes essential.

pLDDT: Per-Residue Confidence

What it measures: How confident is AlphaFold in the local structure around each residue? The pLDDT (predicted Local Distance Difference Test) provides a per-residue confidence score on a scale from 0 to 100 (Jumper et al., 2021).


The scale (EBI AlphaFold Training):

  • > 90 (deep blue): Very high confidence. Backbone and side chains likely accurate.

  • 70-90 (light blue): Confident. Backbone reliable, some side chain uncertainty.

  • 50-70 (yellow): Low confidence. Structure may be approximately correct.

  • < 50 (orange/red): Very low confidence. Often indicates intrinsic disorder.


What pLDDT tells you:

  • Whether each domain is well-folded

  • Which regions are structured vs disordered

  • Where the model is uncertain about local conformation


What pLDDT does NOT tell you:

  • Whether two domains are correctly positioned relative to each other

  • Whether an interface between chains is real

  • Whether the complex actually forms in biology


The trap: A complex can have pLDDT > 90 everywhere and still be completely wrong about the interface.

PAE: Predicted Aligned Error

What it measures: The PAE (Predicted Aligned Error) provides confidence in the relative position of two residues—if you align the structure on residue X, how confident is AlphaFold about the position of residue Y? (Evans et al., 2022)


The scale:

  • < 5 Å: Very high confidence in relative positioning

  • 5-10 Å: High confidence

  • 10-20 Å: Moderate confidence

  • > 20 Å: Low confidence—positions are essentially uncertain


How to read PAE matrices:


The PAE is displayed as a heatmap where:

  • X-axis: Residue being aligned on (the "reference")

  • Y-axis: Residue whose position is being evaluated

  • Color: Expected error in position (blue = low error = confident)


For a monomer:

  • The matrix is symmetric

  • Well-folded domains show blue blocks along the diagonal

  • Flexible linkers between domains show high PAE (red/yellow) between the domain blocks


For a complex:

  • The matrix has blocks for each chain

  • On-diagonal blocks: Intra-chain confidence (each chain's internal structure)

  • Off-diagonal blocks: Inter-chain confidence (how chains relate to each other)

Reading PAE for Complexes: The Critical Skill

What a Real Interaction Looks Like

When AlphaFold confidently predicts that two chains interact:


PAE matrix signature:

  • Off-diagonal blocks are predominantly blue (< 10 Å)

  • The pattern is often asymmetric but shows clear low-error regions

  • Interface residues show particularly low PAE to each other


Example: Antibody-Antigen Complex

        Chain A (Heavy)  Chain B (Light)  Chain C (Antigen)
Heavy      [BLUE]          [BLUE]           [BLUE at CDRs]
Light      [BLUE]          [BLUE]           [BLUE at CDRs]
Antigen    [BLUE at epi]   [BLUE at epi]    [BLUE]

The off-diagonal blue regions indicate where AlphaFold is confident about the interface. The CDRs of the antibody show low PAE to the epitope on the antigen.

What a Fake Interaction Looks Like

When AlphaFold doesn't have evolutionary evidence for an interaction, but places chains near each other anyway:


PAE matrix signature:

  • Off-diagonal blocks are predominantly red/yellow (> 20 Å)

  • Each chain's diagonal block may be blue (chains fold well individually)

  • But the inter-chain regions show no confidence


Example: Two Unrelated Proteins Forced Together

        Chain A          Chain B
A         [BLUE]          [RED/YELLOW]
B         [RED/YELLOW]    [BLUE]

Both chains are well-folded (blue diagonals), but AlphaFold has no confidence in their relative positions (red off-diagonals). This complex is likely an artifact.


The danger: The 3D visualization still shows the chains in proximity. Without checking PAE, you might think they interact.

Common Misinterpretations (And How to Avoid Them)

Mistake 1: Trusting pLDDT Alone

The scenario:

  • You predict a heterodimer

  • Both chains have pLDDT > 85

  • You conclude the complex is well-predicted


The reality:

  • High pLDDT means both chains fold well individually

  • It says nothing about whether they interact

  • The chains might be randomly oriented


The fix: Always check the inter-chain PAE. If it's > 15-20 Å, the relative orientation is not reliable.

Mistake 2: Ignoring Asymmetry in PAE

The scenario:

  • Chain A to Chain B PAE is low (5 Å)

  • Chain B to Chain A PAE is high (25 Å)

  • You're confused


What this means:

  • PAE measures: "If I align on X, how accurate is Y?"

  • Asymmetry often occurs when one chain is well-anchored and the other is flexible

  • The interaction might be real, but one partner has conformational flexibility


The fix: Look for regions where BOTH directions show low PAE. These are the confident interface contacts.

Mistake 3: Expecting Perfect Blue Blocks

The scenario:

  • Your complex shows patchy blue in the off-diagonal regions

  • It's not a clean block like within-chain regions

  • You're unsure if the interaction is real


The reality:

  • Interfaces are often smaller than full domains

  • A 20-residue interface will show a small blue patch, not a large block

  • Patchy blue at specific positions often indicates real contacts


The fix: Look for consistent low PAE at specific residue pairs, not overall blue blocks. Map these residues to the structure—they should be at the interface.

Mistake 4: Confusing Low PAE with Binding Affinity

The scenario:

  • Your complex shows beautiful low PAE at the interface

  • You conclude it's a high-affinity interaction


The reality:

  • PAE measures prediction confidence, not binding strength

  • A transient, low-affinity interaction can show low PAE if there's evolutionary evidence

  • A high-affinity interaction might show high PAE if it's novel or poorly represented in sequences


The fix: PAE tells you "AlphaFold is confident this is the structure." It doesn't tell you "this interaction is strong." Use experimental validation (SPR, ITC) for affinity.

The Interface Confidence Checklist

Before trusting an AlphaFold complex prediction, verify:

Step 1: Check Inter-Chain PAE

  • [ ] Off-diagonal PAE blocks are < 15 Å for key regions

  • [ ] Interface residues specifically show low PAE to each other

  • [ ] Pattern is consistent (not just a few random low-PAE spots)

Step 2: Evaluate Interface Quality

  • [ ] Interface area is reasonable (> 500 Ų for stable complexes)

  • [ ] Hydrophobic contacts are present at the interface

  • [ ] No severe clashes at the interface

Step 3: Check Biological Plausibility

  • [ ] Do these proteins co-localize in the cell?

  • [ ] Is there experimental evidence of interaction (Co-IP, yeast two-hybrid)?

  • [ ] Are there homologous complexes with known structures?

Step 4: Consider Alternative Conformations

  • [ ] Did you try multiple seeds? (AlphaFold can give different answers)

  • [ ] Are there known conformational states you should sample?

  • [ ] Would MSA subsampling reveal alternative interfaces?

Quantifying Interface Confidence

Beyond visual inspection, you can quantify interface confidence:

Interface PAE Score

Calculate the mean PAE for all residue pairs at the interface:

  1. Identify interface residues (< 8 Å between chains)

  2. Extract PAE values for all inter-chain pairs at interface

  3. Calculate mean


Interpretation:

  • Mean interface PAE < 8 Å: High confidence interface

  • Mean interface PAE 8-15 Å: Moderate confidence

  • Mean interface PAE > 15 Å: Low confidence—be skeptical

pTM and ipTM Scores

AlphaFold-Multimer reports:


pTM (predicted TM-score):

  • Overall structural accuracy metric

  • Range: 0-1 (higher is better)

  • pTM > 0.5 suggests reasonable overall structure


ipTM (interface predicted TM-score) (Bryant et al., 2022):

  • Specifically measures interface quality

  • This is the key metric for complexes

  • ipTM > 0.8: Confident interface

  • ipTM 0.6-0.8: Moderate confidence

  • ipTM < 0.6: Low confidence


Ranking models: If you generate multiple models (recommended), rank by:

ranking_score = 0.8 × ipTM + 0.2 × pTM

This weights interface quality appropriately for complex predictions.

Case Study: The Deceptive Dimer

The Setup

A research group wanted to understand how two signaling proteins interact. They:

  1. Ran AlphaFold-Multimer with both sequences

  2. Got a beautiful dimer structure

  3. Observed pLDDT > 85 across both chains

  4. Saw the proteins "touching" in the 3D view

  5. Designed mutations at the putative interface

  6. Spent 6 months making and testing mutants

The Result

None of the mutations disrupted the interaction in biochemical assays. The proteins still co-immunoprecipitated. Pull-downs still worked.

What Went Wrong

When they finally checked the PAE matrix:

  • Chain A internal: PAE < 5 Å (blue)

  • Chain B internal: PAE < 5 Å (blue)

  • Chain A to Chain B: PAE > 25 Å (red)

  • Chain B to Chain A: PAE > 25 Å (red)


AlphaFold was confident about each protein individually, but had no idea how they interacted. The dimer structure was essentially random—AlphaFold placed two well-folded proteins near each other, but the interface was imaginary.

The Lesson

The "interface" mutations weren't at the real interface. They were at a computationally-generated artifact. The real interface was elsewhere entirely—and required experimental structure determination to identify.


Time lost: 6 months of mutagenesis and biochemistry How to prevent: 10 minutes checking the PAE matrix

When Can You Trust Complex Predictions?

AlphaFold-Multimer works best when:

Strong Evolutionary Signal

The ideal case:

  • Both proteins are from a well-studied family

  • Homologous complexes exist in the PDB

  • Deep MSAs are available for both proteins

  • Co-evolution signal exists between chains


Examples that work well:

  • Antibody-antigen (many structures available)

  • GPCR-G protein (conserved interface)

  • Enzyme-inhibitor pairs (co-evolved)

  • Ribosomal complexes (extremely conserved)

Weak Evolutionary Signal

The challenging case:

  • Novel protein pairs

  • Lineage-specific interactions

  • Transient or conditional interactions

  • Recently evolved interfaces


Examples that often fail:

  • Two human proteins that only interact in humans

  • Pathogen-host interfaces (rapid evolution)

  • Moonlighting interactions (not the "main" function)

  • Complexes assembled only under specific conditions

The Rule of Thumb

If the interaction is:

  • Ancient and conserved → AlphaFold can probably predict it

  • Novel or species-specific → AlphaFold might guess wrongly

Improving Complex Predictions

If your first prediction shows uncertain inter-chain PAE:

Strategy 1: Try Different Stoichiometries

AlphaFold-Multimer can predict:

  • 1:1 heterodimers (A:B)

  • 2:2 heterotetramers (A₂:B₂)

  • Homomers with different symmetry (A₂, A₃, A₄...)


The "true" stoichiometry might have better evolutionary support.

Strategy 2: Include Related Sequences

If your two proteins don't co-evolve, try:

  • Adding a third protein that bridges them

  • Including homologs that might have better MSA coverage

  • Running the prediction with proteins from a different species

Strategy 3: Use Template Information

If experimental structures exist for related complexes:

  • Provide them as templates

  • This can guide the prediction toward the correct interface

Strategy 4: MSA Pairing Strategies

Different MSA pairing methods can reveal different interactions:

  • Paired MSAs (same operon/genomic context)

  • Block MSAs (independent MSAs concatenated)

  • Phylogenetically paired MSAs

Strategy 5: Accept Uncertainty

Sometimes the interaction genuinely isn't predictable:

  • Low conservation → no evolutionary signal

  • Multiple binding modes → AlphaFold picks one (maybe wrong)

  • Disorder-mediated interaction → hard to predict


In these cases, experimental validation is required.

The Experimental Validation Hierarchy

When PAE is ambiguous, validate with:

Tier 1: Fast Validation

Cross-linking mass spectrometry (XL-MS):

  • Identifies residue pairs in proximity

  • Validates interface contacts

  • Can distinguish between models


Hydrogen-deuterium exchange (HDX-MS):

  • Shows which regions are protected upon binding

  • Maps interface location

  • Works with transient complexes

Tier 2: Structural Validation

Cryo-EM:

  • Medium-resolution interface visualization

  • Suitable for large complexes

  • Can capture multiple conformations


X-ray crystallography:

  • High-resolution interface details

  • Requires crystal packing (not always achievable)

  • Gold standard when it works

Tier 3: Functional Validation

Mutagenesis:

  • Disrupt predicted interface residues

  • Measure binding by SPR, ITC, or pull-down

  • Caution: requires correct interface prediction first

Summary: The Complex Prediction Workflow

1. RUN ALPHAFOLD-MULTIMER
          |
          v
2. CHECK pLDDT (Are individual chains well-folded?)
          |
    NO ---+--> Structure may be unreliable
          |
    YES
          |
          v
3. CHECK PAE MATRIX (Is inter-chain PAE low?)
          |
    NO ---+--> Interface is NOT reliable
          |    AlphaFold is guessing
          |
    YES (<

The Bottom Line

AlphaFold-Multimer is a powerful tool for predicting protein complexes, but it requires careful interpretation:

Metric

What It Tells You

What It Doesn't Tell You

pLDDT

Local structure confidence

Whether chains interact

PAE (intra-chain)

Domain structure confidence

Inter-domain flexibility

PAE (inter-chain)

Interface confidence

Binding affinity

ipTM

Overall interface quality

Which interface is correct

The most important rule: Never trust a complex prediction without checking the inter-chain PAE.


A beautiful 3D structure with two proteins touching means nothing if the PAE says AlphaFold was guessing. Conversely, a somewhat rough-looking interface with low PAE might be biologically accurate.

Streamlining Complex Analysis

For researchers working with multi-chain predictions, analyzing PAE matrices and interface metrics manually is tedious. Platforms like Orbion automate this workflow:

  • Automatic interface detection from AlphaFold-Multimer outputs

  • Interface area and contact residue calculation

  • Per-chain characterization (PTMs, binding sites, topology)

  • Confidence visualization highlighting reliable vs uncertain regions


The goal is to move from "here's a complex structure" to "here's what you can actually trust about this interaction"—before you design experiments based on potentially unreliable predictions.

References

  1. Jumper J, et al. (2021). Highly accurate protein structure prediction with AlphaFold. Nature, 596:583-589. Link

  2. Evans R, et al. (2022). Protein complex prediction with AlphaFold-Multimer. bioRxiv. Link

  3. Bryant P, et al. (2023). Evaluation of AlphaFold-Multimer prediction on multi-chain protein complexes. Bioinformatics, 39(7):btad424. Link

  4. Guo H, et al. (2023). PAE viewer: a webserver for the interactive visualization of the predicted aligned error. Nucleic Acids Research, 51(W1):W404-W410. PMC10320053

  5. EBI Training. pLDDT: Understanding local confidence. AlphaFold Course. Link

  6. Ruff KM & Pappu RV. (2021). AlphaFold and Implications for Intrinsically Disordered Proteins. Journal of Molecular Biology, 433(20):167208. Link