Blog

When to Give Up on a Protein Target

Feb 13, 2026

You've spent six months on this protein. Twelve expression constructs. Three expression systems. Refolding from inclusion bodies. Co-expression with chaperones. Insect cells. Mammalian cells. You have 200 µg of mostly aggregated material that loses activity overnight. Your PI asks if you should try one more thing. Your gut says to walk away. But how do you know when enough is enough?


There's no universal answer, but there are rational criteria for deciding when a protein target has become a sunk cost rather than a future success.

Key Takeaways

  • Sunk cost fallacy is real in protein science: The time already invested doesn't make future success more likely

  • Some proteins are genuinely intractable: ~52% of human proteins fail to express solubly in E. coli; ~15% of purified proteins yield crystal structures

  • Decision criteria should be explicit: Define what "success" looks like and how many attempts justify abandonment

  • Alternatives exist: Switching targets, methods, or approaches may be more productive than continuing

  • Walking away is not failure: It's resource allocation based on evidence

The Psychology of Persistence

The Sunk Cost Problem

The more you invest in a target, the harder it is to abandon it:

  • "I've spent six months on this—I can't give up now"

  • "We've already ordered so many constructs"

  • "The preliminary data was so promising"

  • "If we just try one more thing..."


This is the sunk cost fallacy: past investment doesn't change the probability of future success. A protein that hasn't expressed in 12 attempts isn't more likely to express on attempt 13 just because you've tried 12 times.

When Persistence Pays Off vs. When It Doesn't

Persistence makes sense when:

  • Each attempt provides new information

  • There's a clear hypothesis for why previous attempts failed

  • The next attempt tests something fundamentally different

  • Success probability is revised upward based on new data


Persistence is irrational when:

  • You're repeating similar approaches hoping for different results

  • There's no hypothesis—just "maybe this will work"

  • Success probability hasn't changed despite new information

  • Opportunity cost of continuing exceeds potential benefit

The Numbers: What Are Realistic Success Rates?

Expression Success Rates

From structural genomics consortia data:

Stage

Typical Success Rate

Soluble expression (E. coli)

40-50%

Purification to homogeneity

70-80% of expressed

Sufficient yield for crystallization

60-70% of purified

Crystal hits

30-35% of trials

Diffraction-quality crystals

40-50% of crystal hits

Structure solved

~85% of diffraction-quality


Combined: From purified protein to structure, approximately 15% succeed. That means 85% of proteins that reach crystallization trials never yield a structure.

SGC Experience

The Structural Genomics Consortium achieved:

  • Successful purification of 48% of human proteins attempted

  • Structures of ~40% of purified proteins solved

  • More than 300 additional protein domains purified but never crystallized


The lesson: Even expert labs with optimized pipelines have high failure rates. Some proteins simply don't cooperate.

Crystallization Statistics

Analysis of crystallization trials shows:

  • Only 0.2% of individual crystallization conditions yield crystals

  • For every 100 purified proteins, ~30 form crystals

  • Of those, only 13-15 yield structures

The Decision Framework

Stage 1: Expression (When to Try Something Different)

Red flags that suggest moving on:

  • No expression after 3-4 different constructs (different boundaries, tags, fusion partners)

  • Consistent toxicity to host cells

  • Only inclusion bodies despite optimization

  • Expression system limitations that can't be overcome


Questions to ask:

  1. Have you varied construct boundaries systematically?

  2. Have you tried solubility-enhancing fusions (MBP, SUMO, GST)?

  3. Have you tried different expression hosts (E. coli strains, insect, mammalian)?

  4. Have you tried lower temperature induction?

  5. Is there evidence that the protein folds at all in your system?


Decision point: If you've tried 6+ constructs across 2+ expression systems with no soluble protein, the probability of success with conventional approaches is low.

Stage 2: Purification and Stability (When Quality Is the Issue)

Red flags:

  • Protein aggregates during concentration

  • Activity lost within hours of purification

  • Proteolytic degradation despite inhibitors

  • Multiple species on SEC despite fresh prep


Questions to ask:

  1. Is aggregation concentration-dependent? (Try keeping it dilute)

  2. Are cofactors or metal ions required? (Have you tried reconstitution?)

  3. Is the protein stable in different buffers? (Screen systematically)

  4. Is the oligomeric state correct?

  5. Is there a shorter, more stable domain that would suffice?


Decision point: If you can't produce stable, monodisperse protein after systematic buffer screening and domain optimization, structural work will likely fail anyway.

Stage 3: Crystallization (When Crystals Won't Come)

Red flags:

  • No hits after screening 4-5 commercial screens (1000-2500 conditions)

  • Only microcrystals or needles that don't improve

  • Crystals that don't diffract despite optimization

  • Crystals with severe pathologies (twinning, high mosaicity)


Questions to ask:

  1. Is the protein homogeneous? (Check by DLS, SEC-MALS)

  2. Are there flexible regions that could be removed?

  3. Would surface entropy reduction mutations help?

  4. Have you tried crystallization chaperones or nanobodies?

  5. Is cryo-EM a viable alternative?


Decision point: In situ proteolysis has ~12% success rate for recalcitrant proteins—if even that fails, crystallography may not be the path.

What Makes a Protein Intractable?

Known Difficult Protein Classes

Some protein types have fundamentally lower success rates:

Protein Type

Challenge

Alternative Approaches

Membrane proteins

Require detergents, lipids, nanodiscs

Cryo-EM, lipidic cubic phase

IDPs (intrinsically disordered)

Won't crystallize, no stable fold

NMR, SAXS, crosslinking MS

Large complexes

Yield, stability, crystallization

Cryo-EM is often better

Glycoproteins

Glycan heterogeneity

Mammalian expression, deglycosylation

Proteins with unstable folds

Aggregation, degradation

Stabilizing mutations, different conditions

Sequence-Based Predictors of Failure

Studies have identified features that correlate with failure:

  • Low complexity regions (proline-rich, etc.)

  • High hydrophobicity (tends to aggregate)

  • Extreme pI (very basic or very acidic)

  • Predicted disorder >30%

  • Transmembrane segments


Before investing heavily, check whether your protein has these features.

The Alternative Matrix

When to Try a Different Approach vs. a Different Target

Situation

Try Different Approach

Try Different Target

No expression

Alternative expression system

If biology allows

Inclusion bodies

Refolding, different host

If refolding repeatedly fails

Aggregation

Buffer screening, truncation

If all constructs aggregate

No crystals

Cryo-EM, NMR

If structure isn't essential

No activity

Reconstitution, different assay

If function is the goal

Alternative Structural Methods

If crystallography fails:

Cryo-EM:

  • Better for large complexes (>100 kDa helps)

  • Tolerates some heterogeneity

  • No crystallization required

  • Improving resolution rapidly


NMR:

  • Works for smaller proteins (<30 kDa typically)

  • Handles flexible regions

  • Solution conditions

  • Requires isotope labeling


SAXS:

  • Low resolution but quick

  • Confirms oligomeric state

  • Models flexible assemblies

  • Useful for validation


AlphaFold/Computational:

  • Free and instant

  • High confidence for many folds

  • May be sufficient for some applications

  • Limited for complexes and dynamics

Defining "Good Enough"

What Do You Actually Need?

Before deciding to continue or quit, clarify the goal:

Goal

What You Need

When to Quit Pursuing

High-resolution structure

Diffracting crystals or cryo-EM sample

After exhaustive crystallization + no cryo-EM path

Binding site identification

Structure OR computational prediction + validation

When AlphaFold + mutagenesis suffices

Functional assay

Active protein

When activity cannot be reconstituted

Drug screening

Stable, active protein at scale

When production isn't economical

Target validation

Any evidence of drugability

When multiple orthogonal approaches fail


Often, you don't need what you think you need. A 2.5 Å structure is great, but a confident AlphaFold model + SPR binding data might answer your actual question.

The Decision Checklist

Before Abandoning a Target

Technical checklist:

  • [ ] Tried ≥3 different construct boundaries

  • [ ] Tried ≥2 expression systems (if possible)

  • [ ] Tried solubility-enhancing fusions

  • [ ] Screened buffer conditions systematically (if expressed)

  • [ ] Checked for cofactor/metal requirements

  • [ ] Tried removal of predicted flexible regions

  • [ ] Consulted literature for successful strategies with homologs

  • [ ] Considered alternative structural methods


Strategic checklist:

  • [ ] Defined what success actually requires

  • [ ] Estimated remaining probability of success

  • [ ] Calculated opportunity cost of continuing

  • [ ] Identified what you would do instead

  • [ ] Discussed with collaborators/advisors

Questions to Answer

  1. What is the probability of success with the next attempt?

    • If you can't articulate why it would work when others didn't, it probably won't.

  2. What is the opportunity cost?

    • What else could you do with that time and money?

    • Is there a higher-probability project waiting?

  3. What is the minimum acceptable outcome?

    • Would partial success (domain structure, homolog, computational model) suffice?

    • Can you reframe the question to require less?

  4. Is the biology worth the effort?

    • Is this target validated?

    • Are there alternative targets for the same biology?

    • Would success actually advance the field?

Case Studies

Case 1: The Wise Pivot

Target: Kinase implicated in cancer Investment: 8 months, 15 constructs, E. coli and insect cells Best result: 50 µg aggregated protein Decision: Pivot to AlphaFold model + mutagenesis


Outcome: Computational model identified binding site. Mutagenesis confirmed key residues. Published without structure. Drug discovery proceeded using model.


Lesson: The structure would have been nice, but wasn't actually required for the biological question.

Case 2: The Costly Persistence

Target: "Important" transcription factor Investment: 2 years, 30+ constructs, 3 expression systems, collaboration with expert lab Best result: Low-quality crystals that never improved Decision: Continued because "so much invested"


Outcome: Eventually abandoned after additional year. No publication. No structure. Graduate student delayed.


Lesson: Sunk cost reasoning extended a failing project by a year.

Case 3: The Method Switch

Target: Membrane protein complex Investment: 1 year in crystallography, no crystals Decision: Switch to cryo-EM


Outcome: 3.2 Å cryo-EM structure in 6 months. High-impact publication.


Lesson: The right method for the problem beats persistence with the wrong method.

Case 4: The Rational Abandonment

Target: Protein predicted to be 60% disordered Investment: 6 constructs, 3 months Best result: Soluble but aggregating protein Decision: Abandon—protein is intrinsically unsuitable for structural work


Outcome: Pivoted to NMR studies of the ordered domain. Functional studies using full-length. No single structure, but biological questions answered.


Lesson: Some proteins aren't meant to be crystallized. Recognizing this early saves time.

The Conversation with Your PI/Advisor

How to Propose Abandonment

Don't say: "It's not working and I want to give up."


Do say: "I've systematically tried X, Y, and Z approaches over N months. Based on the data, I estimate the probability of success with continued effort is approximately P%. I propose we either pivot to [alternative approach] or redirect resources to [different project], where the expected value is higher."


Bring data:

  • What you've tried (constructs, conditions, results)

  • What the literature says about similar proteins

  • What alternative approaches exist

  • What you would do instead

What If They Disagree?

  1. Propose a time-boxed final attempt: "Let me try one more approach for 4 weeks. If it doesn't work, we revisit."

  2. Suggest parallel paths: "Can I start the alternative project while finishing this one?"

  3. Ask what success would look like: "What outcome in the next month would justify continuing?"


The goal is a rational discussion about resource allocation, not a battle of wills.

The Hardest Decision: "Almost Working"

When You're Close But Not There

The cruelest scenario: you have protein that almost works.

  • 80% pure, but that 20% contaminant ruins everything

  • Crystals that diffract to 4 Å but won't improve

  • Active enzyme that loses activity during concentration

  • Protein that's perfect until you need more than 100 µg


These situations are the hardest because:

  • Success feels imminent

  • Each attempt shows partial improvement

  • The sunk cost is highest

  • Walking away feels like giving up at the finish line


The reality check:

  • "Almost" can persist indefinitely

  • Marginal improvements may asymptote below usability

  • The protein may be fundamentally limited


The question: Is the remaining gap closable with reasonable effort, or have you reached the limits of what this protein can do under these conditions?

The Bottom Line

Knowing when to quit is as important as knowing how to persist. The difference between a productive career and years of frustration often comes down to recognizing intractable problems early and redirecting effort to tractable ones.

Sign to Continue

Sign to Quit

Clear hypothesis for next attempt

No idea why it might work

New information from each try

Repeating same failures

Probability estimate improving

Probability estimate flat or declining

Unique value if successful

Alternative approaches available

Reasonable time to success

Indefinite timeline


The final question: If you were starting fresh today, with full knowledge of what you now know about this protein, would you choose to work on it?


If the answer is no, the sunk cost shouldn't change your mind.

Making the Right Target Choices Upfront

The best way to avoid quitting decisions is to make better starting decisions. For researchers selecting protein targets, platforms like Orbion can help assess tractability before you begin:

  • Disorder prediction: Identify regions likely to cause aggregation or crystallization failure

  • Expression suitability assessment: Flag proteins likely to require non-standard expression systems

  • Topology analysis: Identify membrane proteins and other challenging classes

  • PTM requirements: Predict modifications that may be essential for function

  • Binding site analysis: Assess whether structural work is actually necessary for your goals


Understanding your protein's challenges before you start helps you choose targets with higher success probability—or at least enter difficult projects with realistic expectations.

References

  1. Slabinski L, et al. (2007). The challenge of protein structure determination—lessons from structural genomics. Protein Science, 16(11):2472-2482. PMC2211687

  2. Savitsky P, et al. (2010). High-throughput production of human proteins for crystallization: The SGC experience. Journal of Structural Biology, 172(1):3-13. PMC2938586

  3. Dong A, et al. (2007). In situ proteolysis for protein crystallization and structure determination. Nature Methods, 4(12):1019-1021. PMC3366506

  4. McPherson A & Gavira JA. (2014). Introduction to protein crystallization. Acta Crystallographica Section F, 70(1):2-20. PMC4601584

  5. Nettleship JE, et al. (2008). Methods for protein characterization by mass spectrometry, thermal shift (ThermoFluor) assay, and multiangle or static light scattering. Methods in Molecular Biology, 426:299-318.

  6. Singh A, et al. (2015). Protein recovery from inclusion bodies of Escherichia coli using mild solubilization process. Microbial Cell Factories, 14:41. PMC4030991

  7. Deng X, et al. (2023). Recent advances in targeting the "undruggable" proteins: from drug discovery to clinical trials. Signal Transduction and Targeted Therapy, 8:335. Nature