Skip to main content

Envisioning is an emerging technology research institute and advisory.

LinkedInInstagramGitHub

2011 — 2026

research
  • Observatory
  • Newsletter
  • Methodology
  • Origins
  • Vocab
services
  • Research Sessions
  • Signals Workspace
  • Bespoke Projects
  • Use Cases
  • Readinessfree
impact
  • ANBIMAFuture of Brazilian Capital Markets
  • IEEECharting the Energy Transition
  • Horizon 2045Future of Human and Planetary Security
  • WKOTechnology Scanning for Austria
audiences
  • Innovation
  • Strategy
  • Consultants
  • Foresight
  • Associations
  • Governments
resources
  • Pricing
  • Partners
  • How We Work
  • Data Visualization
  • Multi-Model Method
  • FAQ
  • Security & Privacy
about
  • Manifesto
  • Community
  • Events
  • Support
  • Contact
  • Login
ResearchServicesPricingPartnersAbout
ResearchServicesPricingPartnersAbout
  1. Home
  2. Vocab
  3. AlphaFold

AlphaFold

A deep learning system predicting 3D protein structures from amino acid sequences with near-experimental accuracy.

Year: 2020Generality: 703
Back to Vocab

AlphaFold is a family of deep learning models developed by DeepMind that predict three-dimensional protein structures directly from amino acid sequences. By integrating evolutionary information derived from multiple sequence alignments (MSAs) with learned geometric reasoning, AlphaFold produces atomic coordinates and per-residue confidence scores (pLDDT) that frequently match the accuracy of experimental methods such as X-ray crystallography or cryo-electron microscopy. Its landmark version, AlphaFold2, was introduced at the CASP14 protein structure prediction competition in 2020, where it achieved accuracy so far beyond prior methods that many researchers described it as effectively solving a 50-year-old grand challenge in biology.

Architecturally, AlphaFold2 centers on the "Evoformer" — a novel neural network block that jointly processes two representations: a pairwise residue-residue distance map and an MSA representation encoding evolutionary co-variation across related protein sequences. These representations iteratively exchange information through attention mechanisms, allowing the model to learn implicit physical and evolutionary constraints without hand-crafted energy functions. A downstream structure module uses invariant point attention (IPA) to produce backbone and side-chain coordinates in a rotation- and translation-equivariant manner. The entire pipeline is trained end-to-end on structures from the Protein Data Bank (PDB), with prediction recycling used as a form of self-consistency regularization.

AlphaFold's significance extends well beyond structural biology. It demonstrated that a sufficiently expressive neural architecture, trained on the right combination of evolutionary and structural data, can internalize complex physical rules that previously required decades of expert-crafted heuristics to approximate. This has accelerated drug discovery, enzyme engineering, and the interpretation of disease-associated genetic variants at a scale previously impossible. DeepMind subsequently released predicted structures for over 200 million proteins through the AlphaFold Protein Structure Database, making high-quality structural models freely accessible to the global research community.

Despite its transformative impact, AlphaFold retains important limitations. It performs less reliably on multi-chain protein complexes, intrinsically disordered regions, and proteins whose function depends on ligand binding, post-translational modifications, or conformational dynamics. These gaps have motivated successor systems — including AlphaFold3 and competing models — that extend the framework to broader classes of biomolecular interactions, underscoring how AlphaFold reshaped the entire field of computational structural biology.

Related

Related

PML (Protein Language Model)
PML (Protein Language Model)

Transformer-based models that learn biological meaning from protein sequence data.

Generality: 339
AlphaGeometry
AlphaGeometry

A neuro-symbolic AI system that solves olympiad-level geometry problems at human-expert level.

Generality: 94
Generative Optogenetics
Generative Optogenetics

Using generative AI models to design novel light-sensitive proteins for biological control.

Generality: 58
AFMs (Analog Foundation Models)
AFMs (Analog Foundation Models)

Large pretrained AI models designed to run on analog hardware for dramatic efficiency gains.

Generality: 96
LFMs (Liquid Foundation Models)
LFMs (Liquid Foundation Models)

Efficient generative AI models using dynamical systems principles to handle diverse data types.

Generality: 102
ALife (Artificial Life)
ALife (Artificial Life)

A field simulating biological processes in artificial systems to understand life itself.

Generality: 696