Skip to main content

Envisioning is an emerging technology research institute and advisory.

LinkedInInstagramGitHub

2011 — 2026

research
  • Reports
  • Newsletter
  • Methodology
  • Origins
  • Vocab
services
  • Research Sessions
  • Signals Workspace
  • Bespoke Projects
  • Use Cases
  • Signal Scanfree
  • Readinessfree
impact
  • ANBIMAFuture of Brazilian Capital Markets
  • IEEECharting the Energy Transition
  • Horizon 2045Future of Human and Planetary Security
  • WKOTechnology Scanning for Austria
audiences
  • Innovation
  • Strategy
  • Consultants
  • Foresight
  • Associations
  • Governments
resources
  • Pricing
  • Partners
  • How We Work
  • Data Visualization
  • Multi-Model Method
  • FAQ
  • Security & Privacy
about
  • Manifesto
  • Community
  • Events
  • Support
  • Contact
  • Login
ResearchServicesPricingPartnersAbout
ResearchServicesPricingPartnersAbout
  1. Home
  2. Vocab
  3. Capsule Networks

Capsule Networks

Neural networks that encode spatial relationships between features using grouped neuron vectors called capsules.

Year: 2017Generality: 461
Back to Vocab

Capsule Networks (CapsNets) are a class of neural network architecture designed to overcome a fundamental weakness of convolutional neural networks: the loss of spatial and pose information caused by pooling operations. Where a CNN might correctly identify that eyes, a nose, and a mouth are present in an image without caring about their relative positions, a capsule network explicitly encodes the spatial relationships between features. Each capsule is a small group of neurons whose output is a vector rather than a scalar — the vector's magnitude represents the probability that a particular entity exists, while its direction encodes instantiation parameters such as position, orientation, scale, and deformation.

The key mechanism enabling capsule networks is dynamic routing, introduced in the landmark 2017 paper "Dynamic Routing Between Capsules" by Sara Sabour, Nicholas Frosst, and Geoffrey Hinton. Instead of fixed pooling, lower-level capsules iteratively negotiate with higher-level capsules to determine which parent capsule they should send their output to. A lower-level capsule routes its output to a higher-level capsule whose current prediction best agrees with its own — a process called routing by agreement. This allows the network to build part-whole relationships in a principled way, making it naturally equivariant to spatial transformations rather than relying on data augmentation to learn invariance.

Capsule networks matter because they address a deep theoretical concern about how CNNs represent structured visual information. They show strong performance on tasks requiring viewpoint robustness and generalize better from limited training data on certain benchmarks. However, they have not yet displaced CNNs in mainstream practice, largely due to computational cost and difficulty scaling to complex, high-resolution datasets. Research into more efficient routing algorithms and hybrid architectures continues, and capsule networks remain an important conceptual framework for thinking about how neural networks could more faithfully represent the compositional, hierarchical structure of the visual world.

Related

Related

CNN (Convolutional Neural Network)
CNN (Convolutional Neural Network)

A deep learning architecture that learns spatial hierarchies of features from visual data.

Generality: 875
GCN (Graph Convolutional Networks)
GCN (Graph Convolutional Networks)

Neural networks that apply convolution-like operations to learn from graph-structured data.

Generality: 694
FCN (Fully Convolutional Network)
FCN (Fully Convolutional Network)

A neural network architecture that produces pixel-wise predictions for image segmentation.

Generality: 694
TCN (Temporal Convolutional Networks)
TCN (Temporal Convolutional Networks)

Convolutional neural networks that model sequential data using dilated, causal convolutions.

Generality: 550
ControlNet
ControlNet

A neural network architecture that adds precise spatial controls to pretrained diffusion models.

Generality: 292
Neural Network
Neural Network

A layered system of interconnected nodes that learns patterns from data.

Generality: 947