Skip to main content

Envisioning is an emerging technology research institute and advisory.

LinkedInInstagramGitHub

2011 — 2026

research
  • Observatory
  • Newsletter
  • Methodology
  • Origins
  • Vocab
services
  • Research Sessions
  • Signals Workspace
  • Bespoke Projects
  • Use Cases
  • Readinessfree
impact
  • ANBIMAFuture of Brazilian Capital Markets
  • IEEECharting the Energy Transition
  • Horizon 2045Future of Human and Planetary Security
  • WKOTechnology Scanning for Austria
audiences
  • Innovation
  • Strategy
  • Consultants
  • Foresight
  • Associations
  • Governments
resources
  • Pricing
  • Partners
  • How We Work
  • Data Visualization
  • Multi-Model Method
  • FAQ
  • Security & Privacy
about
  • Manifesto
  • Community
  • Events
  • Support
  • Contact
  • Login
ResearchServicesPricingPartnersAbout
ResearchServicesPricingPartnersAbout
  1. Home
  2. Vocab
  3. ResNet (Residual Network)

ResNet (Residual Network)

A CNN architecture using skip connections to enable training of very deep networks.

Year: 2015Generality: 795
Back to Vocab

ResNet, short for Residual Network, is a convolutional neural network architecture designed to make training very deep networks practical and effective. Introduced by Kaiming He and colleagues at Microsoft Research in 2015, it addressed a fundamental obstacle in deep learning: as networks grow deeper, gradients tend to vanish or explode during backpropagation, causing training to stall and performance to degrade. ResNet's solution was elegantly simple — rather than expecting each stack of layers to learn a direct mapping from input to output, it asks them to learn a residual function, the difference between the desired output and the input itself.

The key mechanism enabling this is the skip connection (also called a shortcut connection), which routes the input of a block directly to its output, bypassing one or more intermediate layers. The result is added element-wise to the transformed output before passing to the next block. Mathematically, if a block learns a function F(x), the block's output becomes F(x) + x rather than F(x) alone. This seemingly minor change has a profound effect: gradients can flow backward through the skip path without being attenuated by layer after layer of transformations, making it feasible to train networks with hundreds or even thousands of layers.

ResNet's impact was immediately apparent when it won the ImageNet Large Scale Visual Recognition Challenge (ILSVRC) in 2015, achieving top-5 error rates that surpassed all prior architectures by a significant margin. The original paper introduced variants ranging from ResNet-18 to ResNet-152, with the number indicating total layer depth. Subsequent work extended the concept further, producing architectures like ResNeXt, Wide ResNet, and DenseNet, all of which build on the residual learning principle.

Beyond image classification, ResNet has become a foundational backbone across computer vision tasks including object detection, semantic segmentation, and medical image analysis. Its design principles have also influenced architectures outside vision, including components of modern natural language processing models. The residual connection concept is now considered a standard building block in deep learning, demonstrating how a targeted structural innovation can unlock entirely new scales of model complexity.

Related

Related

DRL (Deep Residual Learning)
DRL (Deep Residual Learning)

A neural network design using skip connections so layers learn residual mappings, enabling much deeper models.

Generality: 752
Residual Connections
Residual Connections

Shortcut connections in deep networks that enable training of much deeper architectures.

Generality: 834
CNN (Convolutional Neural Network)
CNN (Convolutional Neural Network)

A deep learning architecture that learns spatial hierarchies of features from visual data.

Generality: 875
AlexNet
AlexNet

Landmark deep convolutional network that ignited the modern deep learning revolution in 2012.

Generality: 703
FCN (Fully Convolutional Network)
FCN (Fully Convolutional Network)

A neural network architecture that produces pixel-wise predictions for image segmentation.

Generality: 694
Vanishing Gradient
Vanishing Gradient

A training failure where gradients shrink exponentially, preventing early network layers from learning.

Generality: 720