Skip to main content

Envisioning is an emerging technology research institute and advisory.

LinkedInInstagramGitHub

2011 — 2026

research
  • Reports
  • Newsletter
  • Methodology
  • Origins
  • Vocab
services
  • Research Sessions
  • Signals Workspace
  • Bespoke Projects
  • Use Cases
  • Signal Scanfree
  • Readinessfree
impact
  • ANBIMAFuture of Brazilian Capital Markets
  • IEEECharting the Energy Transition
  • Horizon 2045Future of Human and Planetary Security
  • WKOTechnology Scanning for Austria
audiences
  • Innovation
  • Strategy
  • Consultants
  • Foresight
  • Associations
  • Governments
resources
  • Pricing
  • Partners
  • How We Work
  • Data Visualization
  • Multi-Model Method
  • FAQ
  • Security & Privacy
about
  • Manifesto
  • Community
  • Events
  • Support
  • Contact
  • Login
ResearchServicesPricingPartnersAbout
ResearchServicesPricingPartnersAbout
  1. Home
  2. Vocab
  3. CNN (Convolutional Neural Network)

CNN (Convolutional Neural Network)

A deep learning architecture that learns spatial hierarchies of features from visual data.

Year: 1998Generality: 875
Back to Vocab

A Convolutional Neural Network (CNN) is a class of deep neural network specifically designed to process data with a grid-like topology, most commonly images and video. Unlike fully connected networks that treat every input feature independently, CNNs exploit the spatial structure of visual data by applying learned filters across local regions of the input. This design choice dramatically reduces the number of parameters compared to dense networks and encodes a powerful inductive bias: that meaningful patterns can appear anywhere in an image and that nearby pixels are more related than distant ones.

The core building block of a CNN is the convolutional layer, where a set of learnable filters slides across the input to produce feature maps that highlight the presence of specific patterns—edges, textures, or more complex shapes in deeper layers. These are typically followed by nonlinear activation functions and pooling layers, which downsample feature maps to reduce spatial resolution and provide a degree of translation invariance. Stacking many such layers creates a hierarchy where early layers detect low-level features and later layers compose them into increasingly abstract representations, ultimately enabling the network to distinguish between complex categories like faces, objects, or medical anomalies.

CNNs became practically influential in machine learning with Yann LeCun's LeNet-5 in 1998, which demonstrated reliable handwritten digit recognition. The field was transformed in 2012 when AlexNet, trained on GPUs with large-scale data, won the ImageNet competition by a wide margin, triggering the modern deep learning era in computer vision. Subsequent architectures—VGGNet, GoogLeNet, ResNet, and EfficientNet—refined depth, width, and connectivity patterns to push accuracy and efficiency further.

Beyond image classification, CNNs underpin object detection, semantic segmentation, medical image analysis, autonomous driving perception, and even natural language processing tasks where local feature extraction is valuable. While transformer-based vision models have emerged as strong competitors, CNNs remain widely deployed due to their computational efficiency, well-understood behavior, and strong inductive biases that make them highly effective with limited data.

Related

Related

GCN (Graph Convolutional Networks)
GCN (Graph Convolutional Networks)

Neural networks that apply convolution-like operations to learn from graph-structured data.

Generality: 694
FCN (Fully Convolutional Network)
FCN (Fully Convolutional Network)

A neural network architecture that produces pixel-wise predictions for image segmentation.

Generality: 694
Neural Network
Neural Network

A layered system of interconnected nodes that learns patterns from data.

Generality: 947
DNN (Deep Neural Network)
DNN (Deep Neural Network)

Neural networks with many layers that learn hierarchical representations from raw data.

Generality: 871
Convolution
Convolution

A sliding filter operation that extracts spatial patterns from input data.

Generality: 871
Capsule Networks
Capsule Networks

Neural networks that encode spatial relationships between features using grouped neuron vectors called capsules.

Generality: 461