Skip to main content

Envisioning is an emerging technology research institute and advisory.

LinkedInInstagramGitHub

2011 — 2026

research
  • Observatory
  • Newsletter
  • Methodology
  • Origins
  • Vocab
services
  • Research Sessions
  • Signals Workspace
  • Bespoke Projects
  • Use Cases
  • Readinessfree
impact
  • ANBIMAFuture of Brazilian Capital Markets
  • IEEECharting the Energy Transition
  • Horizon 2045Future of Human and Planetary Security
  • WKOTechnology Scanning for Austria
audiences
  • Innovation
  • Strategy
  • Consultants
  • Foresight
  • Associations
  • Governments
resources
  • Pricing
  • Partners
  • How We Work
  • Data Visualization
  • Multi-Model Method
  • FAQ
  • Security & Privacy
about
  • Manifesto
  • Community
  • Events
  • Support
  • Contact
  • Login
ResearchServicesPricingPartnersAbout
ResearchServicesPricingPartnersAbout
  1. Home
  2. Vocab
  3. Mamba

Mamba

Selective state space model architecture replacing attention with linear-time sequence modeling

Year: 2023Generality: 445
Back to Vocab

Mamba is a sequence modeling architecture developed by Albert Gu and Tri Dao in 2023 that fundamentally challenges the dominance of attention-based transformers. Rather than computing pairwise attention scores between every pair of tokens—an O(n²) operation—Mamba uses a state space model (SSM) framework that scales linearly with sequence length, enabling processing of much longer contexts at lower computational cost.

The critical innovation in Mamba is its selection mechanism, which makes the model content-aware. Previous SSMs used fixed, global parameters that treated all inputs equally. Mamba introduces data-dependent selection where the model learns to control how much information from the current input is incorporated into its hidden state. This allows the same architecture to selectively focus on relevant information without explicit attention weights. The selection parameters are derived from the input in a learnable way, making the effective receptive field dynamic and input-dependent, unlike traditional feed-forward components.

Mamba processes tokens through a recurrent-like state update where each token updates a hidden state, then generates output through a learned projection. This recurrent structure, combined with selection, gives Mamba the pattern-matching capabilities that made transformers successful while preserving the linear scaling of SSMs. Early experiments show Mamba-based models achieve comparable or superior performance to transformers on language modeling tasks while maintaining better memory efficiency and throughput on very long sequences. The architecture has implications for real-time inference, long-context understanding, and scaling to domains like genomics and time series where sequence length is prohibitive under quadratic attention.

Related

Related

Memory Sparse Attention
Memory Sparse Attention

An attention mechanism combining persistent memory tokens with sparse connectivity for efficient long-range modeling.

Generality: 339
SSM (State-Space Model)
SSM (State-Space Model)

A mathematical framework modeling dynamic systems through evolving hidden state variables.

Generality: 720
Scalable MatMul-free Language Modeling
Scalable MatMul-free Language Modeling

Language modeling architectures that replace matrix multiplication with cheaper, scalable alternatives.

Generality: 111
Deep SSMs (Deep State-Space Models)
Deep SSMs (Deep State-Space Models)

Neural networks combined with state-space models to capture complex sequential dynamics.

Generality: 514
L2M (Large Memory Model)
L2M (Large Memory Model)

A decoder-only Transformer with addressable auxiliary memory enabling reasoning far beyond its attention window.

Generality: 189
Sequence Model
Sequence Model

A model that learns patterns and dependencies within ordered data sequences.

Generality: 840