Skip to main content

Envisioning is an emerging technology research institute and advisory.

LinkedInInstagramGitHub

2011 — 2026

research
  • Observatory
  • Newsletter
  • Methodology
  • Origins
  • Vocab
services
  • Research Sessions
  • Signals Workspace
  • Bespoke Projects
  • Use Cases
  • Readinessfree
impact
  • ANBIMAFuture of Brazilian Capital Markets
  • IEEECharting the Energy Transition
  • Horizon 2045Future of Human and Planetary Security
  • WKOTechnology Scanning for Austria
audiences
  • Innovation
  • Strategy
  • Consultants
  • Foresight
  • Associations
  • Governments
resources
  • Pricing
  • Partners
  • How We Work
  • Data Visualization
  • Multi-Model Method
  • FAQ
  • Security & Privacy
about
  • Manifesto
  • Community
  • Events
  • Support
  • Contact
  • Login
ResearchServicesPricingPartnersAbout
ResearchServicesPricingPartnersAbout
  1. Home
  2. Vocab
  3. MPC (Model-Predictive Control)

MPC (Model-Predictive Control)

Control strategy that optimizes actions by predicting future system states over a rolling horizon.

Year: 1990Generality: 662
Back to Vocab

Model-Predictive Control (MPC) is an optimization-based control strategy that uses an explicit mathematical model of a system to predict its future behavior and compute optimal control actions accordingly. At each time step, MPC solves a constrained optimization problem over a finite prediction horizon, selecting the sequence of control inputs that minimizes a defined cost function — such as tracking error or energy consumption — while satisfying physical and operational constraints. Only the first action in the optimal sequence is applied, after which the horizon rolls forward and the process repeats. This receding-horizon structure allows MPC to continuously adapt to new information and disturbances.

The power of MPC lies in its ability to handle multivariable systems with hard constraints on inputs and outputs in a principled, unified framework. Unlike classical control methods that treat constraints as afterthoughts, MPC incorporates them directly into the optimization, making it well-suited for systems where safety limits, actuator saturation, or resource bounds are critical. The underlying optimization is typically a quadratic program for linear systems, though nonlinear variants (NMPC) exist for more complex dynamics at greater computational cost.

In machine learning and reinforcement learning, MPC has gained renewed relevance as a planning and decision-making backbone. Model-based reinforcement learning methods often use learned dynamics models within an MPC loop, enabling sample-efficient control without requiring a hand-engineered model. Techniques like the Cross-Entropy Method (CEM) or random shooting are used to approximately solve the MPC optimization when gradients are unavailable or the model is a neural network. This intersection has made MPC a central component in modern robotics, autonomous driving, and continuous control benchmarks.

MPC's practical appeal stems from its interpretability and constraint-handling, properties that purely learned policies often lack. As learned world models improve in accuracy and inference speed, hybrid approaches combining the flexibility of deep learning with the structure of MPC are increasingly common, positioning MPC as a durable and adaptable tool at the frontier of intelligent control systems.

Related

Related

PPO (Proximal Policy Optimization)
PPO (Proximal Policy Optimization)

A stable, efficient reinforcement learning algorithm using clipped policy updates.

Generality: 694
MCP (Model Context Protocol)
MCP (Model Context Protocol)

Open protocol standardizing how AI models connect to external tools and data sources

Generality: 756
Policy Learning
Policy Learning

Reinforcement learning approach that directly optimizes a policy to maximize cumulative reward.

Generality: 794
MDPO (Mirror Descent Policy Optimization)
MDPO (Mirror Descent Policy Optimization)

A reinforcement learning algorithm using mirror descent for stable, geometry-aware policy updates.

Generality: 292
TRPO (Trust Region Policy Optimization)
TRPO (Trust Region Policy Optimization)

A reinforcement learning algorithm that ensures stable policy updates via constrained optimization.

Generality: 620
MCTS (Monte Carlo Tree Search)
MCTS (Monte Carlo Tree Search)

A search algorithm that uses randomized simulations to navigate large decision spaces.

Generality: 796