Envisioning is an emerging technology research institute and advisory.

MCTS (Monte Carlo Tree Search)

A search algorithm that uses randomized simulations to navigate large decision spaces.

Year: 2006
Generality: 796

Monte Carlo Tree Search (MCTS) is a probabilistic search algorithm designed to find strong decisions in problems with enormous branching factors, where exhaustive search is computationally infeasible. Rather than evaluating every possible future state, MCTS concentrates computational effort on the most promising regions of a decision tree, using the statistics of randomly sampled playouts to decide where to search next. It has become a cornerstone technique in game-playing AI and sequential decision-making under uncertainty.

The algorithm repeats four phases. In selection, the tree is traversed from the root using a policy that balances exploration of less-visited nodes against exploitation of high-performing ones, most commonly the UCT (Upper Confidence Bounds applied to Trees) formula. In expansion, one or more new child nodes are added to the tree. In simulation (also called rollout), a random or heuristic-guided playout proceeds from the new node to a terminal state. Finally, in backpropagation, the result of that simulation is propagated back up the tree, updating win and visit statistics at the new node and at each of its ancestors. Repeated over thousands of iterations, this process builds an increasingly accurate estimate of which actions are most valuable.
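The four phases can be sketched in a few dozen lines. The example below is a minimal illustration, not a reference implementation: it assumes a toy single-pile Nim game (players alternately remove 1 to 3 stones, and whoever takes the last stone wins), uniformly random rollouts, and an exploration constant of sqrt(2). The names Node, legal_moves, uct_child, rollout, and mcts are chosen here for illustration only.

```python
import math
import random

def legal_moves(stones):
    # Assumed toy game: remove 1, 2, or 3 stones from a single pile.
    return [m for m in (1, 2, 3) if m <= stones]

class Node:
    def __init__(self, stones, player, parent=None, move=None):
        self.stones = stones            # stones left in the pile
        self.player = player            # player to move at this node (0 or 1)
        self.parent = parent
        self.move = move                # move that led from the parent to this node
        self.children = []
        self.untried = legal_moves(stones)
        self.visits = 0
        self.wins = 0.0                 # wins for the player who moved INTO this node

def uct_child(node, c=math.sqrt(2)):
    # UCT score: average reward plus an exploration bonus for rarely visited children.
    return max(
        node.children,
        key=lambda ch: ch.wins / ch.visits
        + c * math.sqrt(math.log(node.visits) / ch.visits),
    )

def rollout(stones, player):
    # Uniformly random playout; returns the player who takes the last stone.
    while True:
        stones -= random.choice(legal_moves(stones))
        if stones == 0:
            return player
        player = 1 - player

def mcts(stones, player=0, iterations=3000):
    root = Node(stones, player)
    for _ in range(iterations):
        node = root
        # 1. Selection: descend through fully expanded nodes using UCT.
        while not node.untried and node.children:
            node = uct_child(node)
        # 2. Expansion: add one child for an untried move, if any remain.
        if node.untried:
            move = node.untried.pop()
            child = Node(node.stones - move, 1 - node.player, node, move)
            node.children.append(child)
            node = child
        # 3. Simulation: play out to a terminal state.
        if node.stones == 0:
            winner = 1 - node.player    # the player who just moved took the last stone
        else:
            winner = rollout(node.stones, node.player)
        # 4. Backpropagation: update statistics from the new node up to the root.
        while node is not None:
            node.visits += 1
            if node.parent is not None and winner == node.parent.player:
                node.wins += 1
            node = node.parent
    # Recommend the most-visited root move, a common and robust final choice.
    return max(root.children, key=lambda ch: ch.visits).move
```

Calling mcts(5) asks for a move from a pile of five stones. In this game the winning strategy is to leave the opponent a multiple of four, so after enough iterations the search should settle on taking one stone. Storing wins from the perspective of the player who moved into a node lets each parent maximize its own success rate during selection, which is one simple way to handle alternating turns.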

MCTS rose to prominence in machine learning through its role in game-playing systems. Its 2006 application to the board game Go, a domain where classical minimax search with hand-crafted evaluation functions had long struggled, marked a turning point. The algorithm's impact was dramatically amplified in 2016 when DeepMind's AlphaGo combined MCTS with deep neural networks to defeat a world champion Go player, demonstrating that learned value and policy functions could guide the search far more accurately than random rollouts alone.

Beyond games, MCTS has found application in reinforcement learning, combinatorial optimization, protein structure prediction, and planning for autonomous systems. Its strength lies in its model-based nature — it requires only a simulator or environment model — and its anytime property, meaning it can return a best-guess answer at any point and improve with additional computation. These qualities make it broadly applicable wherever sequential decisions must be made under uncertainty with a large action space.
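The anytime property and the simulator-only requirement can be illustrated with a deliberately simplified sketch. The code below is not full MCTS: it assumes a depth-one tree, meaning a UCB1 bandit over the root actions only, and a hypothetical simulate(action) callback that returns a reward in [0, 1]. The names anytime_search and run_for are introduced here for illustration.

```python
import math
import random
import time

def anytime_search(actions, simulate, c=math.sqrt(2)):
    """Generator that yields the current best action after every simulation."""
    visits = {a: 0 for a in actions}
    reward = {a: 0.0 for a in actions}
    n = 0
    while True:
        if n < len(actions):
            a = actions[n]              # try every action once before using UCB1
        else:
            a = max(
                actions,
                key=lambda x: reward[x] / visits[x]
                + c * math.sqrt(math.log(n) / visits[x]),
            )
        reward[a] += simulate(a)        # the only access to the environment
        visits[a] += 1
        n += 1
        # A best guess is available after every single simulation.
        yield max(actions, key=lambda x: visits[x])

def run_for(search, seconds):
    """Consume the search until a wall-clock deadline and return the last answer."""
    deadline = time.monotonic() + seconds
    best = next(search)
    while time.monotonic() < deadline:
        best = next(search)
    return best
```

A caller can stop pulling from the generator at any moment and still hold a valid recommendation, and a longer budget simply means more simulations behind that recommendation. For example, run_for(anytime_search(["a", "b", "c"], simulate), 0.05) returns whichever action has been sampled most after roughly 50 milliseconds, given a caller-supplied simulate function.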

Related

Monte Carlo Estimation
Approximates probabilities or expectations by averaging results across many random simulations.
Generality: 794

Search
Systematic exploration of a problem space to find goal-achieving solutions or action sequences.
Generality: 871

Heuristic Search Techniques
Guided search methods that use domain knowledge to find solutions efficiently.
Generality: 731

A* Search
An efficient pathfinding algorithm combining actual path cost with heuristic estimates.
Generality: 694

Minimax
A game-tree search strategy that minimizes an opponent's best possible outcome.
Generality: 792

MPC (Model-Predictive Control)
Control strategy that optimizes actions by predicting future system states over a rolling horizon.
Generality: 662