Envisioning is an emerging technology research institute and advisory.

GRAM (Generative Recursive Reasoning Models)

A probabilistic framework turning recursive latent reasoning into multi-trajectory computation.

Year: 2026
Generality: 550

GRAM is a framework that recasts recursive reasoning as a stochastic latent trajectory, replacing the deterministic state updates of prior Recursive Reasoning Models (RRMs) with probabilistic sampling at each recursion step. Instead of converging to a single solution, GRAM maintains a distribution over reasoning paths, so several hypotheses can be explored at once. It models the conditional distribution p(y|x) by marginalizing over latent trajectories, and it can also model p(x) for unconditional generation. It is trained with amortized variational inference, and its inference-time compute scales along two axes: recursion depth and the number of trajectories sampled in parallel.
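The marginalization and the variational training objective can be written out in generic form. The block below is a sketch in assumed notation, not GRAM's published formulation: z_{1:T} denotes the latent trajectory, T the recursion depth, z_0 a fixed initial state, and q_φ an amortized approximate posterior. The Markov factorization of the prior over trajectories is also an assumption.

```latex
% Assumed notation (not taken from the source): z_{1:T}, T, z_0, q_\phi.
\begin{align}
p_\theta(z_{1:T} \mid x) &= \prod_{t=1}^{T} p_\theta(z_t \mid z_{t-1}, x) \\
p_\theta(y \mid x) &= \int p_\theta(y \mid z_{1:T}, x)\, p_\theta(z_{1:T} \mid x)\, dz_{1:T} \\
\log p_\theta(y \mid x) &\ge \mathbb{E}_{q_\phi(z_{1:T} \mid x, y)}\big[\log p_\theta(y \mid z_{1:T}, x)\big]
  - \mathrm{KL}\big(q_\phi(z_{1:T} \mid x, y) \,\|\, p_\theta(z_{1:T} \mid x)\big)
\end{align}
```

The last line is the standard evidence lower bound: maximizing it trains the generative model and the amortized posterior jointly, without evaluating the integral in the second line directly.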

The core mechanism introduces stochasticity into the latent state transition: at each step the model samples from a transition distribution conditioned on the input and current state, rather than computing a single deterministic update. This produces a distribution over reasoning trajectories, each of which can represent a distinct solution strategy. Multiple trajectories can be sampled in parallel at inference time, effectively trading compute for coverage of the solution space. A latent process reward model (LPRM) provides step-wise guidance to distinguish productive from unproductive reasoning paths.
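The difference between a deterministic update and a sampled transition can be illustrated with a toy problem. The sketch below is not GRAM's architecture: the "latent state" is a single number, the task is to find z with z² = x (which has two valid answers), and all names (`mean_update`, `sample_trajectory`, `toy_step_score`, `run`) are hypothetical. `toy_step_score` is only a stand-in for the kind of signal an LPRM might provide.

```python
import random


def mean_update(z, x, lr=0.02):
    # Deterministic part of the transition: one gradient step on (z^2 - x)^2.
    return z - lr * 4.0 * z * (z * z - x)


def sample_trajectory(x, steps, sigma, rng, z0=0.0):
    # Stochastic recursion: z_{t+1} = mean_update(z_t, x) + Gaussian noise.
    # With sigma == 0 this reduces to the deterministic recursion.
    z = z0
    path = [z]
    for _ in range(steps):
        noise = rng.gauss(0.0, sigma) if sigma > 0 else 0.0
        z = mean_update(z, x) + noise
        path.append(z)
    return path


def toy_step_score(z, x):
    # Hypothetical stand-in for a step-wise reward: closer to z^2 = x is better.
    return -abs(z * z - x)


def run(x=4.0, n_traj=64, steps=100, sigma=0.05, seed=0):
    rng = random.Random(seed)
    paths = [sample_trajectory(x, steps, sigma, rng) for _ in range(n_traj)]
    finals = [p[-1] for p in paths]
    best = max(finals, key=lambda z: toy_step_score(z, x))
    return finals, best


if __name__ == "__main__":
    deterministic = sample_trajectory(4.0, 100, 0.0, random.Random(0))[-1]
    finals, best = run()
    print("deterministic end state:", deterministic)
    print("trajectories ending near +2:", sum(z > 0 for z in finals))
    print("trajectories ending near -2:", sum(z < 0 for z in finals))
    print("best end state by toy score:", best)
```

In this toy setting the deterministic recursion started at zero never moves, because the update is exactly zero there, while the sampled trajectories spread across both solutions. Scoring the end states and keeping the best one is the simplest form of trading extra trajectories for coverage.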

GRAM improves over deterministic RRMs such as HRM and TRM on structured reasoning and constraint satisfaction tasks where multiple valid solutions exist. It also demonstrates unconditional generation, producing valid puzzle instances without any input, which prior RRMs cannot do. The approach trades some per-trajectory efficiency for broader solution coverage: each individual trajectory is stochastic and may need more steps to converge, but parallel sampling finds alternatives that deterministic models miss entirely.

Whether stochastic recursion consistently outperforms carefully designed deterministic recursion on tasks with a single solution remains unclear. The unconditional generation capability (generating valid structured problem instances from scratch) is underexplored and may require architectural innovations beyond what GRAM currently demonstrates. Scaling via parallel trajectories introduces memory and compute costs that grow linearly with trajectory count, which may limit deployment on memory-constrained devices.
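The linear growth in memory is easy to make concrete with back-of-the-envelope arithmetic. The sizes below (81 positions, a 512-dimensional latent per position, 4-byte floats) are hypothetical placeholders rather than GRAM's actual configuration, and the estimate covers only the latent state held per trajectory, not weights or intermediate activations.

```python
def trajectory_state_bytes(n_traj, positions, hidden_dim, bytes_per_value=4):
    """Bytes needed to hold one latent state per trajectory (hypothetical sizes)."""
    return n_traj * positions * hidden_dim * bytes_per_value


if __name__ == "__main__":
    for k in (1, 8, 64):
        mib = trajectory_state_bytes(k, positions=81, hidden_dim=512) / 2**20
        print(f"{k:>3} trajectories: {mib:.2f} MiB of latent state")
```

Doubling the trajectory count doubles this figure, so the trajectory budget on a small device is bounded by whatever memory remains after the model weights are loaded.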