Skip to main content

Envisioning is an emerging technology research institute and advisory.

LinkedInInstagramGitHub

2011 — 2026

research
  • Observatory
  • Newsletter
  • Methodology
  • Origins
  • Vocab
services
  • Research Sessions
  • Signals Workspace
  • Bespoke Projects
  • Use Cases
  • Readinessfree
impact
  • ANBIMAFuture of Brazilian Capital Markets
  • IEEECharting the Energy Transition
  • Horizon 2045Future of Human and Planetary Security
  • WKOTechnology Scanning for Austria
audiences
  • Innovation
  • Strategy
  • Consultants
  • Foresight
  • Associations
  • Governments
resources
  • Pricing
  • Partners
  • How We Work
  • Data Visualization
  • Multi-Model Method
  • FAQ
  • Security & Privacy
about
  • Manifesto
  • Community
  • Events
  • Support
  • Contact
  • Login
ResearchServicesPricingPartnersAbout
ResearchServicesPricingPartnersAbout
  1. Home
  2. Vocab
  3. The Bitter Lesson

The Bitter Lesson

Insight that hand-crafted methods are ultimately outpaced by general learning-based approaches at scale.

Year: 2019Generality: 720Added: May 12, 2026
Back to Vocab

The Bitter Lesson is a 2019 essay by Richard Sutton arguing that the history of artificial intelligence demonstrates a consistent pattern: methods that leverage computation and general learning consistently outpace methods that incorporate human knowledge and hand-crafted structure, as compute scales increases. Sutton's key observation is that AI researchers often invest heavily in building domain knowledge into their systems — explicit rules, heuristics, hand-crafted features, and domain-specific representations — and that this investment consistently pays less dividends than expected as the field advances. The "bitter" lesson is that the human knowledge component of these systems tends to add complexity without delivering proportional gains, while general methods (often simple ones, like search and learning) eventually surpass them.

The essay is titled "The Bitter Lesson" because the pattern is bitter in the sense that it is recurring and seemingly predictable in retrospect, yet AI researchers continue to make the same mistake generation after generation. Sutton attributes this to a cognitive bias toward valuing human-crafted solutions over solutions that work better but are less transparent or understandable — a preference for approaches that feel principled and intellectually satisfying over approaches that simply perform better. The result is a cycle of over-investment in knowledge-embedding approaches, followed by disruption by simpler, more general approaches, followed by the gradual reintroduction of knowledge components, followed by another disruption.

The relevance of the Bitter Lesson to interaction models is direct: the central argument of the interaction model research is that the current paradigm of bolting on interactivity with external harnesses (voice activity detection, turn-boundary detection, dialog management components) is an instance of the same pattern Sutton identifies. These hand-crafted components are the "human knowledge" that the Bitter Lesson predicts will be outpaced by end-to-end learned approaches. The interaction model approach — co-training interaction capability from scratch with the core model — is an application of Sutton's insight to the interaction problem.

The practical implication is that AI research labs should expect that their hand-crafted interaction infrastructure (VADs, turn managers, separate dialog policies) will ultimately be outpaced by models that learn interaction natively, just as hand-crafted speech recognition was outpaced by end-to-end neural ASR, and hand-crafted computer vision was outpaced by end-to-end CNNs. This does not mean that the transition is easy or that the hand-crafted approach is never worth pursuing in the short term — but it does suggest that investing in interaction-native architectures is the correct strategic direction, even if current implementations are inferior to harness-based approaches.