Envisioning is an emerging technology research institute and advisory.

Retrieval-Based Model

A model that responds by selecting the best match from a predefined response database.

Year: 2016
Generality: 692
A retrieval-based model produces its output not by synthesizing new text, but by searching a curated database of candidate responses and returning the one that best matches the input. Selection relies on similarity measures that range from classical techniques like TF-IDF and BM25 to dense vector representations produced by neural encoders. Given a user query, the model computes a relevance score between the query and each candidate, then returns the highest-ranked result. This architecture stands in contrast to generative models, which construct responses token by token and can produce entirely novel text.
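The score-and-select loop described above can be sketched in a few lines. This is a minimal illustration that assumes BM25 as the similarity measure and a tiny in-memory response list; the names `BM25Retriever`, `tokenize`, and `respond`, the sample responses, and the parameter defaults `k1=1.5` and `b=0.75` are assumptions made for the example, not part of any particular system.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


class BM25Retriever:
    """Hypothetical retriever: scores a query against fixed candidate responses."""

    def __init__(self, candidates, k1=1.5, b=0.75):
        self.candidates = candidates
        self.k1 = k1
        self.b = b
        # Term counts and lengths for every candidate response.
        self.docs = [Counter(tokenize(c)) for c in candidates]
        self.lengths = [sum(d.values()) for d in self.docs]
        self.avg_len = sum(self.lengths) / len(self.lengths)
        # Document frequency: how many candidates contain each term.
        df = Counter()
        for d in self.docs:
            df.update(d.keys())
        n = len(candidates)
        self.idf = {t: math.log(1 + (n - f + 0.5) / (f + 0.5)) for t, f in df.items()}

    def score(self, query, index):
        """BM25 relevance of one candidate to the query."""
        doc, length = self.docs[index], self.lengths[index]
        total = 0.0
        for term in tokenize(query):
            tf = doc.get(term, 0)
            if tf == 0:
                continue  # the term does not occur in this candidate
            norm = tf + self.k1 * (1 - self.b + self.b * length / self.avg_len)
            total += self.idf[term] * tf * (self.k1 + 1) / norm
        return total

    def respond(self, query):
        """Return the highest-scoring candidate and its score."""
        scores = [self.score(query, i) for i in range(len(self.candidates))]
        best = max(range(len(scores)), key=scores.__getitem__)
        return self.candidates[best], scores[best]


# Illustrative data only.
candidates = [
    "You can reset your password from the account settings page.",
    "Our support team is available Monday through Friday.",
    "Refunds are processed within five business days.",
]
retriever = BM25Retriever(candidates)
reply, score = retriever.respond("How do I reset my password?")
print(reply)
```

Every reply is one of the stored strings, so the output space is fixed in advance. In this sketch, swapping BM25 for a dense encoder would change only how `score` is computed, not the selection step.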

The practical appeal of retrieval-based models lies in their predictability and controllability. Because every possible output is drawn from a hand-curated or carefully indexed corpus, the system cannot fabricate new facts or produce off-brand language, although it can still return a poorly matched or outdated entry. This is a critical advantage in customer service, medical Q&A, and enterprise chatbots, where response accuracy is non-negotiable. Early deployments used keyword matching and rule-based filters, but the introduction of dual-encoder architectures and models like Dense Passage Retrieval (DPR) dramatically improved the ability to match semantically similar queries even when surface-level wording differs.

Retrieval-based approaches have gained renewed importance in the era of large language models through Retrieval-Augmented Generation (RAG), where a retrieval component fetches relevant documents that a generative model then uses to ground its response. This hybrid design combines the factual reliability of retrieval with the fluency of generation, and has become a dominant pattern for building knowledge-intensive NLP systems. The underlying retrieval machinery (dense indexes, approximate nearest-neighbor search, and bi-encoder models) is now a core component of modern AI infrastructure.
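The retrieve-then-generate flow can be sketched as follows. This is a rough illustration under stated assumptions: `embed` is a toy bag-of-words stand-in for a neural encoder, the ranking is an exhaustive cosine-similarity scan rather than an approximate nearest-neighbor index, and `generate` is a hypothetical callable that a real system would back with a language model. All function names and the prompt wording are invented for the example.

```python
import math
import re
from collections import Counter


def embed(text):
    """Toy stand-in for a neural encoder: a bag-of-words count vector."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve(query, documents, k=2):
    """Return the k documents most similar to the query (exhaustive scan)."""
    q = embed(query)
    ranked = sorted(documents, key=lambda d: cosine(q, embed(d)), reverse=True)
    return ranked[:k]


def build_prompt(query, passages):
    """Place the retrieved passages ahead of the question as grounding context."""
    context = "\n".join(f"[{i + 1}] {p}" for i, p in enumerate(passages))
    return (
        "Answer using only the context below.\n\n"
        f"Context:\n{context}\n\n"
        f"Question: {query}\nAnswer:"
    )


def answer(query, documents, generate, k=2):
    """Retrieve supporting passages, then hand the grounded prompt to the generator."""
    passages = retrieve(query, documents, k)
    return generate(build_prompt(query, passages))


# Illustrative data only.
documents = [
    "BM25 is a ranking function based on term frequency.",
    "Dense retrievers encode queries and passages as vectors.",
    "Paris is the capital of France.",
]
```

Passing an echo function such as `lambda prompt: prompt` as `generate` returns the assembled prompt, which is a convenient way to inspect what the generator would receive before wiring in a real model.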

Related

RAG (Retrieval-Augmented Generation)

Enhances language model outputs by retrieving relevant documents before generating responses.

Generality: 774
Contextual Retrieval

A retrieval method that uses semantic context rather than exact keyword matching.

Generality: 591
IR (Information Retrieval)

Finding and ranking relevant documents from large collections in response to user queries.

Generality: 838
S2R (Speech-to-Retrieval)

Maps spoken audio directly to retrieval-ready representations, bypassing error-prone transcription pipelines.

Generality: 174
Text-to-Text Model

An AI model that transforms natural language input into natural language output.

Generality: 720
Base Model

A pre-trained model used as a starting point for task-specific adaptation.

Generality: 794