Skip to main content

Envisioning is an emerging technology research institute and advisory.

LinkedInInstagramGitHub

2011 — 2026

research
  • Reports
  • Newsletter
  • Methodology
  • Origins
  • Vocab
services
  • Research Sessions
  • Signals Workspace
  • Bespoke Projects
  • Use Cases
  • Signal Scanfree
  • Readinessfree
impact
  • ANBIMAFuture of Brazilian Capital Markets
  • IEEECharting the Energy Transition
  • Horizon 2045Future of Human and Planetary Security
  • WKOTechnology Scanning for Austria
audiences
  • Innovation
  • Strategy
  • Consultants
  • Foresight
  • Associations
  • Governments
resources
  • Pricing
  • Partners
  • How We Work
  • Data Visualization
  • Multi-Model Method
  • FAQ
  • Security & Privacy
about
  • Manifesto
  • Community
  • Events
  • Support
  • Contact
  • Login
ResearchServicesPricingPartnersAbout
ResearchServicesPricingPartnersAbout
  1. Home
  2. Vocab
  3. Data Mining

Data Mining

Automatically discovering patterns, correlations, and insights from large datasets.

Year: 1990Generality: 836
Back to Vocab

Data mining is the computational process of discovering meaningful patterns, correlations, anomalies, and insights within large datasets by combining techniques from statistics, machine learning, and database systems. Rather than testing predefined hypotheses, data mining is largely exploratory — algorithms scan through data to surface structure that was not explicitly sought. Core tasks include classification, clustering, regression, association rule learning, and anomaly detection, each suited to different types of questions and data structures. The process typically sits within the broader knowledge discovery in databases (KDD) pipeline, which encompasses data cleaning, integration, selection, transformation, mining, and interpretation of results.

In practice, data mining draws on a wide toolkit of methods. Decision trees and rule induction extract human-readable logic from labeled examples. Clustering algorithms such as k-means or DBSCAN group records by similarity without requiring labels. Association rule mining — exemplified by the Apriori algorithm — identifies co-occurrence patterns like market basket relationships. Dimensionality reduction techniques help manage high-dimensional data before applying these methods. The choice of algorithm depends heavily on the data type, volume, and the business or scientific question being addressed.

Data mining became practically significant in the 1990s as organizations began accumulating transactional and operational databases too large for manual analysis. Retail, banking, telecommunications, and healthcare were early adopters, using mined patterns for customer segmentation, fraud detection, churn prediction, and clinical risk stratification. The discipline helped establish that raw data, properly analyzed, could be a strategic asset rather than a storage burden.

Although modern machine learning has absorbed many data mining techniques and the terminology has partially merged, data mining retains a distinct emphasis on interpretability, scalability to structured relational data, and actionable business insight. It remains foundational to fields like business intelligence and data science, and its core methods continue to underpin production systems where explainability and computational efficiency matter as much as predictive accuracy.

Related

Related

Association Rule
Association Rule

A data mining technique that discovers co-occurrence patterns and relationships among items in large datasets.

Generality: 694
Data Analysis
Data Analysis

Systematic examination of datasets to extract patterns, insights, and actionable knowledge.

Generality: 928
Predictive Analytics
Predictive Analytics

Using historical data and statistical models to forecast future outcomes and behaviors.

Generality: 834
Unsupervised Learning
Unsupervised Learning

Machine learning that discovers hidden patterns in data without labeled examples.

Generality: 850
Clustering
Clustering

An unsupervised learning technique that groups similar data points together automatically.

Generality: 838
ML (Machine Learning)
ML (Machine Learning)

A paradigm where algorithms learn patterns from data rather than explicit programming.

Generality: 971