Models that learn decision boundaries between classes rather than modeling data distributions.
Discriminative models are a fundamental class of machine learning algorithms that learn to map inputs directly to outputs by modeling the conditional probability P(y|x) — the probability of a label given an input — without attempting to model how the input data itself was generated. Rather than building a full picture of the data distribution, these models focus exclusively on the boundary separating different classes or the relationship between features and targets. This targeted approach makes them highly efficient for supervised learning tasks where prediction accuracy is the primary goal.
Common examples of discriminative models include logistic regression, support vector machines (SVMs), conditional random fields (CRFs), and most modern deep neural networks used for classification. A neural network trained to classify images as cats or dogs, for instance, learns which pixel patterns reliably distinguish the two categories without ever modeling what a "typical" cat or dog image looks like in a generative sense. This contrasts with generative models, such as Naive Bayes or variational autoencoders, which learn the joint distribution P(x, y) and can in principle generate new data samples.
The practical advantage of discriminative models is their tendency to achieve higher classification accuracy on well-defined tasks, particularly when training data is abundant. Because they do not spend modeling capacity on the input distribution — which may be complex and irrelevant to the prediction task — they can allocate that capacity entirely to learning the decision boundary. This efficiency has made discriminative approaches the dominant paradigm in applied machine learning for tasks like spam filtering, sentiment analysis, speech recognition, and medical diagnosis.
The distinction between discriminative and generative models became a central organizing concept in machine learning research during the early 2000s, crystallized by influential work comparing the two approaches on classification benchmarks. With the rise of deep learning, the line has blurred somewhat — models like GANs combine both paradigms — but the discriminative framework remains foundational to understanding how most deployed classification systems operate.