Skip to main content

Envisioning is an emerging technology research institute and advisory.

LinkedInInstagramGitHub

2011 — 2026

research
  • Reports
  • Newsletter
  • Methodology
  • Origins
  • Vocab
services
  • Research Sessions
  • Signals Workspace
  • Bespoke Projects
  • Use Cases
  • Signal Scanfree
  • Readinessfree
impact
  • ANBIMAFuture of Brazilian Capital Markets
  • IEEECharting the Energy Transition
  • Horizon 2045Future of Human and Planetary Security
  • WKOTechnology Scanning for Austria
audiences
  • Innovation
  • Strategy
  • Consultants
  • Foresight
  • Associations
  • Governments
resources
  • Pricing
  • Partners
  • How We Work
  • Data Visualization
  • Multi-Model Method
  • FAQ
  • Security & Privacy
about
  • Manifesto
  • Community
  • Events
  • Support
  • Contact
  • Login
ResearchServicesPricingPartnersAbout
ResearchServicesPricingPartnersAbout
  1. Home
  2. Vocab
  3. Logistic Regression

Logistic Regression

A classification algorithm that models the probability of a binary outcome.

Year: 1958Generality: 838
Back to Vocab

Logistic regression is a foundational supervised learning algorithm used primarily for binary classification, where the goal is to predict whether an input belongs to one of two classes. Rather than outputting a raw continuous value, it models the probability that a given example belongs to the positive class by passing a linear combination of input features through the sigmoid (logistic) function. This S-shaped function squashes any real-valued number into the range [0, 1], producing an interpretable probability score. A decision threshold — typically 0.5 — then converts this probability into a discrete class label.

Training logistic regression means finding the optimal set of weights (coefficients) for the input features. This is done by maximizing the log-likelihood of the observed labels given the model's predictions, which is equivalent to minimizing binary cross-entropy loss. Unlike closed-form solutions available in linear regression, this optimization is typically performed iteratively using gradient descent or more sophisticated solvers such as L-BFGS. Regularization terms (L1 or L2) are commonly added to the loss function to prevent overfitting and encourage sparse or small-magnitude weights.

Despite its name, logistic regression is a classification algorithm, not a regression one. It generalizes naturally to multi-class problems through extensions like one-vs-rest (OvR) schemes or multinomial logistic regression (softmax regression), where a separate set of weights is learned for each class. The model's linear decision boundary makes it highly interpretable — each coefficient directly reflects the log-odds contribution of its corresponding feature — which is a significant advantage in domains like medicine, finance, and social science where explainability matters.

Logistic regression remains widely used in modern machine learning as a strong baseline, a building block in ensemble methods, and the output layer of neural networks for classification tasks. Its computational efficiency, probabilistic outputs, and interpretability make it a go-to choice when data is linearly separable or when model transparency is a priority. Understanding logistic regression also provides essential intuition for more complex models, including neural networks and generalized linear models.

Related

Related

Regression
Regression

A supervised learning approach that predicts continuous numerical outcomes from input variables.

Generality: 909
Log Odds
Log Odds

The logarithm of the odds ratio, linking probabilities to linear model outputs.

Generality: 694
Classification
Classification

A supervised learning task that assigns input data to predefined discrete categories.

Generality: 909
Logits
Logits

Raw, unnormalized scores output by a neural network before probability conversion.

Generality: 700
Classifier
Classifier

A machine learning model that assigns input data to predefined categories.

Generality: 875
Supervised Learning
Supervised Learning

Training models on labeled input-output pairs to predict or classify new data.

Generality: 900