A paradigm where algorithms learn patterns from data rather than explicit programming.
Machine learning is a branch of artificial intelligence in which computational systems improve their performance on tasks through experience rather than through hand-coded rules. Instead of a programmer specifying every decision a system should make, a machine learning model is exposed to data and adjusts its internal parameters to minimize errors or maximize a reward signal. This data-driven approach allows systems to generalize from examples, recognizing patterns that would be impractical or impossible to enumerate manually.
The field is broadly organized around three learning paradigms. In supervised learning, models train on labeled input-output pairs — for example, images tagged with object categories — and learn a mapping that generalizes to unseen examples. Unsupervised learning finds structure in unlabeled data, discovering clusters, embeddings, or latent factors without explicit guidance. Reinforcement learning takes a different approach entirely: an agent interacts with an environment, receives scalar reward signals, and learns a policy that maximizes cumulative reward over time. Many modern systems blend these paradigms, using self-supervised pretraining followed by supervised fine-tuning.
At its core, machine learning relies on optimization. A model — whether a linear function, a decision tree, or a deep neural network — has parameters that are tuned by minimizing a loss function over training data, typically using gradient-based methods. The choice of model architecture, loss function, regularization strategy, and training procedure collectively determine how well the learned model generalizes beyond its training distribution. Avoiding overfitting, where a model memorizes training data rather than learning transferable patterns, remains a central challenge.
Machine learning has become the dominant approach for problems in computer vision, natural language processing, speech recognition, drug discovery, and recommendation systems, among many others. The dramatic expansion of available data, increases in computational power, and algorithmic innovations — particularly the rise of deep learning — have transformed ML from an academic curiosity into infrastructure that underpins much of modern technology. Understanding its principles is now considered foundational for anyone working in data-intensive fields.