A learning paradigm where models generalize effectively from a single training example per class.
One-shot learning is a machine learning paradigm in which a model is trained to recognize or classify new concepts from just a single labeled example per class. This stands in sharp contrast to conventional deep learning approaches, which typically require thousands or millions of examples to achieve reliable performance. The challenge is fundamental: how can a model generalize meaningfully when it has seen almost nothing? One-shot learning addresses this by shifting the burden from raw data volume to smarter inductive biases, learned representations, and similarity-based reasoning.
The dominant approaches to one-shot learning fall into a few broad families. Metric-based methods, such as Siamese networks and matching networks, learn an embedding space where examples from the same class cluster together, allowing a new example to be classified by proximity to known prototypes. Model-based approaches use memory-augmented neural networks or recurrent architectures that can rapidly encode and retrieve new information. Optimization-based methods, such as MAML (Model-Agnostic Meta-Learning), train models to have initial parameters that can be fine-tuned to new tasks with minimal gradient steps. All of these strategies share a common thread: they are trained across many tasks so that learning to learn from few examples becomes the core competency.
One-shot learning is closely related to the broader field of few-shot learning, which generalizes the constraint to a small but not necessarily singular number of examples, and meta-learning, which frames the problem as learning how to learn efficiently. The distinction matters in practice: true one-shot settings are extremely demanding and often serve as a benchmark for the upper bound of sample efficiency.
The practical importance of one-shot learning is substantial. In medical imaging, rare conditions may yield only a handful of annotated scans. In robotics, a system may need to recognize a novel object after a single demonstration. In security and biometrics, face verification systems must authenticate individuals from minimal enrollment data. As AI is deployed in increasingly specialized and data-scarce domains, the ability to learn from minimal supervision becomes not just academically interesting but operationally essential.