A deep learning method that synthesizes photorealistic 3D scenes from 2D images.
Neural Radiance Fields (NeRF) is a technique that uses neural networks to represent and render complex 3D scenes from a sparse collection of 2D photographs. At its core, NeRF trains a multilayer perceptron to encode a scene as a continuous volumetric function: given a 3D spatial coordinate and a viewing direction as input, the network outputs the color and volume density at that point. By integrating these values along camera rays using classical volume rendering equations, NeRF can synthesize photorealistic images from arbitrary novel viewpoints — a task known as novel view synthesis.
The training process requires only a set of posed 2D images — photographs with known camera positions — and no explicit 3D geometry. The network learns the scene's geometry and appearance implicitly by minimizing the difference between rendered and observed pixel colors. This implicit representation allows NeRF to capture fine-grained details such as specular reflections, translucency, and complex lighting interactions that explicit mesh-based or point-cloud methods often struggle to reproduce faithfully. Positional encoding, which maps input coordinates into a higher-dimensional Fourier feature space, is a key ingredient that enables the network to represent high-frequency details.
NeRF's impact on computer vision and graphics has been substantial. It demonstrated that a simple MLP, trained on a single scene, could achieve reconstruction quality far surpassing prior methods. Since its introduction, hundreds of follow-up works have addressed its limitations — including slow training and rendering times, dynamic scenes, and generalization across scenes — producing variants such as Instant NGP, Mip-NeRF, and 3D Gaussian Splatting. These advances have accelerated adoption in applications ranging from virtual and augmented reality to film visual effects, robotics, and digital heritage preservation.
NeRF sits at the intersection of neural representation learning and classical rendering, and it has helped establish the broader paradigm of implicit neural representations — using neural networks as continuous, differentiable encodings of signals like shapes, textures, and occupancy fields. Its influence extends well beyond view synthesis, shaping how researchers think about scene understanding and 3D generative modeling.