<!-- This feature prevents the switcher from breaking the text and images on the mobile version. I think the switcher isn't interpreted as any simple embedded image would and it causes some formatting issues on this page. I'm sorry for weird solutions on this page, but I tested them to make sure they are working correctly. This one was painful. If something is not working after your edit, please refer to other comments and have a look at: Template:Float|Template:Clear|Template:Align|Template:Stack| - and other related templates.
Sorry for the mess and good luck!-->
<div style="float:right; width:300px; padding:3px; margin-left: 15px; border:3px double #99aecc; display:inline-block;">
</div>
The Cornell box is a test scene designed to evaluate the accuracy of rendering software by comparing a rendered image with a photograph of a real-world model under the same lighting conditions. It has become a commonly used 3D test model in computer graphics research.
The box was created by Cindy M. Goral, Kenneth E. Torrance, Donald P. Greenberg, and Bennett Battaile at the Cornell University Program of Computer Graphics as part of their research on radiosity and diffuse interreflection. Their findings were published in the paper Modeling the Interaction of Light Between Diffuse Surfaces, presented at SIGGRAPH '84.
Reference model
thumb|300px|Cornell box with three spheres illustrating different interactions of light and surfaces.
A physical model of the Cornell box is constructed and photographed using calibrated equipment. Photographic image references shared via Cornell University website were captured using a liquid-cooled Photometrics PXL1300L CCD camera with a precision of 12 bits. Seven narrow-band filters are employed to obtain a coarse sampling across the visible spectrum. To enhance accuracy, dark current is subtracted from the images, and flat-field correction is applied to compensate for cosine fall-off and lens fall-off. The precise settings of the scene are measured, including the emission spectrum of the light source, reflectance spectra of all surfaces, and the exact position and dimensions of objects, walls, light sources, and the camera.
A matching virtual 3D scene is created, and a digital image is generated during the rendering process for comparison with the reference photograph. The comparison helps evaluate the accuracy of rendering algorithms, particularly in handling global illumination, radiosity, and light transport. The Cornell box is designed to demonstrate diffuse interreflection. Light reflecting off the red and green walls subtly tints the adjacent white walls, demonstrating complex global illumination effects.
Scene configuration
The basic environment consists of:
- A single light source on the ceiling
- A green right wall
- A red left wall
- A white back wall, floor, and ceiling
Objects are commonly placed inside the box to study their interaction with light. The original configuration included two boxes, while subsequent versions introduced a reflective mirror sphere and a refractive glass sphere, commonly used in ray tracing research.
History
The original Cornell box
The original Cornell box was described by Cindy M. Goral, Kenneth E. Torrance, and Donald P. Greenberg in their 1984 paper titled Modeling the Interaction of Light Between Diffuse Surfaces, presented at SIGGRAPH '84. The hemi-cube technique allowed form factors to be calculated using scan conversion algorithms, which were supported by hardware at the time, and made it possible to calculate shadows from occluding objects inside the scene.
This version of the Cornell box was the first to feature objects placed inside — a short block on the left side, a tall block on the right side, and a light source in the center of the 'ceiling'. This configuration matches the scene data
| 320px
| Synthetic rendered image they demonstrated a method that extended light transfer simulations to handle complex reflectance properties beyond ideal diffuse or specular surfaces. This approach utilized spherical harmonic decomposition to encode bidirectional reflectance distribution functions (BRDFs) and directional intensity distributions, allowing for more accurate and efficient rendering of materials with complex reflectance characteristics. In this version of the Cornell box, the blocks are arranged in a flipped configuration, and the tall block features a mirror-like (aluminium) surface instead of a diffuse one. but few details about these changes are provided, and no documentation exists regarding the modifications. The website was created before web archiving services became widely available, meaning some updates may not be accessible or documented.
Synthetic image
thumb|Synthetic image reference contributed by [[Cornell University Program of Computer Graphics]]
A rendered image reference was uploaded by the Cornell University Program of Computer Graphics. The camera position and Cornell box configuration match the provided scene data, (e.g., the tall block has a highly specular surface) or the scene data (e.g., the camera position has been offset to reduce a distracting reflection on the tall block).
Seven monochromatic datasets were captured at different visible spectrum wavelengths. Using information obtained from the filenames, the captured wavelengths can be determined: 700 nm, 650 nm, 600 nm, 550 nm, 500 nm, 450 nm, and 400 nm. These datasets can be combined to create red, green, and blue channels, which are then used to produce a polychromatic image. One method to create these channels is by assigning the closest corresponding colors on the visible spectrum. The channels were assigned as follows:
- Red channel: 700 nm, 650 nm, 600 nm
- Green channel: 550 nm, 500 nm
- Blue channel: 450 nm, 400 nm
Basic corrections were not applied, so no adjustments were made to contrast, saturation, white balance, etc. No advanced corrections (such as lens corrections for chromatic aberration) were performed. Minimal tonemapping was applied, and no pixels were cropped. Defective pixels and capture errors were retouched, but the unedited version is also available. The same principles apply to the synthetic renders, which were generated with a slightly offset camera, as the precise data for this particular shot is not provided. The camera was not properly calibrated and is only vaguely aligned with the short block.
<div style="display: flex; justify-content: space-between; width: 100%; gap: 0px; flex-wrap: wrap;">
<div style="flex: 1; max-width: 900px; border: 1px double #99aecc; padding: 0px;">
</div>
<div style="flex: 1; max-width: 900px; border: 1px double #99aecc; padding: 0px;">
</div>
</div>
Scene data
thumb|left|100px|3D Cornell box model
The original geometry data supplied by Cornell University clearly defines the positions of the objects, with the tall block on the right and the short block on the left, as shown in the Hemi-cube form factors section. While 3D applications and algorithms may interpret axes differently, causing potential confusion, this does not affect the specific misconception in this case. Direct access to the original data is available, and although the correct configuration is understood, it's unclear why a mirrored version (flipped along one axis) is more prominently used.
This change was first observed in the 'Spherical harmonics' rendition of the box. Although the authors did not explicitly comment on this change, a plausible explanation could be that without the flipped layout and repositioned camera, the uneven reflection on the tall block would be too distracting. Additionally, if the green wall were visible in the reflection, it would not contrast as strongly as the red wall does on the white background. Repositioning the objects and camera could be a more efficient solution than recreating and repainting the entire box.
An accurate method for setting up the camera according to the original specifications is to set the focal length to 35mm and the sensor size (film gate) to 25mm. Alternative setup methods, including the camera's position, rotation, and other relevant details, are provided in the description of the 3D Cornell box model featured in this section. The alternative coordinates that could be used to reproduce the repositioned camera were never published.
Historical context and applications
The Cornell box was developed in the early 1980s as part of research into radiosity, one of the first rendering techniques capable of simulating diffuse interreflection. This work laid the foundation for physically based rendering methods, played a pivotal role in validating global illumination and inspired advancements in ray tracing methods.
Later, the Cornell box was adapted for evaluating newer methods such as photon mapping, introduced by Henrik Wann Jensen in the 1990s, which improved the simulation of caustics and indirect lighting. Modern applications of the Cornell box extend to testing Monte Carlo path tracing, machine learning-based rendering techniques and other advanced approaches to rendering. It is also frequently used as a benchmark and remains an essential test scene in many rendering engines like Blender, Unreal Engine, and Arnold.
Since its creation in the previous millennium, many companies have developed their own updated and higher-quality references for internal use, as advancements in technology have allowed for more detailed and accurate measurements. Although primarily used for computer graphics, variations of the Cornell box have also been employed in acoustics research to model sound reflections and validate simulation methods.
See also
- Utah teapot
- Stanford bunny
- Stanford dragon
- List of common 3D test models
- Path tracing
- 3D modeling
