The Camera Culture group is building new tools to better capture and share visual information. What will a camera look like in ten years? How should we change the camera to improve mobile photography? How will a billion networked and portable cameras change the social culture? We exploit unusual optics, novel illumination, and emerging sensors to build new capture devices and develop associate algorithms.
Research Projects
6D Display
Ramesh Raskar, Martin Fuchs, Hans-Peter Seidel, and Hendrik P. A. LenschIs it possible to create passive displays that respond to changes in viewpoint and incident light conditions? Holograms and 4D displays respond to changes in viewpoint. 6D displays respond to changes in viewpoint as well as surrounding light. We encode the 6D reflectance field into an ordinary 2D film. These displays are completely passive and do not require any power. Applications include novel instruction manuals and mood lights.
BiDi Screen
Henry Holtzman, Matt Hirsch, Douglas Lanman and Ramesh RaskarThe BiDi Screen is an example of a new type of thin I/O device that possesses the ability both to capture images and display them. Scene depth can be derived from BiDi Screen imagery, allowing for 3D gestural and 2D multi-touch interfaces. This bidirectional screen extends the latest trend in LCD devices, which has seen the incorporation of photo-transistors into every display pixel. Using a novel optical masking technique developed at the Media Lab, the BiDi Screen can capture light field-like quantities, unlocking a wide array of applications from 3D gesture and touch interaction with CE devices, to seamless video communication.
Bokode: Imperceptible Visual Tags for Camera-Based Interaction from a Distance
Ramesh Raskar, Ankit Mohan, Grace Woo, Shinsaku Hiura and Quinn SmithwickWith over a billion people carrying camera-phones worldwide, we have a new opportunity to upgrade the classic bar code to encourage a flexible interface between the machine world and the human world. Current bar codes must be read within a short range and the codes occupy valuable space on products. We present a new, low-cost, passive optical design so that bar codes can be shrunk to fewer than 3mm and can be read by unmodified ordinary cameras several meters away.
CATRA: Mapping of Cataract Opacities Through an Interactive Approach
Ramesh Raskar, Vitor Pamplona, Erick Passos, Jan Zizka, Jason Boggess, David Schafran, Manuel M. Oliveira, Everett Lawson, and Estebam CluaWe introduce a novel interactive method to assess cataracts in the human eye by crafting an optical solution that measures the perceptual impact of forward scattering on the foveal region. Current solutions rely on highly trained clinicians to check the back scattering in the crystallin lens and test their predictions on visual acuity tests. Close-range parallax barriers create collimated beams of light to scan through sub-apertures scattering light as it strikes a cataract. User feedback generates maps for opacity, attenuation, contrast, and local point-spread functions. The goal is to allow a general audience to operate a portable, high-contrast, light-field display to gain a meaningful understanding of their own visual conditions. The compiled maps are used to reconstruct the cataract-affected view of an individual, offering a unique approach for capturing information for screening, diagnostic, and clinical analysis.
Coded Computational Photography
Jaewon Kim, Ahmed Kirmani, Ankit Mohan and Ramesh RaskarComputational photography is an emerging multi-disciplinary field that is at the intersection of optics, signal processing, computer graphics and vision, electronics, art, and online sharing in social networks. The first phase of computational photography was about building a super-camera that has enhanced performance in terms of the traditional parameters, such as dynamic range, field of view, or depth of field. We call this 'Epsilon Photography.' The next phase of computational photography is building tools that go beyond the capabilities of this super-camera. We call this 'Coded Photography.' We can code exposure, aperture, motion, wavelength, and illumination. By blocking light over time or space, we can preserve more details about the scene in the recorded single photograph.
Compressed Sensing
Ramesh Raskar and Kshitij MarwahResearch in computer vision is riding a new tide called compressive sensing. Carefully designed capture methods exploit the sparsity of the underlying signal in a transformed domain to reduce the number of measurements and use an appropriate reconstruction method. Traditional progressive methods capture successively more detail using sequence of simple projection basis whereas random projections do not use any sequence except l0 minimization for reconstruction which is computationally in-efficient. Here, we question this new tide and claim for most situations simple methods work better and the best projective method would be in between the two extremes.
Compressive Sensing for Visual Signals
Ramesh Raskar, Kshitij Marwah and Ashok Veeraraghavan (MERL)Research in computer vision is riding a new tide called compressive sensing. Carefully designed capture methods exploit the sparsity of the underlying signal in a transformed domain to reduce the number of measurements and use an appropriate reconstruction method. Traditional progressive methods capture successively more detail using sequence of simple projection basis whereas random projections do not use any sequence except l0 minimization for reconstruction which is computationally in-efficient. Here, we question this new tide and claim for most situations simple methods work better and the best projective method would be in between the two extremes.
HR3D: Glasses-Free 3DTV
Douglas Lanman, Matthew Hirsch, Yunhee Kim, and Ramesh RaskarFor 3D displays to be successful, they must be bright enough to compete with 2D displays and not diminish display resolution. To date, stacked-LCD displays have employed parallax barriers, which use pinhole or stripe patterns to provide view-dependent imagery. We show a prototype that adapts the imagery on both layers to multi-view 3D content, increasing brightness while maintaining display resolution. This promises a future of devices with sharp 2D screens and 3D displays with full horizontal and vertical parallax.
Layered 3D: Glasses-Free 3D Printing
Gordon Wetzstein, Douglas Lanman, Matthew Hirsch, Wolfgang Heidrich, and Ramesh RaskarWe develop tomographic techniques for image synthesis on displays composed of compact volumes of light-attenuating material. Such volumetric attenuators recreate a 4D light field or high-contrast 2D image when illuminated by a uniform backlight. Since arbitrary views may be inconsistent with any single attenuator, iterative tomographic reconstruction minimizes the difference between the emitted and target light fields, subject to physical constraints on attenuation. For 3D displays, spatial resolution, depth of field, and brightness are increased, compared to parallax barriers. We conclude by demonstrating the benefits and limitations of attenuation-based light field displays using an inexpensive fabrication method: separating multiple printed transparencies with acrylic sheets.
LensChat: Sharing Photos with Strangers
Ramesh Raskar, Rob Gens and Wei-Chao ChenWith networked cameras in everyone's pockets, we are exploring the practical and creative possibilities of public imaging. LensChat allows cameras to communicate with each other using trusted optical communications, allowing users to share photos with a friend by taking pictures of each other, or borrow the perspective and abilities of many cameras.
Looking Around Corners
Ramesh Raskar, Andrew Bardagjy, Otkrist Gupta, Andreas Velten, Di Wu, and Moungi BawendiUsing a femtosecond laser and a camera with one trillion frames per second, we can capture movies of light as it moves through a scene, gets trapped inside a tomato, or bounces off the surfaces in a bottle of water. We use this ability to see the time of flight and to reconstruct images of objects that our camera can not see directly (i.e., to look around the corner).
NETRA: Smartphone Add-On for Eye Tests
Vitor Pamplona, Manuel Oliveira, Erick Passos, Ankit Mohan, David Schafran, Jason Boggess and Ramesh RaskarCan a person look at a portable display, click on a few buttons, and recover his refractive condition? Our optometry solution combines inexpensive optical elements and interactive software components to create a new optometry device suitable for developing countries. The technology allows for early, extremely low-cost, mobile, fast, and automated diagnosis of the most common refractive eye disorders: myopia (nearsightedness), hypermetropia (farsightedness), astigmatism, and presbyopia (age-related visual impairment). The patient overlaps lines in up to eight meridians and the Android app computes the prescription. The average accuracy is comparable to the prior art—and in some cases, even better. We propose the use of our technology as a self-evaluation tool for use in homes, schools, and at health centers in developing countries, and in places where an optometrist is not available or is too expensive.
Polarization Fields: Glasses-Free 3DTV
Douglas Lanman, Gordon Wetzstein, Matthew Hirsch, Wolfgang Heidrich, and Ramesh RaskarWe introduce polarization field displays as an optically efficient design for dynamic light field display using multi-layered LCDs. Such displays consist of a stacked set of liquid crystal panels with a single pair of crossed linear polarizers. Each layer is modeled as a spatially controllable polarization rotator, as opposed to a conventional spatial light modulator that directly attenuates light. We demonstrate that such displays can be controlled, at interactive refresh rates, by adopting the SART algorithm to tomographically solve for the optimal spatially varying polarization state rotations applied by each layer. We validate our design by constructing a prototype using modified off- the-shelf panels. We demonstrate interactive display using a GPU-based SART implementation supporting both polarization-based and attenuation-based architectures.
Reflectance Acquisition Using Ultrafast Imaging
Ramesh Raskar, Nikhil Naik and Andreas VeltenWe demonstrate a new technique that allows a camera to rapidly acquire reflectance properties of objects 'in the wild' from a single viewpoint, over relatively long distances and without encircling equipment. This project has a wide variety of applications in computer graphics including image relighting, material identification, and image editing.
Second Skin: Motion Capture with Actuated Feedback for Motor Learning
Ramesh Raskar, Kenichiro Fukushi and Jan ZizkaWe have created a 3D motion-tracking system with an automatic, real-time vibrotactile feedback with an assembly of photo-sensors, infrared projector pairs, vibration motors, and wearable suit. This system allows us to enhance and quicken the motor learning process in variety of fields such as healthcare (physiotherapy), entertainment (dance), and sports (martial arts).
Shield Field Imaging
Jaewon KimWe present a new method for scanning 3D objects in a single shot, shadow-based method. We decouple 3D occluders from 4D illumination using shield fields: the 4D attenuation function which acts on any light field incident on an occluder. We then analyze occluder reconstruction from cast shadows, leading to a single-shot light field camera for visual hull reconstruction.
Slow Display
Daniel Saakes, Kevin Chiu, Tyler Hutchison, Biyeun Buczyk, Naoya Koizumi and Masahiko InamiHow can we show our 16 megapixel photos from our latest trip on a digital display? How can we create screens that are visible in direct sunlight as well as complete darkness? How can we create large displays that consume less than 2W of power? How can we create design tools for digital decal application and intuitive-computer aided modeling? We introduce a display that is high resolution but updates at a low frame rate, a slow display. We use lasers and monostable light-reactive materials to provide programmable space-time resolution. This refreshable, high resolution display exploits the time decay of monostable materials, making it attractive in terms of cost and power requirements. Our effort to repurpose these materials involves solving underlying problems in color reproduction, day- night visibility, and optimal time sequences for updating content.
Soundaround
Henry Holtzman, Ramesh Raskar, Matt Hirsch, Alex Olwal and Thomas A. BaranRecently, multi-view display hardware has made compelling progress in graphics. Soundaround is a multi-viewer interactive audio system, designed to be integrated into unencumbered multi-view display systems, presenting localized audio/video channels with no need for glasses or headphones. Our technical work describes a framework for the design of multi-viewer interactive audio systems that is general and supports optimization of the system for multiple observation planes and room responses.
SpeckleSense
Jan Zizka, Alex Olwal, Andrew Bardagjy and Ramesh RaskarMotion sensing is of fundamental importance for user interfaces and input devices. In applications where optical sensing is preferred, traditional camera-based approaches can be prohibitive due to limited resolution, low frame rates, and the required computational power for image processing. We introduce a novel set of motion-sensing configurations based on laser speckle sensing that are particularly suitable for human-computer interaction. The underlying principles allow these configurations to be fast, precise, extremely compact, and low cost.
Theory Unifying Ray and Wavefront Lightfield Propagation
George Barbastathis, Ramesh Raskar, Belen Masia, Se Baek Oh and Tom CuypersThis work focuses on bringing powerful concepts from wave optics to the creation of new algorithms and applications for computer vision and graphics. Specifically, ray-based, 4D lightfield representation, based on simple 3D geometric principles, has led to a range of new applications that include digital refocusing, depth estimation, synthetic aperture, and glare reduction within a camera or using an array of cameras. The lightfield representation, however, is inadequate to describe interactions with diffractive or phase-sensitive optical elements. Therefore we use Fourier optics principles to represent wavefronts with additional phase information. We introduce a key modification to the ray-based model to support modeling of wave phenomena. The two key ideas are "negative radiance" and a "virtual light projector." This involves exploiting higher dimensional representation of light transport.
Trillion Frames Per Second Camera
Ramesh Raskar, Andreas Velten, Everett Lawson, Di Wu and Moungi BawendiWe have developed a camera that captures movies at an effective capture rate of a trillion frames per second. In one frame of our movie, light moves only about 0.6 mm. We can observe pulses of light as they propagate through a scene. We use this information to understand how images are composed and to learn things about a scene that are invisible to a regular camera.
Vision on Tap
Ramesh Raskar and Kevin ChiuComputer vision is a class of technologies that lets computers use cameras to automatically stitch together panoramas, reconstruct 3-D geometry from multiple photographs, and even tell you when the water's boiling. For decades, this technology has been advancing mostly within the confines of academic institutions and research labs. Vision on Tap is our attempt to bring computer vision to the masses.
VisionBlocks
Abhijit Bendale, Kshitij Marwah and Jason R BoggessVisionBlocks is an on-demand, in-browser, customizable, computer-vision application-building platform for the masses. Even without any prior programming experience, users can create and share computer vision applications. End-users drag and drop computer vision processing blocks to create their apps. The input feed could be either from a user's webcam or a video from the Internet. VisionBlocks is a community effort where researchers obtain fast feedback, developers monetize their vision applications, and consumers can use state-of-the-art computer vision techniques. We envision a Vision-as-a-Service (VaaS) over-the-web model, with easy-to-use interfaces for application creation for everyone.