
Diffusion-based three-dimensional reconstruction of complex surface using monocular vision

Open Access

Abstract

Three-dimensional (3D) reconstruction based on optical diffusion has certain significant advantages, such as its capacity for high-precision depth estimation with a small lens, distant-object depth estimation, a monocular vision basis, and no required camera or scene adjustment. However, few mathematical models to relate the depth information acquired using this technique to the basic principles of intensity distribution during optical diffusion have been proposed. In this paper, the heat diffusion equation of physics is applied in order to construct a mathematical model of the intensity distribution during optical diffusion. Hence, a high-precision 3D reconstruction method with optical diffusion based on the heat diffusion equation is proposed. First, the heat diffusion equation is analyzed and an optical diffusion model is introduced to explain the basic principles of the diffusion imaging process. Second, the novel 3D reconstruction method based on global heat diffusion is proposed, which incorporates the relationship between the depth information and the degree of diffusion. Finally, a simulation involving synthetic images and an experiment using five playing cards are conducted, with the results confirming the effectiveness and feasibility of the proposed method.

© 2015 Optical Society of America

1. Introduction

The three-dimensional (3D) reconstruction of a complex surface in computer vision is achieved by constructing a mapping function between the depth and the brightness information of 2D images. In recent years, various 3D reconstruction methods have been developed, including the depth from stereo (DFS), depth from focus (DFF), and depth from defocus (DFD) techniques, which have been investigated using real-world applications [1].

DFS estimates depth using two images of the same scene, which are captured by cameras at different positions and with different orientations [2,3]. However, as this technique requires that feature points of the two images be extracted and matched, the computational cost of DFS is too large for real-time applications. In contrast, DFF depth estimation uses a mapping relation between focus and depth. A sequence of images with different depths is obtained, the degree of focus is determined using a measurement operator [4,5], and the desired depth is obtained where the measurement value is maximal or minimal. Compared to DFS, DFF is simple in principle, but its estimation accuracy is highly dependent on the number of acquired images and the sensitivity of the measurement operator [6,7]. Finally, DFD, which was developed by Pentland [8], measures the degree of blurring of two defocused images, and then estimates depth using a point spread function, such as a Gaussian function. DFD has proven to be an effective depth reconstruction method for the following reasons: 1) only two defocused images of a scene are necessary; 2) matching and masking are not required; 3) it is effective in both the frequency and spatial domains [9–14].

However, although DFD is comparatively mature in macro-scale applications, some problems still occur when it is used in certain real-world applications. These problems are summarized as follows: 1) To improve the DFD reconstruction precision, the most direct approach is to increase the lens aperture. However, fabricating a large-aperture lens for real-world applications is time consuming and expensive. Moreover, when the aperture of a lens increases, its depth of field decreases accordingly, and the degree of blurring of the defocused image becomes difficult to measure. In addition, when a large aperture is used, the considerable differences between the focused image and its corresponding defocused image may cause the realism of the scene to be lost. 2) The DFD depth sensitivity is inversely proportional to the square of the object distance. Further, loss of depth sensitivity is unavoidable when a high-resolution camera is used to improve the depth-estimation precision. However, in many scenarios, it is necessary to place objects far from the camera in order to achieve a reasonable field of view. 3) In DFD techniques, it is necessary to adjust some camera parameters or the distance between the object and the camera in order to capture two defocused images under monocular vision. However, in some applications, adjusting the camera parameters damages the camera, and moving the camera or the object during depth reconstruction is inconvenient.

In order to overcome these difficulties, a surface reconstruction method that can ensure both depth reconstruction precision and depth sensitivity with a small-aperture lens under monocular vision is needed. Therefore, the use of optical diffusers in optical imaging has recently been considered, and a technique known as “depth reconstruction from diffusion” (DRFD) has been proposed, which is designed to reconstruct the surface of a scene. A diffuser is a light-scattering optical element that is widely used to soften and shape light in illumination and display applications [15,16]. This device converts an incident ray into a cluster of scattered rays. Therefore, when it is placed in front of an object, the captured image is blurred and has a similar appearance to a defocused image; this can be seen in Fig. 1, which shows a diffused image of a wrinkled newspaper. A diffused image can be formulated as a convolution between a radiance image and a diffusion blur kernel with a locally constant diffusion angle, with the diffusion blur kernel being determined by both the diffusion function and the object-to-diffuser distance. If the diffusion function or diffusion angle is known, it is possible to calculate the object-to-diffuser distance. This is the core principle of the DRFD technique.

Fig. 1 Diffusion theory and a diffused image.

Compared to traditional DFD, DRFD has significant advantages, such as its capacity for high-precision depth estimation with a small lens, distant-object depth estimation, a monocular vision basis, no required camera or scene adjustment, and reduced sensitivity to lens aberrations [17–19]. However, since Zhou et al. first proposed the original concept of DRFD and noted that its basic principle is analogous to that of conventional DFD [19], this topic has been only minimally researched. Further, few mathematical models relating depth information to the basic principles of intensity distribution during optical diffusion have been proposed.

In this paper, a mathematical model of optical diffusion is constructed and a high-precision 3D reconstruction method with optical diffusion is proposed, which is based on the heat diffusion equation in physics.

Our present approach is novel in several respects and provides a fast, noninvasive, and precise 3D measurement. First, the heat diffusion equation of physics is analyzed and an optical diffusion model is introduced to explain the basic principles of the diffusion imaging process. Second, we consider an image obtained without an optical diffuser as constituting a diffused version of the diffused image acquired when an optical diffuser is placed in front of the scene. By appropriately choosing the reference image at each spatial location, such optical diffusion is always in the forward direction, and the degree of diffusion depends on the depth of the scene at that location. This diffusion is independent of the camera parameters, which means that adjustment of the aperture size or object distance is not required. Finally, the heat diffusion equation is applied in order to construct a mathematical model of the intensity distribution during optical diffusion. Hence, a high-precision DRFD based on the heat diffusion equation is proposed.

The contents of this paper are organized as follows. First, in Section 2, the heat diffusion equation of physics is analyzed, and an optical diffusion model is introduced to explain the basic principle of the diffusion imaging process. Second, a novel 3D reconstruction method based on the diffusion model is proposed in Section 3. Subsequently, in Section 4, the experimental results and error analysis based on evaluations of the new method are given. Finally, Section 5 presents the conclusion of this paper.

2. Imaging model of optical diffusion

2.1 Heat diffusion in physics

In physics, heat diffusion in most fluids and in some homogeneous solid materials, such as gels, proceeds identically in every direction; this is called isotropic heat diffusion and is characterized by a single diffusion coefficient a.

Given the concentration u and the flux J, Fick's first law gives a relationship between the flux and the concentration gradient as,

J(x,y,t) = -a \nabla u(x,y,t) \qquad (1)
where x and y are the horizontal and vertical coordinates of a diffusion point, respectively; the diffusion coefficient a controls the intensity of diffusion and is nonnegative; t is the diffusion time; and ∇ denotes the gradient operator, ∇ = [∂/∂x, ∂/∂y]^T.

Then, given the conservation of mass, the continuity equation relating the time derivative of concentration to the divergence of flux can be denoted as,

\frac{\partial u(x,y,t)}{\partial t} = -\nabla \cdot J(x,y,t) \qquad (2)
where ∇· is the divergence operator, ∇· = ∂/∂x + ∂/∂y.

Putting Eq. (1) and Eq. (2) together, we obtain the diffusion equation shown as,

\frac{\partial u(x,y,t)}{\partial t} = a \nabla^2 u(x,y,t) \qquad (3)

Therefore, the isotropic heat diffusion model can be denoted as,

\begin{cases} \dfrac{\partial u(x,y,t)}{\partial t} = a \nabla^2 u(x,y,t) \\ u(x,y,0) = u_0(x,y) \end{cases} \qquad (4)
where u_0(x, y) is the initial condition.

If the diffusion coefficient a varies spatially, the isotropic heat diffusion becomes inhomogeneous heat diffusion, whose model is given by the following equation,

\begin{cases} \dfrac{\partial u(x,y,t)}{\partial t} = \nabla \cdot \big( a(x,y) \nabla u(x,y,t) \big) \\ u(x,y,0) = u_0(x,y) \end{cases} \qquad (5)

Inhomogeneous heat diffusion arises from the physical phenomenon of heat diffusion, and it controls the diffusion intensity at every point through the spatially varying diffusion coefficient. Therefore, as the inhomogeneous diffusion proceeds, regions with low density contrast become smoother, while regions with high density contrast are preserved. Owing to this property, the diffusion equation has recently been used in image processing, for example in image enhancement. In addition, optical diffusers, as hardware that realizes optical diffusion, are designed to shape and soften incident illumination in computer vision.
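To make the role of the spatially varying coefficient concrete, the following minimal numerical sketch integrates Eq. (5) with an explicit finite-difference scheme on a unit grid with zero-flux borders; the function name, time step, and stability bound are illustrative choices rather than values from the paper.

```python
import numpy as np

def diffuse_inhomogeneous(u0, a, dt=0.1, steps=100):
    """Explicit scheme for du/dt = div(a(x, y) grad u), Eq. (5), on a unit grid.
    Replicated borders give zero flux across the image boundary.
    Heuristic stability bound: dt * a.max() <= 0.25."""
    u = u0.astype(float).copy()
    # face-averaged coefficients (computed once, since a does not change in time)
    ap = np.pad(a, 1, mode='edge')
    ac = ap[1:-1, 1:-1]
    a_n = 0.5 * (ac + ap[:-2, 1:-1])
    a_s = 0.5 * (ac + ap[2:, 1:-1])
    a_w = 0.5 * (ac + ap[1:-1, :-2])
    a_e = 0.5 * (ac + ap[1:-1, 2:])
    for _ in range(steps):
        up = np.pad(u, 1, mode='edge')
        c = up[1:-1, 1:-1]
        n, s = up[:-2, 1:-1], up[2:, 1:-1]
        w, e = up[1:-1, :-2], up[1:-1, 2:]
        # divergence of the face fluxes a * grad u
        u = c + dt * (a_e * (e - c) - a_w * (c - w)
                      + a_s * (s - c) - a_n * (c - n))
    return u
```

With a constant coefficient, the same routine reduces to the isotropic model of Eq. (4); it is reused in the cost-energy sketch of Section 3.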

2.2. Imaging model for an optical diffuser

In the typical configuration, the incident illumination hits an optical diffuser on the patterned side of the substrate; a small part of it is reflected at the surface, while the majority penetrates into the substrate. Upon exiting the substrate, the light refracts at the surface, achieving the designed energy distribution or light shape. Figure 2 shows the geometry of a diffusion process and the energy distribution of a laser source, where θ is the diffusion angle. Therefore, optical diffusers can precisely shape, control, and distribute light into a cone angle (as shown in Fig. 3), which can be either symmetrical (circular) or asymmetrical (elliptical). This property is particularly important for a diffuser with a large angle, as it reduces the effects of total internal reflection and allows the diffuser to work as intended. The scattering profile of the energy passing through an optical diffuser is called its point spread function (PSF).

Fig. 2 Geometry of a diffusion process and its energy distribution.

Fig. 3 Different diffusion forms (uniform, batwing and Gaussian).

Therefore, if an optical diffuser is placed between a source point and a pinhole camera, parallel to the image plane, the light from the source point is scattered because of optical diffusion, and the captured image of the source point is a round spot with a radius of b, as shown in Fig. 4, where u is the distance of the object from the principal plane; v is the distance of the focused image from the lens plane; U is the distance between the diffuser and the principal plane; Z is the distance between the object and the diffuser; AB is the diffusion size; α is the field angle; and θ is the diffusion angle of the optical diffuser.

Fig. 4 Geometry of diffusion in a pinhole camera.

As shown in Fig. 4, the light from an arbitrary source point P is scattered by the diffuser. Owing to the limit imposed by the diffusion angle θ, only the light scattered from a specific region AB can reach the pinhole O. From the viewpoint of the pinhole, the segment AB on the diffuser plane appears in place of the actual point P. According to the principle of optical diffusion, the radius of the diffused spot resulting from optical diffusion is [19],

b = \frac{v}{U} \cdot \frac{AB}{2} = \frac{v \tan\theta}{\cos^2\alpha} \cdot \frac{Z}{Z+U} \qquad (6)
From Eq. (6), it can be seen that with a given U, the diffusion size AB is determined by both the object-to-diffuser distance Z and the diffusion angle θ.
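As a small numerical illustration of Eq. (6) (a sketch only; the function name and sample values are illustrative), the spot radius b grows monotonically with the object-to-diffuser distance Z and saturates at v tanθ/cos²α as Z becomes large:

```python
import numpy as np

def blur_radius(Z, U, v, theta_deg, alpha_deg=0.0):
    """Diffused-spot radius b of Eq. (6), in the same units as v."""
    theta = np.deg2rad(theta_deg)
    alpha = np.deg2rad(alpha_deg)
    return v * np.tan(theta) / np.cos(alpha) ** 2 * Z / (Z + U)

# b increases with Z for a fixed diffuser-to-pinhole distance U
for Z in (0.0, 50.0, 200.0, 1000.0):
    print(Z, blur_radius(Z, U=100.0, v=12.2, theta_deg=10.0))
```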

Consider a scene with a smooth Lambertian surface. We take the image of the scene from a single point of view and assume that the scene and the illumination are static with respect to the camera. Under these conditions, we can represent the surface of the scene with a depth map Z and the radiance of the scene with a function r. If we use a real-aperture camera and a diffuser with a diffusion angle of θ, the irradiance I measured on the image plane can be approximated via the following equation,

I(x_1, y_1) = \iint h(x_1, y_1, x, y, b)\, r(x, y)\, dx\, dy \qquad (7)
where h is the PSF, and (x, y) and (x_1, y_1) are the horizontal and vertical coordinates of two different diffusion points.

An important case that we will consider is that of a scene consisting of an equifocal plane, that is, a plane parallel to the image plane. In this case, the depth map satisfies Z(x, y) = Z, and the PSF h is shift-invariant, that is, h(x, y, x_1, y_1, b) = h(x − x_1, y − y_1, b). Hence, the image formation model in Eq. (7) becomes the following simple convolution,

I = h(\cdot, b) * r \qquad (8)

For most optical diffusers, the PSF can be denoted as a Gaussian function,

h(x, y, x_1, y_1) = \frac{1}{2\pi\sigma^2} \exp\!\left( -\frac{(x - x_1)^2 + (y - y_1)^2}{2\sigma^2} \right) \qquad (9)
where σ denotes the spread parameter of the Gaussian kernel, and the standard deviation σ = γb, for a certain constant γ > 0 that can be determined via a calibration procedure. More generally, one can approximate the PSF with other functions, as long as they satisfy the following normalization property,

\iint h(x_1, y_1, x, y, b)\, dx_1\, dy_1 = 1 \qquad (10)
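As a concrete discrete counterpart (a sketch; the grid size and truncation radius are illustrative), the Gaussian PSF of Eq. (9) can be sampled on a finite grid and renormalized so that it satisfies the discrete analogue of the normalization in Eq. (10):

```python
import numpy as np

def gaussian_psf(sigma, radius=None):
    """Discrete Gaussian PSF of Eq. (9) on a (2r+1) x (2r+1) grid,
    renormalized so that it sums to one (discrete form of Eq. (10))."""
    if radius is None:
        radius = int(np.ceil(3 * sigma))
    y, x = np.mgrid[-radius:radius + 1, -radius:radius + 1]
    h = np.exp(-(x ** 2 + y ** 2) / (2.0 * sigma ** 2)) / (2.0 * np.pi * sigma ** 2)
    return h / h.sum()

h = gaussian_psf(sigma=2.5)
print(h.sum())   # 1.0 up to floating-point error
```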

When the depth map Z is an equifocal plane and the PSF is approximated by a shift-invariant Gaussian, the image model in Eq. (7) can be formulated in terms of the heat equation (see [20–22]),

\begin{cases} \dot{u}(x,y,t) = a \Delta u(x,y,t), & a \in [0,\infty),\ t \in (0,\infty) \\ u(x,y,0) = r(x,y) \end{cases} \qquad (11)
where r(x, y) is the radiance image without diffusion, and the solution u at a time t = τ plays the role of an image I(x, y) = u(x, y, τ) captured with a certain setting that is related to τ. The dot denotes differentiation in time, that is, \dot{u} = ∂u/∂t; Δ denotes the Laplacian operator, Δu = ∂²u/∂x² + ∂²u/∂y².

The variance σ² is related to the diffusion coefficient a via,

\sigma^2 = 2ta \qquad (12)

In order to verify Eq. (12), we first substitute Eq. (12) into Eq. (9) and obtain the PSF as,

h(x, y, x_1, y_1, t) = \frac{1}{4\pi t a} \exp\!\left( -\frac{(x - x_1)^2 + (y - y_1)^2}{4ta} \right) \qquad (13)

Then, the diffused image at time t can be denoted as,

I(x_1, y_1, t) = \left( \frac{1}{4\pi t a} \exp\!\left( -\frac{(x - x_1)^2 + (y - y_1)^2}{4ta} \right) \right) * r(x, y) \qquad (14)

Finally, substituting Eq. (14) into Eq. (11) shows that Eq. (14) is a solution of Eq. (11). Therefore, the relationship in Eq. (12) is reasonable.
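This equivalence can also be checked numerically. The sketch below (helper name and test image are illustrative) integrates the isotropic heat equation of Eq. (11) with explicit Euler steps and compares the result with a Gaussian blur of standard deviation σ = √(2ta) from Eq. (12); the two agree up to discretization and boundary effects.

```python
import numpy as np
from scipy.ndimage import gaussian_filter

def heat_steps(r, a, t, dt=0.05):
    """Explicit Euler integration of du/dt = a * Laplacian(u), Eq. (11)."""
    u = r.astype(float).copy()
    for _ in range(int(round(t / dt))):
        up = np.pad(u, 1, mode='edge')
        lap = (up[:-2, 1:-1] + up[2:, 1:-1]
               + up[1:-1, :-2] + up[1:-1, 2:] - 4.0 * u)   # 5-point Laplacian
        u = u + dt * a * lap
    return u

rng = np.random.default_rng(0)
r = gaussian_filter(rng.random((64, 64)), 2.0)   # a smooth synthetic radiance image
a, t = 0.5, 4.0
u_pde = heat_steps(r, a, t)
u_gauss = gaussian_filter(r, sigma=np.sqrt(2 * t * a), mode='nearest')   # Eq. (12)
print(np.abs(u_pde - u_gauss).max())   # small: the Gaussian of Eq. (14) solves Eq. (11)
```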

When the distance map Z is not an equifocal plane, the PSF is in general shift varying. The equivalence with the isotropic heat equation does not hold, and the diffusion process can be formulated in terms of the inhomogeneous diffusion equation as,

\begin{cases} \dot{u}(x,y,t) = \nabla \cdot \big( a(x,y) \nabla u(x,y,t) \big), & t \in (0,\infty) \\ u(x,y,0) = r(x,y) \end{cases} \qquad (15)

By assuming that the surface Z is smooth, we can again relate the diffusion coefficient a to the space-varying variance σ² via,

\sigma^2(x,y) = 2t\, a(x,y) \qquad (16)

Therefore, the imaging process with optical diffusion can be formulated in terms of the heat equation, and the solution of the diffusion equation can be obtained as the convolution of the image with a temporally evolving Gaussian kernel. Since σ = γb, it is immediate to see that a(x, y) encodes the depth map Z of the scene via,

a(x,y) = \frac{\sigma^2(x,y)}{2t} = \frac{\gamma^2 v^2 \tan^2\theta}{2t \cos^4\alpha} \left( \frac{Z}{Z+U} \right)^2 \qquad (17)
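In an implementation, Eq. (17) is the step that turns an assumed depth map into a diffusion-coefficient map. A minimal sketch (the function name and the default α = 0 are illustrative assumptions) is:

```python
import numpy as np

def diffusion_coefficient(Z, U, v, theta_deg, gamma, t, alpha_deg=0.0):
    """Map a depth value or depth map Z to the diffusion coefficient a of Eq. (17)."""
    theta = np.deg2rad(theta_deg)
    alpha = np.deg2rad(alpha_deg)
    k = (gamma ** 2) * (v ** 2) * np.tan(theta) ** 2 / (2.0 * t * np.cos(alpha) ** 4)
    Z = np.asarray(Z, dtype=float)
    return k * (Z / (Z + U)) ** 2
```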

3. 3D reconstruction of complex surface based on optical diffusion

As we have seen in Section 2, when the surface Z is not an equifocal plane, the corresponding PSF is shift varying, and we cannot use the homogeneous heat equation to model the diffused images. Therefore, we have introduced the inhomogeneous diffusion Eq. (15) by allowing the diffusion coefficient a to vary spatially as a function of locations on the image.

Suppose E_1(x, y) is the image captured before an optical diffuser is placed and E_2(x, y) is the diffused image captured after an optical diffuser is placed in front of the scene, parallel to the image plane. In this section, we propose a global DRFD method with respect to E_1(x, y) and E_2(x, y). First, based on the diffusion process described in Section 2, the following system can be written,

\begin{cases} \dot{u}(x,y,t) = \nabla \cdot \big( a(x,y) \nabla u(x,y,t) \big), & t \in (0,\infty) \\ u(x,y,0) = E_1(x,y) \\ u(x,y,t_2) = E_2(x,y) \end{cases} \qquad (18)

Recall that in Section 2, we required the diffusion coefficient a to be a smooth function that satisfies a number of properties in order to guarantee that energy is conserved in the imaging process. Since E_2(x, y) is the diffusion result of E_1(x, y), the diffusion from E_1(x, y) to E_2(x, y) is a forward diffusion; therefore, a(x, y) ≥ 0.

Notice that a(x, y) = 0, that is,

\frac{\gamma^2 v^2 \tan^2\theta}{2t \cos^4\alpha} \left( \frac{Z}{Z+U} \right)^2 = 0 \qquad (19)

only when Z = 0 or θ = 0, in which case there is no diffusion between E_1(x, y) and E_2(x, y). Therefore, we can rewrite the model here in a more complete form, including boundary conditions, as,

\begin{cases} \dot{u}(x,y,t) = \nabla \cdot \big( a(x,y) \nabla u(x,y,t) \big), & t \in (0,\infty) \\ u(x,y,0) = E_1(x,y) \\ u(x,y,\Delta t) = E_2(x,y) \\ 0 = a(x,y) \nabla u(x,y,t) \cdot n(x,y) \end{cases} \qquad (20)
where n(x, y) denotes the normal to the image boundary, so that no energy flows across the border.

Normally, for an optical diffuser, 0° ≤ θ < 90°, so tanθ ≥ 0, and the depth between the diffuser and the scene point can be denoted as,

Z = \frac{\sigma U \cos^2\alpha}{\gamma v \tan\theta - \sigma \cos^2\alpha} \qquad (21)
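Equation (21) is the step that converts an estimated blur spread into a depth value. The following self-consistency sketch (illustrative values, with α = 0) generates σ from a known depth using Eq. (6) and σ = γb, and then recovers that depth with Eq. (21):

```python
import numpy as np

def depth_from_sigma(sigma, U, v, theta_deg, gamma, alpha_deg=0.0):
    """Recover the object-to-diffuser distance Z from the blur spread sigma, Eq. (21)."""
    theta = np.deg2rad(theta_deg)
    c2 = np.cos(np.deg2rad(alpha_deg)) ** 2
    return sigma * U * c2 / (gamma * v * np.tan(theta) - sigma * c2)

# forward model (Eq. (6) with alpha = 0) followed by the inversion of Eq. (21)
U, v, theta_deg, gamma, Z_true = 100.0, 12.2, 10.0, 0.2, 850.0
b = v * np.tan(np.deg2rad(theta_deg)) * Z_true / (Z_true + U)
print(depth_from_sigma(gamma * b, U, v, theta_deg, gamma))   # ~850.0
```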

This naturally suggests an iterative procedure for tackling the inverse problem of reconstructing shape from an image before diffusion and an image after diffusion. Intuitively, the iteration starts from an initial estimate of the depth map (for example, a flat plane such that a(x, y) = 0) and then finds, at each location, the amount of diffusion that must be applied to the less blurred image so that the two images become close. The amount of diffusion required to match the two images encodes information on the depth of the scene. More formally, we can pose the problem as the minimization of the following functional,

\tilde{Z} = \arg\min_{Z(x,y)} \iint \big( u(x,y,\Delta t) - E_2(x,y) \big)^2\, dx\, dy \qquad (22)
where the cost functional accounts for the discrepancy between the simulated image u, obtained by diffusing the measured image E_1(x, y), and the measured diffused image E_2(x, y).

However, the optimization problem above is ill-posed; that is, the minimum may not exist, and even if it exists, it may not be stable with respect to data noise. A common way to regularize the problem is to add a Tikhonov penalty,

\tilde{Z} = \arg\min_{Z(x,y)} \iint \big( u(x,y,\Delta t) - E_2(x,y) \big)^2\, dx\, dy + \eta \|\nabla Z(x,y)\|^2 + \eta k \|Z(x,y)\|^2 \qquad (23)
where the additional terms impose a smoothness constraint on the depth map. In practice, we use η > 0 and k > 0 that are both very small, because the penalty has little practical influence on the cost energy, which is denoted as,

F(Z) = \iint \big( u(x,y,\Delta t) - E_2(x,y) \big)^2\, dx\, dy + \eta \|\nabla Z\|^2 + \eta k \|Z\|^2 \qquad (24)

Therefore, the solution process is equivalent to the following minimization,

\tilde{Z} = \arg\min_{Z} F(Z) \qquad (25)

Equation (25) is a dynamic optimization problem that can be solved by gradient flow. The algorithm can be divided into the following steps (the detailed process can be found in [20–22]), the flow graph of our algorithm is shown in Fig. 5, and a minimal sketch of the corresponding cost-energy evaluation is given after the step list.

Fig. 5 Flow graph of our algorithm.

  • (1) Give the camera parameters f, γ, v, and set up the iteration parameters: the stopping threshold ε, the regularization parameters η and k, and the step size β;
  • (2) Capture the first image E_1 without optical diffusion; place an optical diffuser with a diffusion angle of θ in front of the camera, perpendicular to the optical axis, and capture the second image E_2. The distance from the camera to the diffuser is U;
  • (3) Compute Eq. (17) to obtain the diffusion coefficient;
  • (4) Solve Eq. (20) to obtain u(x, y, Δt);
  • (5) Compute Eq. (24) with the solution u(x, y, Δt) of Step (4). If the cost energy is below ε, the algorithm stops; otherwise, update Z with the step size β according to the gradient flow,
    \frac{\partial Z}{\partial t} = -F'(Z) \qquad (26)
  • (6) Compute Eq. (21), update the depth map, and return to Step (3).
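As mentioned above, the sketch below shows how one evaluation of the cost energy F(Z) of Eq. (24) can be assembled for Steps (3)–(5), assuming the diffusion_coefficient and diffuse_inhomogeneous helpers sketched in Section 2; the adjoint-based gradient F′(Z) required by Eq. (26) is not reproduced here (see [20–22]), and the parameter names are illustrative.

```python
import numpy as np

def cost_energy(Z, E1, E2, U, v, theta_deg, gamma, t_total, eta, k, dt=0.05):
    """Cost energy F(Z) of Eq. (24): simulate the diffusion of E1 under the
    coefficient map of Eq. (17), then add the data term and the Tikhonov penalty.
    Assumes diffusion_coefficient() and diffuse_inhomogeneous() from Section 2."""
    a = diffusion_coefficient(Z, U, v, theta_deg, gamma, t=t_total)            # Step (3)
    u = diffuse_inhomogeneous(E1, a, dt=dt, steps=int(round(t_total / dt)))    # Step (4)
    gy, gx = np.gradient(Z)                                                    # smoothness term
    data = np.sum((u - E2) ** 2)
    penalty = eta * np.sum(gx ** 2 + gy ** 2) + eta * k * np.sum(Z ** 2)
    return data + penalty                                                      # Step (5)
```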

4. Experiment

In order to validate the newly proposed algorithm, we use a number of synthetic images and two real images of five playing cards to test it. First, in the simulation, the performance of the proposed algorithm is tested with a cosine plane and a box plane, respectively; the basic parameters of the simulation are as follows: f = 12 mm, v = 12.2 mm, s_0 = 850 mm, F = 2, D = f/2, γ = 0.2, and θ = 10°. Then, the error maps for the two synthesized planes are constructed and the mean square error of the proposed method is calculated to test the precision of the algorithm. In the real experiment, the camera is a Canon EOS 5D Mark III with a Canon EF 50 mm f/1.8 lens, and the optical diffuser, with a diffusion angle of 10°, is from Edmund Optics.

4.1 Simulation with a diffusion angle of 10°

In this subsection, the simulation with a cosine plane is conducted. First, we synthesize the non-diffused image, obtained when the focus condition of geometrical optics is satisfied and no optical diffuser is placed in front of the cosine plane. Then, the diffused image is synthesized assuming a diffusion angle of 10°. Finally, the global 3D surface is reconstructed with the algorithm presented in this paper, and the simulation results are shown in Figs. 6–8. Figure 6 shows the synthesized images, where Fig. 6(a) is the image before optical diffusion and Fig. 6(b) is the image after diffusion; the gray depth maps are shown in Fig. 7, where Fig. 7(a) is the depth map calculated with our algorithm and Fig. 7(b) is the true depth map; Fig. 8 shows the global 3D surfaces, where Fig. 8(a) is the 3D surface estimated with our algorithm and Fig. 8(b) is the true 3D surface.

Fig. 6 The synthesized images of a cosine plane.

Fig. 7 The depth maps of gray.

Fig. 8 The reconstructed 3D surface and the true 3D surface.

From these figures, the following conclusions can be drawn,

  • 1) Because the depth of field of our simulated camera is much larger than the depth variation of our sample, the image before optical diffusion can be considered a focused image. After optical diffusion, the sample image is blurred, and the degree of blurring differs from point to point owing to the different distances between the points and the optical diffuser.
  • 2) The estimated gray depth map has brightness features very similar to those of the true gray depth map, and the 3D surface reconstructed by our algorithm exhibits the same waveform variation as the true surface.

In order to investigate the precision of the new algorithm, we construct the error map ζ between the true depth Z and the estimated depth Z_e for the cosine plane, and calculate the mean square error φ over the whole image. The calculation formulas are given in Eq. (27) and Eq. (28),

\zeta = Z_e / Z - 1 \qquad (27)
\varphi = E\big[ (Z_e / Z - 1)^2 \big] \qquad (28)
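For reference, a compact NumPy version of these two metrics (the function name is illustrative; Z and Z_e are arrays of the same shape) is:

```python
import numpy as np

def error_metrics(Z_true, Z_est):
    """Relative error map zeta of Eq. (27) and mean-square error phi of Eq. (28)."""
    zeta = Z_est / Z_true - 1.0
    return zeta, np.mean(zeta ** 2)
```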

The error map is shown in Fig. 9, where we can see that the maximal height of the error map is less than 0.012 and the average error is only 0.0057, i.e., 0.57%. The mean square error over the whole image is 0.0058. At the edge of the reconstructed surface, the error is higher than in the middle of the image. The reason is that the optimization method we use to estimate the global 3D surface suffers from an edge effect; overcoming this effect is left for future work.

Fig. 9 The error map between the estimated surface and the true surface.

Second, the simulation with a rectangle plane is conducted; in this simulation, the sample surface is much sharper than the cosine plane. The simulation results are shown in Figs. 10–12, where Fig. 10 shows the synthesized images, with Fig. 10(a) being the image before optical diffusion and Fig. 10(b) the image after diffusion; the gray depth maps are shown in Fig. 11, where Fig. 11(a) is the depth map estimated with our algorithm and Fig. 11(b) is the true depth map; Fig. 12 shows the 3D surfaces of the rectangle plane, where Fig. 12(a) is the 3D surface estimated with our algorithm and Fig. 12(b) is the true 3D surface.

Fig. 10 The synthesized images of a rectangle plane.

Fig. 11 The depth maps of gray.

Fig. 12 The reconstructed 3D surface and the true surface.

From Figs. 11–13, we can see that although there are some noise points in the estimated depth map and the reconstructed 3D surface, the 3D surface reconstructed by our algorithm is close to the true surface, regardless of the shape of the surface. The error map is shown in Fig. 13, and the mean square error of the reconstruction result in this second simulation is 0.0046.

Fig. 13 The error map between the estimated surface and the true surface.

4.2 Experiment with a diffusion angle of 10°

We use images of five playing cards to validate our depth reconstruction method. The average height of our cards is 0.29 mm, and the experimental results are shown in Figs. 14–16. Figure 14 shows the two experimental images, where Fig. 14(a) is the image of the cards before optical diffusion and Fig. 14(b) is the diffused image; the global surface of the arranged cards reconstructed with our global 3D reconstruction method is shown in Figs. 15 and 16, where Fig. 15 is the reconstructed depth map with a color bar and Fig. 16 is the reconstructed 3D surface. The unit of the depth axis is mm. In order to test the precision of our method in this experiment, we also construct the true 3D surface of the arranged cards, because the height of each card and the arrangement rule are known; the true surface is shown in Fig. 17. The error surface between the true surface and the estimated surface is shown in Fig. 18, and the calculation formula is the same as Eq. (27).

Fig. 14 Focused image and diffused image of the arranged cards.

Fig. 15 Reconstructed surface of our method.

Fig. 16 3D reconstructed surface of our method.

Fig. 17 True 3D surface of the cards.

Fig. 18 Error surface of our method.

From Fig. 16, Fig. 18, and our error calculations, we can draw the following conclusions,

  • (1) Our global surface reconstruction method can precisely reconstruct the depth variation of the arranged cards: five depth levels are obtained, which coincide with the ground truth shown in Fig. 17.
  • (2) At the edges of the cards, the error is higher than at other locations. This is because we use a global optimization method based on the gradient flow, and it takes a longer time to obtain a comparatively sharp surface. In addition, the estimation error is proportional to the true depth.
  • (3) From our calculations, the mean reconstruction error of the proposed algorithm is 0.052 mm, and the mean square error is 0.0536. Because we use an optimization method to obtain a global 3D surface, no sharp depth peaks appear between the depth levels; therefore, it is possible to obtain a smoother 3D surface compared with local 3D reconstruction methods.

5. Conclusion

In this paper, a novel 3D reconstruction method based on optical diffusion is proposed. Our primary contribution to the wider field is the introduction of the imaging model and the heat diffusion equation. Further, a blurred image with optical diffusion is constructed using relative blurring and this equation. A second contribution is the proposal of a global 3D surface reconstruction algorithm based on the relationship between the reconstructed surface and the degree of blurring of the associated diffused image. Finally, a third contribution made by this study is that we conduct simulations with synthetic images and experiments using various arrangements of five cards in order to validate the new method. The results show that the proposed algorithm is effective as a means of reconstructing a 3D surface using monocular vision without the need for camera parameter adjustment. In addition, as the proposed method requires only one fixed camera and an optical diffuser, the process is simple, and the error analysis results show that it can reconstruct depth with high precision. Therefore, this technique can be used in monocular vision and hand-eye systems to obtain 3D information. The spatial resolution of this algorithm should be investigated in future work, because it is related to both the diffusion angle of the diffuser and the distance between the diffuser and the scene; these two characteristics have a mutually coupled relationship.

Acknowledgments

The authors thank the National Natural Science Foundation of China (NSFC) (No. 61305025, 61473282) and the Fundamental Research Funds for the Central Universities (N13050411) for funding support. The authors also thank the State Key Laboratory of Robotics, Shenyang Institute of Automation, Chinese Academy of Sciences (CAS) for its support.

References and links

1. C. Y. Yin, “Determining residual nonlinearity of a high-precision heterodyne interferometer,” Opt. Eng. 38(8), 1361–1365 (1999).

2. L. D. Wu, Computer Vision (Fudan University Press, 1993).

3. B. Girod and S. Scherock, “Depth from defocus of structured light,” in Proceedings of Optics, Illumination, and Image Sensing for Machine Vision (1989), pp. 209–215.

4. V. M. J. Bove, Jr., “Entropy-based depth from focus,” J. Opt. Soc. Am. A 10(4), 561–566 (1993).

5. S. K. Nayar, “Shape from focus system,” in Proceedings of IEEE Computer Vision and Pattern Recognition (1992), pp. 302–308.

6. A. P. Pentland, S. Scherock, T. Darrell, and B. Girod, “Simple range cameras based on focus error,” J. Opt. Soc. Am. A 11(11), 2925–2934 (1994).

7. S. K. Nayar, M. Watanabe, and M. Noguchi, “Real-time focus range sensor,” IEEE Trans. Pattern Anal. Mach. Intell. 18(12), 1186–1198 (1996).

8. A. P. Pentland, “A new sense for depth of field,” IEEE Trans. Pattern Anal. Mach. Intell. 9(4), 523–531 (1987).

9. M. Gokstorp, “Computing depth from out-of-focus blur using a local frequency representation,” in Proceedings of the International Conference on Pattern Recognition (1994), pp. 153–158.

10. M. Subbarao and G. Surya, “Depth from defocus: a spatial domain approach,” Int. J. Comput. Vis. 13(3), 271–294 (1994).

11. V. P. Namboodiri and S. Chaudhuri, “On defocus, diffusion and depth estimation,” Pattern Recognit. Lett. 28(3), 311–319 (2007).

12. P. Favaro, S. Soatto, M. Burger, and S. J. Osher, “Shape from defocus via diffusion,” IEEE Trans. Pattern Anal. Mach. Intell. 30(3), 518–531 (2008).

13. P. Favaro, A. Mennucci, and S. Soatto, “Observing shape from defocused images,” Int. J. Comput. Vis. 52(1), 25–43 (2003).

14. P. Favaro and A. Mennucci, “Learning shape from defocus,” in Proceedings of the European Conference on Computer Vision (2002), pp. 735–745.

15. K. Mori, “Apparatus for uniform illumination employing light diffuser,” US Patent 4,460,940 (July 17, 1984).

16. G. M. Mari-Roca, L. Vaughn, J. S. King, K. W. Jelley, A. G. Chen, and G. T. Valliath, “Light diffuser for a liquid crystal display,” US Patent 5,390,085 (February 14, 1995).

17. A. Ashok and M. A. Neifeld, “Pseudorandom phase masks for superresolution imaging from subpixel shifting,” Appl. Opt. 46(12), 2256–2268 (2007).

18. E. E. García-Guerrero, E. R. Méndez, H. M. Escamilla, T. A. Leskova, and A. A. Maradudin, “Design and fabrication of random phase diffusers for extending the depth of focus,” Opt. Express 15(3), 910–923 (2007).

19. C. Y. Zhou, O. Cossairt, and S. Nayar, “Depth from diffusion,” in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (2010), pp. 1110–1117.

20. Y. J. Wei, Z. L. Dong, and C. D. Wu, “Depth measurement using single camera with fixed camera parameters,” IET Computer Vision 6(1), 29–39 (2012).

21. Y. Wei, C. Wu, and Z. Dong, “Nanoscale depth reconstruction from defocus: within an optical diffraction model,” Opt. Express 22(21), 25481–25493 (2014).

22. Y. J. Wei, C. D. Wu, and Z. L. Dong, “Global shape reconstruction of the bended AFM cantilever,” IEEE Trans. NanoTechnol. 11(4), 713–719 (2012).


