Optica Publishing Group

Refraction correction for deep-water three-dimensional visual measurement based on multi-objective optimization

Open Access

Abstract

Refraction-induced errors affect the accuracy of three-dimensional visual measurements in deep-water environments. In this study, a binocular camera refractive imaging model was established, and a calibration method for the refraction parameters was proposed for high-accuracy shape and deformation measurements in deep-water environments. First, an initial estimate of the refractive axis was obtained using a three-dimensional calibration target. Then, the errors in the distances between spatial point pairs and the reprojection errors were taken as dual optimization objectives, and the Non-dominated Sorting Genetic Algorithm II was applied to optimize the refraction parameters. To efficiently calculate the reprojection error, an improved numerical computation method was proposed to accelerate the calculation of the analytical forward projection. Underwater experiments were conducted to verify the method’s effectiveness. The results showed that the average error of the absolute position of the reconstructed points was less than 1.1 mm and the average error of the displacement was less than 0.04 mm. This study provides a sound solution for accurate three-dimensional visual measurement in deep-water environments.

© 2023 Optica Publishing Group under the terms of the Optica Open Access Publishing Agreement

1. Introduction

Underwater acoustic sensing for long-distance and wide-range target detection typically suffers from low resolution [1–4]. To address this dilemma, underwater-vision-based sensing, especially in deep-water environments, has recently attracted extensive attention [5–7]. Vision-based methods can obtain rich information about underwater targets at close range with high resolution and low cost. Consequently, they have been applied in various fields including water engineering inspection, ROV mechanical arm navigation, marine archeology, underwater search, and marine bioscience. In deep-water environments, a camera is typically enclosed in a cylindrical waterproof housing with a thick observation window to withstand the tremendous hydrostatic pressure. Therefore, light rays must pass through multiple media before being captured by the camera. If a pinhole camera model and parameters calibrated in air are employed, significant measurement errors are introduced because of refraction [8].

Numerous studies have attempted to eliminate the effect of refraction on underwater visual measurements. Early studies approximated the effects of refraction by treating it as a change in camera focal length or lens distortion based on a pinhole camera model [9–11]. Lavest et al. suggested that if a calibration method used in air were applied directly in water, the focal length would be multiplied by the refractive index of water, while the image principal point would seem unchanged and the radial distortion would differ significantly [9]. However, owing to the strong nonlinearity of underwater imaging, the systematic errors of these methods are large and cannot be ignored. Recently, the establishment of accurate underwater imaging models and the development of calibration methods have attracted significant interest [12–25]. Kunz et al. developed a theory for an underwater hemispherical refraction model and a camera calibration method [12]. Treibitz et al. established a simple physical model to describe camera refractive imaging and derived a forward projection applicable to the model based on Snell’s law [13]. However, the thickness of the observation window was not considered, and the optical axis was assumed to be perpendicular to the window. Agrawal et al. established an innovative theory for multi-layer flat refractive geometries [14]. They proposed a refractive axis estimation method based on coplanarity constraints and calculated the other refraction parameters; however, only a single-camera refractive imaging model was used, and the derived forward projection was quite complex. Chen et al. proposed a method for calculating the thickness of all refractive layers with a known refractive axis and relative pose of the binocular camera [15]. Su et al. established a binocular refractive imaging model with a shared refractive surface [16,17].
When calculating the refractive axis, markers must be laid out manually on the refractive surface, which limits the application of this method in deep-water environments. To solve this problem, some researchers have adopted a complete camera refractive imaging model for refraction correction. Jordt-Sedlazeck and Koch developed the concept of a virtual camera to calculate the virtual reprojection error as an objective function [18,19]. Both Zhang [20] and Kong [21] used spatial features related to a calibration target to establish an objective function. In fact, these approaches all attempt to avoid the complexity of forward projection; therefore, minimization of the reprojection error was not considered. Refraction correction methods for deep-water three-dimensional (3D) visual measurements, particularly for high-accuracy shape and deformation measurements, have not been well studied.

To develop a sound 3D visual method applicable to deep-water environments, an underwater binocular camera imaging model was established, and a calibration method for the refraction parameters was proposed accordingly. Calibration of the refractive parameters can be divided into two steps: initial estimation and optimization based on a heuristic algorithm. Three experiments were conducted to test the performance of the proposed method in underwater 3D reconstruction. The main contributions of this study are as follows:

  • (1) By using a 3D calibration target, a more accurate initial estimation of the refractive axis was obtained. It should be noted that 3D calibration targets are inexpensive and easily obtained through photogrammetry.
  • (2) Reasonable and explicit dual optimization objectives were constructed, including reprojection and distance errors. We point out that because of the nonlinearity of the underwater imaging model, the errors on both the image and space sides should be minimized simultaneously.
  • (3) A numerical method for forward projection is proposed that significantly reduces complexity and accelerates calculation.
  • (4) An underwater displacement experiment was conducted to verify the displacement measurement accuracy of this method, which has rarely been reported in previous studies.

The remainder of this paper is organized as follows. Section 2 introduces the proposed calibration method, including the initial estimation and detailed optimization of the refractive parameters. Section 3 presents the experiments and an analysis of the results. In Section 4, we discuss the proposed method. Finally, the conclusions are presented in Section 5.

2. Method

2.1 Binocular camera refractive imaging model in deep water

Considering the hydrostatic pressure in deep-water environments, waterproof housing is typically designed to be compact. Therefore, binocular cameras are typically enclosed in a separate waterproof housing with a thick flat observation window. As shown in Fig. 1, a ray starting from point ${P_W}$ travels through three different media: water, observation window, and air, undergoing refraction twice before being imaged by the camera sensor.

Fig. 1. Binocular camera refractive imaging model in deep water

For the left camera, the interface between the water and the observation window is ${\pi _2}$, and the interface between the air and the observation window is ${\pi _1}$. The intersections of the ray path with ${\pi _1}$ and ${\pi _2}$ are $P_l^1$ and $P_l^2$, respectively. ${p_l}$ is the projection of ${P_W}$ on the left camera sensor. The normal vector of ${\pi _1}$ and ${\pi _2}$ is ${{\boldsymbol n}_l}$ in the left camera coordinate system ${O_l} - {X_l}{Y_l}{Z_l}$. ${d_l}$ is the distance from the optical center of the left camera to ${\pi _1}$, and ${h_l}$ is the thickness of the observation window. The refractive indices of air, the observation window, and water are ${u_1},{u_2},{u_3}$, respectively. The relative pose between the two cameras is described by the rotation matrix R and translation vector T. If the effect of refraction is not considered and triangulation is performed directly, the reconstructed point $P_W^{\prime}$ differs significantly from the true position, as shown in Fig. 1.

For the reconstruction of 3D points, it is assumed that the intrinsic parameters of the two cameras have been calibrated in advance, so the vectors ${O_l}{p_l}$ and ${O_r}{p_r}$ can be determined. When light rays pass through media with different refractive indices ${u_i},{u_{i + 1}}$, as shown in Fig. 2(a), according to Snell's law

$${{\boldsymbol v}_{i + 1}} = \frac{{{u_i}}}{{{u_{i + 1}}}}{{\boldsymbol v}_i} + \frac{{ - {u_i}{\boldsymbol v}_i^T{\boldsymbol n} - \sqrt {u_i^2{{({\boldsymbol v}_i^T{\boldsymbol n})}^2} - (u_i^2 - u_{i + 1}^2){\boldsymbol v}_i^T{{\boldsymbol v}_i}} }}{{{u_{i + 1}}}}{\boldsymbol n}$$
where ${\boldsymbol n}$ is the normal vector of the media interface. Therefore, once the variables highlighted in red in Fig. 1, $\Omega ({d_l},{d_r},{h_l},{h_r},{{\boldsymbol n}_l},{{\boldsymbol n}_r},R,T)$, are determined, the ray vectors in the observation windows, $P_l^1P_l^2$ and $P_r^1P_r^2$, and in water, $P_l^2{P_W}$ and $P_r^2{P_W}$, can be determined as well. The reconstructed point ${P_W}$ is taken as the point that minimizes the sum of its distances to the two in-water rays $P_l^2{P_W}$ and $P_r^2{P_W}$. The variables $\Omega ({d_l},{d_r},{h_l},{h_r},{{\boldsymbol n}_l},{{\boldsymbol n}_r},R,T)$ are defined as the refraction parameters that need to be calibrated.
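As a concrete illustration of Eq. (1) and of the ray-midpoint reconstruction described above, the following sketch (with hypothetical inputs; the sign convention assumes unit vectors with ${\boldsymbol v}_i^T{\boldsymbol n} < 0$) refracts a ray at one interface and reconstructs a point as the midpoint of the common perpendicular of two rays:

```python
import numpy as np

def refract(v, n, u_i, u_next):
    """Vector form of Snell's law (Eq. (1)); v, n unit vectors with v.n < 0."""
    c = float(v @ n)
    disc = (u_i * c) ** 2 - (u_i ** 2 - u_next ** 2) * float(v @ v)
    if disc < 0:
        return None                       # total internal reflection
    return (u_i / u_next) * v + ((-u_i * c - np.sqrt(disc)) / u_next) * n

def ray_midpoint(a1, d1, a2, d2):
    """Midpoint of the common perpendicular of two rays a_k + t_k d_k,
    i.e. the point minimizing the sum of squared distances to both rays."""
    b = a2 - a1
    # Normal equations for the two ray parameters t1, t2
    A = np.array([[d1 @ d1, -(d1 @ d2)],
                  [d1 @ d2, -(d2 @ d2)]])
    t1, t2 = np.linalg.solve(A, np.array([d1 @ b, d2 @ b]))
    return 0.5 * ((a1 + t1 * d1) + (a2 + t2 * d2))
```

The refracted direction stays unit-length because refraction only rotates the ray within the plane of incidence.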

Fig. 2. (a) Schematic diagram of Snell's law, (b) analytical forward projection (AFP) on the refractive plane

According to Agrawal et al. [14], the analytical forward projection (AFP) reduces to a 12th-degree equation when three different media exist. The AFP can be used to compute the projection of a known 3D point on the camera sensor; however, the conventional computation method is complex and time-consuming. To enable comprehensible and fast computation of the AFP, an improved numerical method is proposed, as shown in Eqs. (2)–(6).

In multi-layer flat refractive geometry, all rays on the propagation path lie in the same plane, which is defined as the refractive plane. As shown in Fig. 2(b), consider the coordinate system $O - {X_n}{Y_n}$ established on the refractive plane, where O is the optical center of the camera and $O{Y_n}$ is parallel to n. A point $P(X,Y,Z)$ is given in the camera coordinate system $O - XYZ$, and ${P^{\prime}}(s,z)$ is its transformed coordinate in $O - {X_n}{Y_n}$. The refraction angles in air, the observation window, and water are ${\theta _1},{\theta _2},{\theta _3}$, respectively. According to Snell's law:

$${u_1}\sin {\theta _1} = {u_2}\sin {\theta _2} = {u_3}\sin {\theta _3} = a$$
where a is a constant. According to the geometric constraints shown in Fig. 2(b),
$$d\tan {\theta _1} + h\tan {\theta _2} + (z - d - h)\tan {\theta _3} = s$$

In addition, the following relations hold:

$$\left\{ \begin{array}{l} \tan {\theta_i} = \frac{{\sin {\theta_i}}}{{\sqrt {1 - \sin {\theta_i}^2} }}\\ \sin {\theta_i} = \frac{a}{{{u_i}}} = {k_i}a \end{array} \right.,i = 1,2,3$$

Substituting Eq. (4) into Eq. (3), we get

$$f(a) = d\frac{{{k_1}a}}{{\sqrt {1 - {k_1}^2{a^2}} }} + h\frac{{{k_2}a}}{{\sqrt {1 - {k_2}^2{a^2}} }} + (z - d - h)\frac{{{k_3}a}}{{\sqrt {1 - {k_3}^2{a^2}} }} - s = 0$$

The problem is transformed into finding the solution when $f(a) = 0$.

Consider the following function:

$$g(x) = \frac{x}{{\sqrt {1 - {x^2}} }},x \in [{0,1} )$$
$g(x)$ is monotonically increasing on [0,1). Because ${\theta _i} \in [0,9{0^\circ })$ and ${k_i}a = \sin {\theta _i} \in [0,1)$, $f(a) = d \cdot g({k_1}a) + h \cdot g({k_2}a) + (z - d - h) \cdot g({k_3}a) - s$ is also monotonically increasing and has a unique zero. The value of a is obtained by iteration using Newton's method. Let ${\theta _0}$ be the angle between $O{Y_n}$ and $O{P^{\prime}}$, as shown in Fig. 2(b). The initial estimate of a is taken as $\sin {\theta _0}$: since ${u_1}\sin {\theta _1} = a$, ${u_1}$ is usually 1, and ${\theta _0}$ is very close to ${\theta _1}$. Owing to the monotonicity of the function and the good initial estimate, a can be calculated in 4-5 iterations with a final error of less than ${10^{ - 5}}$ pixels, which is fast and accurate. Subsequently, the coordinate of ${P^1}$ in $O - {X_n}{Y_n}$ is calculated and converted back to the camera coordinate system $O - XYZ$. Finally, the pixel coordinate p is obtained by applying a perspective transformation to the converted ${P^1}$.
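The Newton iteration above can be sketched as follows (a minimal implementation; the refractive indices and geometry values in the default arguments are assumed example values, and $g'(x) = (1-x^2)^{-3/2}$ follows from Eq. (6)):

```python
import math

def solve_afp(s, z, d, h, u=(1.0, 1.49, 1.33), tol=1e-12, max_iter=20):
    """Solve the AFP equation f(a) = 0 of Eq. (5) by Newton's method.
    (s, z): point P' in the refractive-plane frame; d: optical-center-to-window
    distance; h: window thickness; u: refractive indices (air, window, water).
    Returns a = u_i * sin(theta_i)."""
    k = [1.0 / ui for ui in u]                     # k_i = 1 / u_i (Eq. (4))
    g = lambda x: x / math.sqrt(1.0 - x * x)       # Eq. (6)
    dg = lambda x: 1.0 / (1.0 - x * x) ** 1.5      # g'(x)
    coeff = (d, h, z - d - h)
    a = s / math.sqrt(s * s + z * z)               # initial guess: sin(theta_0)
    for _ in range(max_iter):
        f = sum(c * g(ki * a) for c, ki in zip(coeff, k)) - s
        fp = sum(c * ki * dg(ki * a) for c, ki in zip(coeff, k))
        step = f / fp
        a -= step                                  # Newton update
        if abs(step) < tol:
            break
    return a
```

Because $f$ is monotonically increasing and the initial guess is close to the root, the iteration converges in a handful of steps.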

2.2 Initial estimation of refraction parameters

As shown in Fig. 2(b), ${{\boldsymbol v}_0}$ is the unit vector in the $Op$ direction. A point ${P_W}$ in the world coordinate system is transformed to the camera coordinate system as $P = {R_1}{P_W} + {T_1}$. Notably, ${{\boldsymbol v}_0}$ and n are both expressed in the camera coordinate system. According to Agrawal et al. [14], the coplanarity constraint can be written as

$${\boldsymbol v}_{\bf 0}^T \cdot ({\boldsymbol n} \times ({R_1} \cdot {P_w} + {T_1})) = 0$$

The coplanarity constraint is rewritten as

$${\boldsymbol v}_{\bf 0}^TE{P_w} + {\boldsymbol v}_{\bf 0}^Ts = 0$$
where $E = {[{\boldsymbol n}]_ \times }{R_1}$ and $s = {\boldsymbol n} \times {T_1}$. If there are N correspondences between ${P_W}$ and ${{\boldsymbol v}_0}$, we obtain
$$\left[ {\begin{array}{cc} {{P_w}{{(1)}^T} \otimes {{\boldsymbol v}_{\bf 0}}{{(1)}^T}}&{{{\boldsymbol v}_{\bf 0}}{{(1)}^T}}\\ \vdots & \vdots \\ {{P_w}{{(N)}^T} \otimes {{\boldsymbol v}_{\bf 0}}{{(N)}^T}}&{{{\boldsymbol v}_{\bf 0}}{{(N)}^T}} \end{array}} \right]\left[ \begin{array}{c} E(1:9)\\ s \end{array} \right] = B\left[ \begin{array}{c} E(1:9)\\ s \end{array} \right] = 0$$

When $N \ge 12$, the solution for E and s is given by the singular vector corresponding to the smallest singular value of matrix B, obtained by SVD. Since

$${{\boldsymbol n}^T} \cdot E = {{\boldsymbol n}^T} \cdot {[{\boldsymbol n}]_ \times }R = {E^T} \cdot {\boldsymbol n} = 0$$

The solution for n is given by the singular vector corresponding to the smallest singular value of ${E^T}$, obtained by SVD. A single set of correspondences yields only one matrix ${E^T}$, and significant random noise in the correspondences between ${P_W}$ and ${{\boldsymbol v}_0}$ then directly degrades the accuracy of n. Therefore, we created a 3D calibration target, as shown in Fig. 3. The 3D coordinates of the encoded points on the target were obtained by close-range photogrammetry, from which the distance between any two points can be calculated. By taking M underwater photos of the calibration target in different poses, we obtain M matrices ${E^T}$. A more robust estimate of n is given by the singular vector corresponding to the smallest singular value of the stacked matrix C, as shown in Eq. (11).

$$\left[ \begin{array}{l} E_1^T\\ \ldots \\ E_M^T \end{array} \right]{\boldsymbol n} = C{\boldsymbol n} = 0$$
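The two SVD steps can be made concrete with the following sketch on synthetic data. The refractive axis and target poses are assumed values for illustration, the rays ${\boldsymbol v}_0$ are generated directly from the coplanarity condition of Eq. (7) rather than by full refractive ray tracing, and the solution vector stores E row-major:

```python
import numpy as np

def skew(v):
    """Cross-product matrix [v]_x."""
    return np.array([[0., -v[2], v[1]], [v[2], 0., -v[0]], [-v[1], v[0], 0.]])

rng = np.random.default_rng(0)

# Assumed ground-truth refractive axis (illustration only)
n_true = np.array([0.05, -0.02, 1.0])
n_true /= np.linalg.norm(n_true)

C_rows = []
for m in range(5):                                   # M = 5 target poses
    w = 0.15 * rng.normal(size=3)                    # random small rotation
    ang, K = np.linalg.norm(w), skew(w / np.linalg.norm(w))
    R1 = np.eye(3) + np.sin(ang) * K + (1 - np.cos(ang)) * (K @ K)  # Rodrigues
    T1 = rng.normal(size=3) + np.array([0., 0., 2.])

    Pw = rng.uniform(-1., 1., (20, 3))               # N = 20 target points
    Pc = Pw @ R1.T + T1                              # points in camera frame
    # Coplanarity (Eq. (7)): each v0 lies in the plane spanned by n and Pc
    v0 = rng.uniform(0.2, 1.0, (20, 1)) * n_true + Pc
    v0 /= np.linalg.norm(v0, axis=1, keepdims=True)

    # B of Eq. (9); with E stored row-major, the E-block entries are v0_i * Pw_j
    B = np.hstack([np.einsum('ni,nj->nij', v0, Pw).reshape(20, 9), v0])
    x = np.linalg.svd(B)[2][-1]                      # null vector -> [E(1:9); s]
    C_rows.append(x[:9].reshape(3, 3).T)             # stack E_m^T as in Eq. (11)

C = np.vstack(C_rows)
n_est = np.linalg.svd(C)[2][-1]                      # refractive axis, up to sign
```

Each per-pose estimate of E is only determined up to scale and sign, but n remains in the null space of every ${E_m^T}$, so the stacked system recovers it robustly.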

Because observation windows are typically manufactured in advance, the thickness h of the observation window was measured with a vernier caliper and treated as a constant. The extrinsic parameters between the two cameras, R and T, were obtained by Zhang's calibration method using a planar calibration board in air. Although an observation window was present, its impact on the calculation of the relative pose was very small. d was calculated using Chen's method [15], which computes the thickness of all refractive layers given a known refractive axis and the relative pose of the binocular camera.

Fig. 3. 3D calibration target

Fig. 4. Experimental setup for axis estimation.

2.3 Optimization of refraction parameters based on NSGA-II

Considering the strong nonlinearity of underwater camera imaging, we used a heuristic algorithm to optimize the refraction parameters. NSGA-II (Non-dominated Sorting Genetic Algorithm II) is a multi-objective optimization algorithm developed by Kalyanmoy Deb [26]. NSGA-II uses a fast and efficient sorting technique to rank solutions based on their non-dominance and diversity, and employs a crowding-distance measure to preserve a diverse set of solutions. The choice of the optimization objectives is the most important issue in applying NSGA-II. To ensure minimal errors on both the image and space sides, objective f1 was defined as the average reprojection error of all points, and f2 as the average distance error of all selected point pairs. The objective functions are expressed as follows:

$${f_1} = \frac{1}{{2MN}}\sum\limits_{m = 1}^M {\sum\limits_{n = 1}^N {(||{p_l^{mn} - \bar{p}_l^{mn}} ||} } + ||{p_r^{mn} - \bar{p}_r^{mn}} ||)$$
$${f_2} = \frac{1}{{MI}}\sum\limits_{m = 1}^M {\sum\limits_{i = 1}^I {|{d_i^m - {{\bar{d}}_i}} |} }$$
where M is the number of 3D calibration target images, N is the number of coded points on the 3D calibration target, I is the number of selected 3D point pairs, p is the reprojection coordinate of the reconstructed point, $\overline p$ is the detected coordinate of the feature point, d is the distance between reconstructed point pairs, and $\overline d$ is the true distance. To reduce the number of parameters to be optimized, the component $n^z$ of the refractive axis is replaced by $\sqrt {1 - {{({n^x})}^2} - {{({n^y})}^2}}$, and the rotation matrix R is converted to the angles ${\omega _1},{\omega _2},{\omega _3}$. Finally, 12 parameters were optimized: ${d_l},{d_r},n_l^x,n_l^y,n_r^x,n_r^y,{\omega _1},{\omega _2},{\omega _3},{t_1},{t_2},{t_3}$.
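For concreteness, the two objectives of Eqs. (12) and (13) can be evaluated as follows (a minimal sketch; the array shapes are assumptions about how detections and distances are stored):

```python
import numpy as np

def dual_objectives(p_l, pbar_l, p_r, pbar_r, d, dbar):
    """Evaluate the dual objectives of Eqs. (12)-(13).
    p_l, p_r       : (M, N, 2) reprojected pixel coordinates, left/right camera
    pbar_l, pbar_r : (M, N, 2) detected pixel coordinates
    d              : (M, I) reconstructed point-pair distances
    dbar           : (I,)   true (photogrammetric) distances
    Returns (f1, f2): mean reprojection error, mean distance error."""
    M, N, _ = p_l.shape
    f1 = (np.linalg.norm(p_l - pbar_l, axis=2).sum()
          + np.linalg.norm(p_r - pbar_r, axis=2).sum()) / (2 * M * N)
    f2 = np.abs(d - dbar[None, :]).mean()
    return f1, f2
```

An NSGA-II individual is then the 12-parameter vector listed above, and its fitness is the pair (f1, f2).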

3. Experiments and results

3.1 Results of refraction parameters

First, the accuracy of the refractive axis estimation was verified. Because it is difficult to directly obtain the true orientation of the refractive axis in the camera coordinate system, we designed the experiment shown in Fig. 4, in which a high-precision electrically driven rotary stage rotated a fixed camera by 2.5° four times, and 10-20 photos of the 3D calibration target were taken at each position. The accuracy of the proposed method was verified by calculating the change in angle of the refractive axis. We compared the results of our algorithm with those of the 11-point algorithm proposed by Agrawal et al. [14]. Table 1 shows that our algorithm is significantly superior to the 11-point algorithm and that the measurement error of the relative rotation angle is less than 3%.

Table 1. Results of refractive axis rotation angle

Figure 5(a) shows the waterproof housing and underwater binocular camera system. The entire system operated stably for several hours under a pressure of 3.75 MPa, as shown in Fig. 5(b). We conducted a series of experiments based on the underwater binocular camera system, including calibration and underwater 3D reconstruction. Fifteen underwater images of the 3D calibration target in different poses were captured with the binocular cameras and used for both initial value estimation and optimization. The calibration method is described in Section 2. Table 2 lists the initial values of the refraction parameters obtained with the proposed method and the ranges used for optimization.

Fig. 5. (a) Waterproof housing, (b) pressure experiment

Table 2. Values of refraction parameters

Figure 6 shows the changes in the values of the optimization objectives during the optimization process, where each point represents an individual in the population. As the iterations progressed, the value of the objective function rapidly decreased and tended to stabilize around the 50th generation, with little change until the 100th generation. Specifically, once the iteration stabilizes, owing to the effect of non-dominated sorting, some individuals make f1 very small but f2 large, whereas others do the opposite. The individuals in red circle c in Fig. 6 make both f1 and f2 relatively small; therefore, the final results were obtained from these individuals. Based on the experimental results and considering computational efficiency and accuracy, the recommended population size is 100 and the recommended number of iterations is 50.
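The selection of compromise individuals from the final population (the red-circle-c region in Fig. 6) can be sketched as follows. The normalized-sum "knee" criterion used here is one reasonable choice for picking an individual with both objectives relatively small, not necessarily the exact rule used in the experiments:

```python
import numpy as np

def pareto_front(F):
    """Indices of non-dominated individuals; F is (n, 2) objective values (f1, f2)."""
    idx = []
    for i, f in enumerate(F):
        # f is dominated if some other row is <= f everywhere and < f somewhere
        dominated = np.any(np.all(F <= f, axis=1) & np.any(F < f, axis=1))
        if not dominated:
            idx.append(i)
    return idx

def compromise_individual(F):
    """Front member minimizing the normalized objective sum (a knee-like pick)."""
    front = pareto_front(F)
    G = F[front]
    Gn = (G - G.min(axis=0)) / (np.ptp(G, axis=0) + 1e-12)  # normalize each objective
    return front[int(np.argmin(Gn.sum(axis=1)))]
```

Individuals at the extremes of the front minimize one objective at the expense of the other; the compromise pick corresponds to the circled region where both errors are small.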

Fig. 6. Optimization process based on NSGA-II

Part of the detailed optimization results are presented in Fig. 7. We selected point pairs 129-221, 207-223, and 156-195 and obtained the distance error between the true distance $\overline d$ and the distance d calculated from the reconstructed points, $\Delta = d - \overline d$. Coded points 207, 223, and 117 were selected, and the reprojection errors were calculated as $\Delta = |p - \overline p |$. The results indicate that among the 15 calibration poses, all distance errors of the three point pairs lie within ±0.3 mm, and all reprojection errors are less than 0.25 pixels. The final values of the refraction parameters are listed in the last column of Table 2.

Fig. 7. Optimization results: (a) distance error, (b) reprojection error

3.2 Performance of underwater 3D measurement

Three experiments were conducted to demonstrate the performance of the proposed method in underwater 3D reconstruction. First, the absolute position error of the reconstructed point cloud was studied. We placed a planar calibration board in the water tank, kept the relative position between the binocular camera and the calibration board unchanged, and calculated the 3D coordinates of the feature points in air and in water. In air, the observation window of the waterproof housing was removed to ensure measurement accuracy (in principle, the waterproof housing of a camera used in deep-water environments should not be opened after being properly sealed). As shown in Fig. 8(b), taking the 3D coordinates in air as the ground truth, the positions of the underwater calibration board feature points obtained using the proposed method were very close to the ground truth, with an average error of 1.081 mm. Furthermore, the reprojection error of the reconstructed points was very small, with an average of 0.506 pixels, as shown in Fig. 8(a).

Fig. 8. Performance of the optimized refraction parameters: (a) reprojection error, (b) absolute position error

The experimental setup shown in Fig. 9 was designed to test the displacement measurement accuracy of the proposed method. A plate with speckles was fixed to an electrically driven translation stage to generate horizontal and vertical displacements. The horizontal direction x was parallel to the camera baseline, and the vertical direction y was perpendicular to the camera baseline, as shown in Fig. 9. The translation stage was moved by 1 mm 10 times. The calculated full-field displacements are listed in Table 3. In both the vertical and horizontal directions, the average displacement error was less than 0.040 mm, and the standard deviation was less than 0.032 mm, which illustrates the reliability of the method.

Fig. 9. Experimental setup for underwater displacement experiments

Table 3. Results of displacement experiments (mm)

Finally, the 3D surface morphology of a concrete brick was measured. As shown in Fig. 10(b), we used a laser projector to project a random speckle pattern onto the surface of the measured object and then obtained the 3D point cloud of the brick surface using the proposed method. The reconstructed morphology provides rich information with high accuracy, and the Chinese characters on the concrete brick are clearly visible, as shown in Fig. 10(c).

Fig. 10. Underwater 3D reconstruction: (a) concrete brick, (b) projected speckle, (c) reconstructed point cloud

4. Discussion

We now explain the reasons for adopting a 3D calibration target and dual optimization objectives. For a planar calibration board, constraints on corner spacing, parallelism, and perpendicularity between lines must be considered simultaneously. Too many optimization objectives slow down the optimization or easily trap the results in local optima. A 3D calibration target facilitates the construction of simple and strong constraints on the spatial side. In Section 2, we showed that multiple images of a 3D calibration target can be used for a more accurate estimation of the refractive axis. In fact, 3D calibration targets based on close-range photogrammetry are not only inexpensive but also highly precise and easy to make. Therefore, a 3D calibration target was selected for the proposed method.

Examining the 3D reconstruction performance of the individuals in red circle a in Fig. 6, the average absolute position error was 7.168 mm and the average reprojection error was 0.492 pixels. For the individuals in red circle b, the average absolute position error was 1.052 mm and the average reprojection error was 2.110 pixels, as shown in Fig. 11. This indicates that if only a single-side optimization objective is adopted, the optimization result for that objective may be satisfactory, but the parameters do not truly match the model, probably because of the strong nonlinearity of underwater imaging. Therefore, errors on both the image and space sides should be minimized simultaneously to achieve a stronger bundle adjustment. Furthermore, the proposed numerical method for forward projection significantly accelerates the calculation, making it feasible to use the reprojection error as an optimization objective.

Fig. 11. (a) Reprojection error based on an individual in red circle b, (b) absolute position error based on an individual in red circle a

5. Conclusion

We propose a sound refraction correction method for deep-water 3D visual measurements. Using a 3D calibration target, we obtained a more accurate initial estimate of the refractive axis. The reprojection and distance errors were defined as the dual optimization objectives, and the reasons for choosing them were discussed in detail. Because of the nonlinearity of the underwater imaging model, errors on both the image and space sides should be minimized simultaneously. The proposed numerical method for forward projection significantly reduces complexity and accelerates the calculation: the projected coordinates can be computed in 4-5 iterations with a final error of less than ${10^{ - 5}}$ pixels.

Experiments verified the performance of the proposed method for underwater 3D reconstruction. The average absolute position error of the reconstructed points was 1.081 mm and the average reprojection error was 0.506 pixels. The results of the displacement experiment indicate that in both the vertical and horizontal directions, the average displacement error is less than 0.040 mm, and the standard deviation is less than 0.032 mm, which illustrates the reliability of the method.

Funding

National Key Research and Development Program of China (2020YFC1511900); National Natural Science Foundation of China (12272093).

Disclosures

The authors declare that there are no conflicts of interest related to this article.

Data availability

Data underlying the results presented in this paper are not publicly available at this time but may be obtained from the authors upon reasonable request.

References

1. Y. Cong, C. J. Gu, T. Zhang, and Y. J. Gao, “Underwater robot sensing technology: A survey,” Fundam. Res. 1(3), 337–345 (2021). [CrossRef]  

2. C. Y. Li, C. L. Guo, W. Q. Ren, R. M. Cong, J. H. Hou, S. Kwong, and D. C. Tao, “An Underwater Image Enhancement Benchmark Dataset and Beyond,” IEEE Trans. Image Process. 29, 4376–4389 (2020). [CrossRef]

3. A. Palomer, P. Ridao, and D. Ribas, “Inspection of an underwater structure using point-cloud SLAM with an AUV and a laser scanner,” J. Field Rob. 36(8), 1333–1344 (2019). [CrossRef]  

4. G. Telem and S. Filin, “Photogrammetric modeling of underwater environments,” ISPRS J Photogram Remote Sens. 65(5), 433–444 (2010). [CrossRef]  

5. M. P. Hayes and P. T. Gough, “Synthetic Aperture Sonar: A Review of Current Status,” IEEE J. Oceanic Eng. 34(3), 207–224 (2009). [CrossRef]  

6. D. P. Williams, “Underwater Target Classification in Synthetic Aperture Sonar Imagery Using Deep Convolutional Neural Networks,” in 23rd International Conference on Pattern Recognition (ICPR) (2016), pp. 2497–2502.

7. Y. C. Yu, J. H. Zhao, Q. H. Gong, C. Huang, G. Zheng, and J. Y. Ma, “Real-Time Underwater Maritime Object Detection in Side-Scan Sonar Images Based on Transformer-YOLOv5,” Remote Sens. 13(18), 3555 (2021). [CrossRef]  

8. Y. H. Kwon and J. B. Casebolt, “Effects of light refraction on the accuracy of camera calibration and reconstruction in underwater motion analysis,” Sports Biomech. 5(1), 95–120 (2006). [CrossRef]  

9. J. M. Lavest, G. Rives, and J. T. Lapresté, “Underwater Camera Calibration,” in Computer Vision — ECCV 2000 (Springer, Berlin Heidelberg, 2000), pp. 654–668.

10. R. Ferreira, J. P. Costeira, and J. A. Santos, “Stereo reconstruction of a submerged scene,” in Pattern Recognition and Image Analysis, Pt 1, Proceedings, J. S. Marques, N. PerezdelaBlanca, and P. Pina, eds. (2005), pp. 102–109.

11. L. Kang, L. D. Wu, and Y. H. Yang, “Experimental study of the influence of refraction on underwater three-dimensional reconstruction using the SVP camera model,” Appl. Opt. 51(31), 7591–7603 (2012). [CrossRef]  

12. C. Kunz and H. Singh, “Hemispherical Refraction and Camera Calibration in Underwater Vision,” in OCEANS 2008 Conference (2008), pp. 1097–1103.

13. T. Treibitz, Y. Y. Schechner, and H. Singh, “Flat refractive geometry,” in IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2008).

14. A. Agrawal, S. Ramalingam, Y. Taguchi, and V. Chari, “A Theory of Multi-Layer Flat Refractive Geometry,” in IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2012), pp. 3346–3353.

15. X. D. Chen and Y. H. Yang, “Two-View Camera Housing Parameters Calibration for Multi-Layer Flat Refractive Interface,” in IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2014), pp. 524–531.

16. Z. L. Su, J. Y. Pan, L. Lu, M. L. Dai, X. Y. He, and D. S. Zhang, “Refractive three-dimensional reconstruction for underwater stereo digital image correlation,” Opt. Express 29(8), 12131–12144 (2021). [CrossRef]  

17. Z. L. Su, J. Y. Pan, S. Q. Zhang, S. Wu, Q. F. Yu, and D. S. Zhang, “Characterizing dynamic deformation of marine propeller blades with stroboscopic stereo digital image correlation,” Mech Syst Sig Proces 162, 108072 (2022). [CrossRef]  

18. A. Jordt-Sedlazeck and R. Koch, “Refractive Structure-from-Motion on Underwater Images,” in IEEE International Conference on Computer Vision (ICCV) (2013), pp. 57–64.

19. A. Jordt-Sedlazeck and R. Koch, “Refractive Calibration of Underwater Cameras,” in Computer Vision – ECCV 2012 (Springer, Berlin Heidelberg, 2012), 846–859.

20. C. Zhang, X. Zhang, Y. K. Zhu, J. B. Li, and D. W. Tu, “Model and calibration of underwater stereo vision based on the light field,” Meas. Sci. Technol. 29(10), 105402 (2018). [CrossRef]  

21. S. H. Kong, X. Fang, X. Y. Chen, Z. X. Wu, and J. Z. Yu, “A NSGA-II-Based Calibration Algorithm for Underwater Binocular Vision Measurement System,” IEEE Trans. Instrum. Meas. 69(3), 794–803 (2020). [CrossRef]  

22. Y. J. Chang and T. H. Chen, “Multi-View 3D Reconstruction for Scenes under the Refractive Plane with Known Vertical Direction,” in IEEE International Conference on Computer Vision (ICCV) (2011), pp. 351–358.

23. R. Rofallski and T. Luhmann, “An Efficient Solution to Ray Tracing Problems in Multimedia Photogrammetry for Flat Refractive Interfaces,” Pfg-J Photogram Remote Sens Geoinfo Sci. 90(1), 37–54 (2022). [CrossRef]  

24. Y. P. Ma, Y. Q. Zhou, C. K. Wang, Y. Wu, Y. Zou, and S. Zhang, “Calibration of an underwater binocular vision system based on the refraction model,” Appl. Opt. 61(7), 1675–1686 (2022). [CrossRef]  

25. B. Elnashef and S. Filin, “Target-free calibration of flat refractive imaging systems using two-view geometry,” Opt. Laser Eng. 150, 106856 (2022). [CrossRef]  

26. K. Deb, S. Agrawal, A. Pratap, and T. Meyarivan, “A fast elitist non-dominated sorting genetic algorithm for multi-objective optimization: NSGA-II,” KanGAL report, Indian Institute of Technology (2000). [CrossRef]
