
Background-oriented Schlieren tomography using gated recurrent unit


Abstract

Current background-oriented schlieren tomography (BOST) methods rely primarily on iterative algorithms for reconstruction. Before reconstruction, a weight projection matrix must be generated by 3D ray tracing using the projection relationships between the cameras; this matrix depends on the camera calibration parameters, and its large size, together with ray-tracing errors, introduces artifacts and greatly reduces computational efficiency. Considering that CT reconstruction uses spatial projection sequences from multiple directions, this study draws inspiration from recurrent neural networks (RNNs) and exploits the spatial correlation between adjacent projections to propose a background-oriented schlieren reconstruction method based on a gated recurrent unit (GRU) neural network. First, the model architecture is designed and implemented. Subsequently, numerical simulations using a methane combustion model are conducted to evaluate the proposed method, which achieves an average mean relative error (MRE) of 0.23%. Finally, reconstruction experiments are performed on actual flow-field data above a candle flame, yielding a reprojection correlation coefficient of 89% and an average reconstruction time of only 1.04 s per frame. The results demonstrate that the proposed method outperforms traditional iterative reconstruction methods in terms of both reconstruction speed and accuracy, providing a feasible solution for the real-time reconstruction of three-dimensional instantaneous flow fields.

© 2023 Optica Publishing Group under the terms of the Optica Open Access Publishing Agreement

1. Introduction

Schlieren and shadowgraph techniques have been extensively employed for imaging and measuring flow-field structures. The history of schlieren and shadow photography dates back to the 17th century [1]; however, for a long time, they were primarily qualitative visualization methods that lacked the quantitative capabilities of interferometric measurements [2].

Background-oriented schlieren (BOS) has been one of the most significant developments in this field since the beginning of the 21st century. The optical setup of BOS was introduced by L.M. Weinstein [3], and was then further developed and applied to very large fields of view by Gary Settles [4]. Dalziel et al. [5] analytically described the principle of BOS in 2000. In the same year, Meier [6] suggested that BOS could be applied to both the flow visualization and CT reconstruction of flow fields. BOS simplifies the optical setup required to obtain information on light deflection, requiring only the flow under study to be placed between the camera and the textured background on which the camera is focused. BOS is technically easy to implement, has relatively low equipment costs and a wide field of view, and can perform reliable measurements under extreme conditions, making it highly valuable for research purposes. Multigroup BOS synchronized recordings enable the three-dimensional reconstruction of nonaxisymmetric unsteady flows [7].

In 2000, Raffel et al. further refined BOS and demonstrated its applicability to flow-field measurements by visualizing the density field of the tip vortices of a helicopter in hovering flight [8]. In 2004, Venkatakrishnan et al. [9] used BOS to obtain the density field of an axisymmetric supersonic flow over a cone-cylinder model and found an excellent correlation between the densities obtained from BOS and those obtained from the cone surface data. In 2007, Atcheson et al. [10] assessed the performance of optical flow algorithms in BOS and pointed out that combining optical flow with a multiscale background could significantly improve BOS performance.

One limitation of this method is that it is still a two-dimensional flow measurement technique that only detects path-integrated information projected onto an image plane. For axisymmetric flows, a single camera can be used for measurements. Time-averaged two-dimensional displacements are converted into line-averaged densities using the Poisson equation, and two-dimensional slices of the three-dimensional density field are then reconstructed using Abel inversion. Several subsequent tests have applied this single-camera method to axisymmetric targets [11,12]. For example, Sourgen et al. [13] compared numerical simulations with the BOS results using an inverse Abel transformation.

To measure the three-dimensional flow characteristics of complex, non-axisymmetric turbulence, light deflection information from multiple angles can be combined with tomographic imaging algorithms to reconstruct the three-dimensional refractive index field. This is referred to as background-oriented schlieren tomography (BOST). However, purchasing multiple cameras with sufficient acquisition rates to obtain optical deflection information from multiple angles may sometimes be prohibitively expensive. Therefore, Mateo Gomez et al. [14] coupled an ultrahigh-speed camera to a view splitter and illuminated the background with an adequate light source, achieving megahertz time resolution and quantitative reconstruction without symmetry assumptions. Bathel et al. [15] also placed frame splitters in front of each high-speed camera’s lens so that two independent views of the flow could be acquired with each camera, achieving high-speed visualization of a turbulent jet. Classical BOST methods typically consist of two steps, the first being CT reconstruction from displacement images. One fundamental approach to CT reconstruction is the use of back-projection methods, such as filtered back-projection [16–18]; however, these inevitably produce artifacts, especially when the projection angles and the amount of projection data are limited. Therefore, starting in 2010, Ota et al. [19–21] replaced filtered back-projection with the algebraic reconstruction technique (ART) using rotating cameras. ART yields better reconstruction results than filtered back-projection when the amount of data is limited.

The second step in BOST integrates the results after CT reconstruction, because the direct outcome of CT reconstruction is the distribution of refractive index gradients. The simplest method is to perform line integration directly along each direction [8,22]; however, this approach leads to the accumulation of line noise. Alternatively, the Poisson equation can be employed; Atkinson and Hancock [23] proposed the first time-resolved BOST model, deriving the three-dimensional unsteady flow reconstruction process for the refractive index field from gradients through Poisson integration.

In recent years, one-step reconstruction methods have increasingly been proposed as alternatives to the traditional two-step reconstruction. Nicolas et al. [24] introduced a model for estimating the density field directly from image displacement fields, avoiding the intermediate step of integrating the reconstructed density gradients, and employed regularization techniques to address the ill-posed problem. Cai et al. [25] developed a 3D radial-basis-function-based BOST reconstruction method that requires neither integration nor additional finite differences. In addition, Masahito Akamine et al. [26] proposed a new extension that uses a wall as a mirror to provide sufficient light paths, addressing the limitations of measuring near-wall regions, where most of the light paths are blocked.

Meanwhile, CT reconstruction can be performed using deep learning. Jin et al. proposed flame chemiluminescence tomography (FCT) based on convolutional neural networks (CNNs) [27], which demonstrated rapid combustion monitoring capabilities and computational efficiency in three-dimensional FCT measurements. Lei et al. applied an extreme learning machine (ELM) to speed up and improve the quality of reconstruction for electrical capacitance tomography [28]. Yu et al. further applied the ELM to tomographic absorption spectroscopy [29], dramatically reducing the computational time compared with classical iterative methods. However, this CNN-based FCT approach lacks a clear physical interpretation and treats deep learning as a black box. In contrast, BOS-CT reconstructs the measured field using multidirectional displacements, where each projection observation from different angles refers to the same target, and relations exist between adjacent projections. This physical correlation forms the foundation for CT reconstruction.

Therefore, deep learning models should also have the ability to learn and capture this type of correlation in the data; for example, Huang et al. [30] captured the correlation of adjacent frames to achieve time-resolved prediction of 3D flame evolution based on long short-term memory (LSTM). In recent years, with significant achievements in the field of natural language processing, recurrent neural networks (RNNs) [31] have attracted increasing research attention and applications. Compared to traditional neural networks, RNNs are better suited for tasks involving time-series inputs because they can retain the influence of previous inputs and use it when computing subsequent outputs. Because of their unique structure, RNNs have played an important role in language modeling [32], speech recognition [33,34], machine translation [35], audio and video data analysis [36], and image caption modeling [37]. Theoretically, RNNs can utilize time-series information of any length. However, in practice, vanishing or exploding gradients readily occur when the number of steps between two related inputs becomes too large [38]. To address the vanishing and exploding gradient issues in RNNs, Chung et al. [39] proposed the gated recurrent unit (GRU). As variants of RNNs, GRUs can learn long-term dependencies and exhibit a simpler structure with fewer parameters than another RNN variant, LSTM [40], making them popular in current research. Cahuantzi et al. [41] demonstrated that GRUs outperform LSTMs on low-complexity sequences.

In the field of 3D reconstruction, RNNs have been increasingly utilized owing to the contextual information accumulated across multiview inputs. Choy et al. [42] proposed an early image reconstruction network based on 3D RNNs, 3D-R2N2; however, its 3D model resolution and accuracy were limited. Le et al. [43] introduced a multi-view recurrent neural network (MV-RNN) for 3D mesh segmentation. In 2021, Sun et al. [44] presented a real-time 3D reconstruction network called NeuralRecon, which uses a GRU to guide the network in fusing features from previous segments. This allows the network to capture both local information and global shape priors when sequentially reconstructing surfaces, ultimately achieving real-time incremental reconstruction by merging the fragment feature volumes over time. Zuo et al. [45] adopted a multi-view stereo (MVS) network to overcome the inability to extract the most relevant features from an entire video before feature-volume fusion and processing. These works show that GRU-based methods can directly learn physically meaningful spatial correlations. For optical CT reconstruction, classical iterative methods generate a weight projection matrix by performing 3D ray tracing using the projection relationship between cameras. With a GRU model, these computational steps are not needed to represent the physical projection process, thereby avoiding the artifacts caused by ray-tracing errors and the inefficient iterative calculations associated with the large-scale projection matrix.

To achieve real-time, high-precision 3D flow-field reconstruction by utilizing the inherent contextual information of BOST multiangle projection sequences, this study proposes a fast BOST reconstruction method based on a GRU model. Section 2 provides a detailed introduction to the principles of BOST. In Section 3, the design philosophy and overall structure of our model are introduced in detail, along with the architecture of the ResNet and GRU modules. In Section 4, numerical simulations are performed on a simulated three-dimensional flow-field structure for methane combustion, and the performance of the GRU model is validated by calculating the mean relative error (MRE) and structural similarity index measure (SSIM) of the 3D reconstruction results. Finally, Section 5 presents validation results for the hot airflow above a candle flame measured by a BOST system of 12 AVT cameras. Through experimental verification, the GRU model was able to reconstruct the 3D refractive index distribution above a candle flame from real-time displacement data. Compared to the ART algorithm, this model directly obtains reconstruction results with higher accuracy and computational efficiency, without the need for weight projection matrices and ray tracing.

2. BOS theory

A schematic diagram of the BOS is shown in Fig. 1, which mainly consists of three parts: the background pattern, test flow field, and camera. When light rays from the background pattern pass through the non-uniform test flow field, they are refracted at a certain deflection angle owing to the variation in the refractive index. Consequently, the position at which the light rays pass through the lens and reach the imaging plane is offset from that of the reference light rays. The deflection angle of the light rays is the integral of the refractive index gradient along the path of the light rays and can be expressed as

$$\varepsilon^{(\alpha)} = \frac{1}{n_0}\int_{s\,\in\,\mathrm{ray}} \frac{\partial n}{\partial \alpha}\,ds,\quad \alpha \in \{x, y, z\}$$

Here, $\varepsilon^{(\alpha)}$ represents the component of the deflection angle along direction $\alpha$, $n_0$ is the refractive index of the medium surrounding the test flow field, and $ray$ denotes the path of the light ray. In general, because the deflection angle $\varepsilon$ is small and the paraxial approximation holds, the integral path of the light ray can be approximated by the undisturbed path. Under this assumption, the relationship between the deflection angle $\varepsilon$ and the background pattern displacement $\Delta \alpha$ can be described by [46]

$$\Delta \alpha = \frac{l_A\, l_C}{l_A + l_B}\,\varepsilon^{(\alpha)}$$

Fig. 1. Schematic diagram of the BOS theory.

Here, $l_A$, $l_B$, and $l_C$ are the distances between the background pattern and the test flow field, between the test flow field and the lens, and between the lens and the imaging plane, respectively, and $\Delta u$ and $\Delta v$ are the displacements in the imaging plane obtained from background image-processing algorithms. Using Eqs. (1) and (2), the refractive index distribution of the test flow field can be obtained from the displacements. In certain situations, it may be necessary to obtain additional parameters of the flow field, such as density and temperature. According to the Gladstone-Dale equation, the relationship between the refractive index and density can be expressed as [47]

$$\rho \; = \;\frac{{({n - 1} )}}{{{K_{GD}}}}$$

Here, n represents the refractive index and $\rho $ represents the density. Additionally, ${K_{GD}}$ is the Gladstone–Dale constant, which is a function of the wavelength of light. Based on the ideal gas law, temperature T can be expressed as a function of the density:

$$T\; = \,\frac{{PM}}{{R\rho }}$$
where P represents the atmospheric pressure, M represents the molar mass of the gas, and R represents the universal gas constant. Therefore, the temperature can be calculated from the refractive index using Eqs. (3) and (4).
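For readers implementing this post-processing chain, the following Python sketch applies Eqs. (2)-(4) to convert a background displacement into a deflection angle and a refractive-index field into density and temperature. The geometry values, the Gladstone-Dale constant for air, and the gas properties used here are illustrative assumptions, not the values used in this work.

```python
import numpy as np

# Illustrative constants (assumed values, not those of the experiment).
l_A, l_B, l_C = 0.75, 0.65, 0.012     # distances in meters
K_GD = 2.26e-4                        # Gladstone-Dale constant of air, m^3/kg (typical visible-light value)
P, M, R = 101325.0, 0.029, 8.314      # pressure (Pa), molar mass (kg/mol), gas constant (J/(mol*K))

def deflection_from_displacement(delta_alpha):
    """Invert Eq. (2): deflection angle from the background-pattern displacement."""
    return delta_alpha * (l_A + l_B) / (l_A * l_C)

def density_from_n(n):
    """Eq. (3): Gladstone-Dale relation."""
    return (n - 1.0) / K_GD

def temperature_from_density(rho):
    """Eq. (4): ideal gas law."""
    return P * M / (R * rho)

n = np.full((100, 100, 160), 1.000262)   # a uniform refractive-index field (ambient air)
T = temperature_from_density(density_from_n(n))
print(T.mean())                           # roughly room temperature
```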

3. 3D BOS based on GRU

3.1 Overall architecture

The architecture of the implemented model is shown in Fig. 2. The entire network is divided into three parts: feature extraction, reprojection, and a GRU module. First, in the encoder, displacement data from multiple viewing angles are fed into a feature extraction network to extract multilevel features ranging from sparse to dense. The feature extraction network adopts a residual network (ResNet), as shown in Fig. 4. ResNet not only addresses the issues of vanishing gradients, exploding gradients, and model degradation that occur with increasing network depth, but also naturally allows for multidimensional outputs, facilitating multidimensional reconstruction in the subsequent stages of the model.

Fig. 2. Architecture of the overall model.

After encoding, reconstruction proceeds in a coarse-to-fine, pyramid-like manner. Starting from the sparsely extracted features, the two-dimensional feature maps are back-projected onto the corresponding three-dimensional grids and then fed into the GRU module through a 3D sparse convolution. In the GRU module, the features generated by previous inputs are written to a global hidden state, thereby integrating global prior knowledge from the other viewing angles. Simultaneously, the resulting sparse flow-field distribution serves as the input to the next GRU fusion module, eventually leading to a dense global prediction.
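The following PyTorch sketch illustrates the sequential, view-by-view fusion idea described above. The module names, feature sizes, and the dense feature "lifting" used in place of the actual back-projection and 3D sparse convolution are simplifications introduced for illustration only; they are not the layers of the proposed network.

```python
import torch
import torch.nn as nn

# Structural sketch of the per-view fusion loop; all layers are stand-ins.
class FusionSketch(nn.Module):
    def __init__(self, feat_ch=32, vox=16):
        super().__init__()
        self.vox = vox
        self.encoder = nn.Conv2d(2, feat_ch, 3, padding=1)       # stand-in 2D feature extractor
        self.lift = nn.Linear(feat_ch, feat_ch)                  # stand-in for back-projection to the 3D grid
        self.gru = nn.GRUCell(feat_ch, feat_ch)                  # per-voxel update of the global hidden state
        self.head = nn.Linear(feat_ch, 1)                        # refractive-index prediction per voxel

    def forward(self, views):                                    # views: (n_views, 2, H, W) displacement maps
        n_vox = self.vox ** 3
        h = torch.zeros(n_vox, self.gru.hidden_size)             # global hidden state shared across views
        for i in range(views.shape[0]):                          # sequential fusion over viewing angles
            f2d = self.encoder(views[i:i + 1]).mean(dim=(2, 3))  # (1, feat_ch) pooled 2D features
            f3d = self.lift(f2d).expand(n_vox, -1)               # broadcast features to every voxel
            h = self.gru(f3d, h)                                 # integrate this view with prior views
        return self.head(h).view(self.vox, self.vox, self.vox)

vol = FusionSketch()(torch.randn(12, 2, 126, 185))               # 12 views -> one 3D volume
print(vol.shape)                                                 # torch.Size([16, 16, 16])
```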

3.2 Gated recurrent unit (GRU)

The GRU is a variant of the RNN. Compared with traditional neural networks, RNNs are better equipped to handle sequential data with temporal dependencies. This is because the internal recurrent units of the RNNs can utilize contextual information from previous inputs and retain the impact of these inputs in the model’s decision-making process with the current input.

Theoretically, RNNs can leverage information from sequences of arbitrary length. However, in practice, when the time steps between the two inputs are too large, vanishing or exploding gradients may arise. This phenomenon causes the RNN to lose its ability to learn long-term dependencies as the time interval increases. GRU addresses this issue by improving the hidden layer nodes of the RNN. Its special gate structure effectively solves the problem of processing longer sequential data and offers a simpler alternative to another variant of the RNN called LSTM. The GRU requires fewer parameters and performs better on small-scale datasets [39].

The internal structure of the GRU is illustrated in Fig. 3. Its core components are two gate units: the update gate and the reset gate. The update gate determines the extent to which the previous hidden state affects the current hidden state. This is responsible for capturing long-term dependencies in the sequence. By contrast, the reset gate determines the degree to which the current input is combined with the previous hidden state. This is responsible for capturing the short-term dependencies in a sequence. The hidden layer output ${h_t}$ of the GRU can be obtained as follows:

$$\left\{\begin{aligned} &\text{Update gate: } z_t = \sigma(w_z \cdot [h_{t-1}, x_t] + b_z)\\ &\text{Reset gate: } r_t = \sigma(w_r \cdot [h_{t-1}, x_t] + b_r)\\ &\text{Candidate value: } \tilde{h}_t = \tanh(w_{\tilde{h}} \cdot [r_t \odot h_{t-1}, x_t] + b_{\tilde{h}})\\ &\text{Output: } h_t = (1 - z_t) \odot h_{t-1} + z_t \odot \tilde{h}_t \end{aligned}\right.$$

Here, $x_t$, $h_{t-1}$, $z_t$, and $r_t$ represent the input to the GRU hidden layer node, the previous hidden state, the update gate output, and the reset gate output, respectively. $x_t$ and $h_{t-1}$ jointly determine the candidate activation value $\tilde{h}_t$. w and b represent the weight and bias parameters learned during training, and ${\odot}$ denotes element-wise multiplication. $\sigma$ and $\tanh$ represent the sigmoid and hyperbolic tangent activation functions, respectively.
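A direct NumPy transcription of Eq. (5) for a single GRU step is given below; the weight shapes and random initialization are illustrative only.

```python
import numpy as np

# NumPy transcription of Eq. (5) for a single GRU step.
def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def gru_step(x_t, h_prev, w_z, w_r, w_h, b_z, b_r, b_h):
    xh = np.concatenate([h_prev, x_t])                                   # [h_{t-1}, x_t]
    z_t = sigmoid(w_z @ xh + b_z)                                        # update gate
    r_t = sigmoid(w_r @ xh + b_r)                                        # reset gate
    h_tilde = np.tanh(w_h @ np.concatenate([r_t * h_prev, x_t]) + b_h)   # candidate value
    return (1.0 - z_t) * h_prev + z_t * h_tilde                          # output hidden state

rng = np.random.default_rng(0)
d_in, d_h = 8, 16
w_z, w_r, w_h = (rng.standard_normal((d_h, d_h + d_in)) * 0.1 for _ in range(3))
b_z, b_r, b_h = np.zeros(d_h), np.zeros(d_h), np.zeros(d_h)

h = np.zeros(d_h)
for x_t in rng.standard_normal((5, d_in)):            # a short input sequence
    h = gru_step(x_t, h, w_z, w_r, w_h, b_z, b_r, b_h)
print(h.shape)                                        # (16,)
```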

Fig. 3. Architecture of the GRU cell.

3.3 Image encoder (feature extraction)

In the image encoder, ResNet50 [48] was used as the feature extraction network. Deeper network structures are often used in deep learning to extract richer features. However, simply increasing the number of layers can lead to problems, such as vanishing gradients, exploding gradients, and model degradation. This problem can be partially addressed using techniques such as normalized initialization and intermediate normalization [49] to ensure the convergence of networks with dozens of layers. However, with even deeper networks, the accuracy saturates, and the performance deteriorates.

To address this issue, the ResNet introduces residual blocks. By adding a “shortcut connection” between the input and output in the feedforward network, the input is passed across layers, ensuring that the model’s performance does not deteriorate with increasing depth. In addition, it allows for multidimensional outputs, facilitating multidimensional reconstruction in the subsequent stages of the model.
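As a concrete illustration, the following sketch uses the torchvision implementation of ResNet50 as a multi-level feature extractor. The choice of stages and the two-channel (Δu, Δv) input adaptation are our assumptions and do not necessarily match the exact encoder configuration used in this work.

```python
import torch
import torchvision

# Sketch: torchvision's ResNet50 as a multi-level feature extractor for a
# two-channel displacement input (du, dv).
backbone = torchvision.models.resnet50(weights=None)
backbone.conv1 = torch.nn.Conv2d(2, 64, kernel_size=7, stride=2, padding=3, bias=False)

def extract_features(x):
    """Return coarse-to-fine feature maps from three residual stages."""
    x = backbone.maxpool(backbone.relu(backbone.bn1(backbone.conv1(x))))
    c2 = backbone.layer1(x)    # 1/4 resolution, 256 channels
    c3 = backbone.layer2(c2)   # 1/8 resolution, 512 channels
    c4 = backbone.layer3(c3)   # 1/16 resolution, 1024 channels
    return c2, c3, c4

with torch.no_grad():
    feats = extract_features(torch.randn(1, 2, 126, 185))
print([tuple(f.shape) for f in feats])
```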

The overall structure of ResNet50 is shown in Fig. 4.

Fig. 4. Architecture of the ResNet50.

4. Numerical simulation

First, the feasibility of the method was validated using simulated data. Twelve simulated cameras were uniformly distributed within a 165° range, as shown in Fig. 5, and a synthetic dataset was generated. The reference image was randomly cropped from the captured images to a size of 185 × 126 pixels. A computational fluid dynamics (CFD) large eddy simulation (LES) was used to simulate the combustion flow field. With the simulated flow field and the determined positions and orientations of the cameras, the displacements can be calculated using Eqs. (1) and (2). The flow images can then be obtained by remapping the reference image using the calculated displacements.
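The remapping step can be sketched as follows: the reference background image is resampled at positions shifted by the simulated displacement field. The random reference pattern and the smooth synthetic displacement field below are placeholders for the actual background and the CFD-derived displacements.

```python
import numpy as np
from scipy.ndimage import map_coordinates

# Sketch of the remapping step: warp the reference background by a displacement
# field to obtain the "flow" image. Pattern and displacements are placeholders.
rng = np.random.default_rng(1)
ref = rng.random((126, 185))                                        # reference background image
y, x = np.mgrid[0:126, 0:185].astype(float)
du = 3.0 * np.exp(-((x - 92) ** 2 + (y - 63) ** 2) / 800.0)         # horizontal displacement, <= 3 px
dv = 0.5 * du                                                       # vertical displacement

# Bilinear resampling of the reference image at the displaced positions.
flow_img = map_coordinates(ref, [y + dv, x + du], order=1, mode='nearest')
print(flow_img.shape)                                               # (126, 185)
```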

Fig. 5. Locations of camera arrays in the numerical simulation. The red, green and blue arrows indicate the x, y, and z axes respectively (z axis is perpendicular to paper surface outward).

To ensure the diversity of the training data for effective GRU training, turbulent flow fields of the flame plume at different time steps were used to calculate the displacements. An LES model of nonpremixed methane combustion in an ambient environment was used to generate these displacements. The simulated geometry is shown in Fig. 6(a). The entire solution domain was cylindrical and enclosed by an inlet, outlet, and walls. Prior to the calculations, the solution domain was filled with air at 300 K. The diameter of the solution domain was 500 mm, and the height was 1000 mm. The methane entered the inlet at a temperature of 1000 K. The inlet had a diameter of 2 mm. The simulation timestep was set to 0.005 s. The 3D view of the simulation results is shown in Fig. 6(e). The figure shows that complex turbulent structures were captured in the upper part of the flame plume during the simulation.

Fig. 6. (a) Geometric structure for the simulation. (b-d) Simulated displacements in the first, sixth, and eleventh cameras. (e) Three-dimensional simulation results.

The flow field in the initial stage of the simulation was relatively stable and exhibited a simple structure with similar flow fields between adjacent frames. To avoid using similar or simple-structured flow fields when generating the dataset, the flow fields were saved every five steps starting from the 200th step.

A total of 4332 flow fields were saved and used to generate the synthetic dataset. The positions and orientations of the cameras were simulated based on the results of actual camera calibration. The distances between the cameras and flow fields ranged from 1500 to 5000 mm. The cameras were distributed along the Z-axis between 400 and 600 mm. The Z-axis of the camera coordinate system faced the inside of a sphere with a diameter of 100 mm at the center of the flow field. The X- and Y-axes of the camera coordinate system were perpendicular to each other and selected within a plane perpendicular to the Z-axis. The magnitudes of the BOS displacements are typically on the order of one pixel [50]; therefore, in our dataset, the maximum displacement was set to three pixels to avoid system errors. For each saved flow field, simulated BOS image pairs and their corresponding displacements were generated using 12 different positions and orientations of the cameras, as described earlier and shown in Fig. 6(b-d). The displacements are visualized using color maps, as shown in Fig. 6(b). Thus, a synthetic dataset consisting of 51984 pairs of BOS images and corresponding displacements was generated. The dataset was divided into training, validation, and testing sets at a ratio of 7:1.5:1.5.

The model used for the simulated data is the one shown in Fig. 2, except that the ART reconstruction branch, indicated by dashed lines, is omitted. During training, the smooth L1 loss introduced in the Fast R-CNN paper [51] was selected as the loss function. Compared with the L1 loss, it is smooth at zero; compared with the L2 loss, it is less sensitive to outliers when x is large, making it a slowly varying loss. With x denoting the difference between the predicted value and the ground truth, it is defined by the following equation:

$$\mathrm{Smooth}\,L_1(x) = \begin{cases} 0.5x^2 & \text{if } |x| < 1\\ |x| - 0.5 & \text{otherwise} \end{cases}$$
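Written out explicitly, the loss can be implemented as follows; torch.nn.SmoothL1Loss reproduces the same behavior with its default beta of 1.0.

```python
import torch

# Eq. (6) written out explicitly; torch.nn.SmoothL1Loss (default beta = 1.0)
# implements the same piecewise definition.
def smooth_l1(x: torch.Tensor) -> torch.Tensor:
    return torch.where(x.abs() < 1.0, 0.5 * x ** 2, x.abs() - 0.5)

pred = torch.randn(2, 32, 32, 32)
target = torch.randn(2, 32, 32, 32)
loss = smooth_l1(pred - target).mean()
print(torch.allclose(loss, torch.nn.SmoothL1Loss()(pred, target)))   # True
```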

The proposed network was implemented using the PyTorch library and trained on a PC with an NVIDIA RTX 4090 GPU. The Adam optimizer was used with ${\beta _1} = 0.9$ and ${\beta _2} = 0.999$ to optimize the proposed network. The initial learning rate was set to 1 × 10−3, and the learning rate was halved every 12 epochs. To prevent overfitting, training and validation were stopped when the change in the loss function value was less than 1 × 10−5 or when the maximum predefined number of iterations was reached. A total of 100 epochs were trained, requiring approximately 20 h to converge and evaluate the test set; the loss curve is shown in Fig. 7.
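A minimal sketch of this optimization schedule is shown below; the model and data loader are placeholders for the actual network and dataset.

```python
import torch

# Sketch of the optimization schedule described above; `model` and
# `train_loader` are placeholders for the actual network and dataset.
model = torch.nn.Linear(10, 1)
train_loader = [(torch.randn(8, 10), torch.randn(8, 1)) for _ in range(4)]
criterion = torch.nn.SmoothL1Loss()
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3, betas=(0.9, 0.999))
scheduler = torch.optim.lr_scheduler.StepLR(optimizer, step_size=12, gamma=0.5)   # halve lr every 12 epochs

prev_loss = float('inf')
for epoch in range(100):                                 # maximum predefined number of epochs
    epoch_loss = 0.0
    for inputs, targets in train_loader:
        optimizer.zero_grad()
        loss = criterion(model(inputs), targets)
        loss.backward()
        optimizer.step()
        epoch_loss += loss.item()
    scheduler.step()
    if abs(prev_loss - epoch_loss) < 1e-5:               # early stop on a small loss change
        break
    prev_loss = epoch_loss
```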

Fig. 7. Loss function variation curve.

To quantitatively evaluate the reconstruction accuracy of the proposed model, the mean relative error (MRE) was used in the numerical simulations, as defined by the equation

$$MRE\; = \;\frac{1}{N}\sum\limits_{i = 1}^N {\frac{{|{V_i^{\prime} - {V_i}} |}}{{{V_i}}}} $$
where $V^{\prime}$ and V represent the reconstructed and true field values, respectively. Additionally, the SSIM was applied to evaluate the structural similarity between the predicted and true three-dimensional refractive index distributions, as defined by the following equation:
$$SSIM(x,y)\; = \;\frac{{({2{\mu_x}{\mu_y} + {c_1}} )({2{\sigma_{xy}} + {c_2}} )}}{{({{\mu_x}^2 + {\mu_y}^2 + {c_1}} )({{\sigma_x}^2 + {\sigma_y}^2 + {c_2}} )}}$$

Here, ${\mu _x}$ and ${\mu _y}$ represent the means of $x$ and y, respectively, ${\sigma _x}^2$ and ${\sigma _y}^2$ their variances, and ${\sigma _{xy}}$ the covariance of x and y. Additionally, ${c_1}$ and ${c_2}$ are regularization parameters that avoid instability when the means and variances approach 0; they are defined by the equation:

$$\left\{ {\begin{array}{c} {{c_1}\; = \;{{({k_1}L)}^2}}\\ {{c_2}\; = \;{{({k_2}L)}^2}} \end{array}} \right.$$

Here, ${k_1}$ and ${k_2}$ are default constants equal to 0.01 and 0.03, respectively, and L is the dynamic range of the pixel values, with a default value of 255 [52].
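The two metrics can be computed as follows. Note that this sketch evaluates the SSIM of Eq. (8) globally over the whole field, whereas windowed implementations (e.g., skimage.metrics.structural_similarity) average local values and will therefore differ slightly; the sample fields are random stand-ins.

```python
import numpy as np

# Eqs. (7)-(9) evaluated over a 3D field (single global SSIM value).
def mre(v_rec, v_true):
    return np.mean(np.abs(v_rec - v_true) / v_true)

def ssim(x, y, k1=0.01, k2=0.03, L=255.0):
    c1, c2 = (k1 * L) ** 2, (k2 * L) ** 2
    mx, my = x.mean(), y.mean()
    cov = ((x - mx) * (y - my)).mean()
    return ((2 * mx * my + c1) * (2 * cov + c2)) / \
           ((mx ** 2 + my ** 2 + c1) * (x.var() + y.var() + c2))

rng = np.random.default_rng(0)
true = 1.0 + 3e-4 * rng.random((100, 100, 160))          # refractive-index-like field
rec = true + 1e-6 * rng.standard_normal(true.shape)      # slightly perturbed reconstruction
print(mre(rec, true), ssim(rec, true))
```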

The horizontal and vertical slice distributions of one instantaneous reconstruction result are shown in Fig. 8. Figure 9 presents five representative 3D reconstruction results obtained by uniformly sampling the 650 test-set samples. In addition to the qualitative comparisons, a quantitative analysis of the reconstruction results for these five samples was conducted based on Eqs. (7) and (8). Table 1 indicates that the proposed GRU model can reconstruct the three-dimensional distribution of the flow field with high accuracy.

Fig. 8. Slices of one instantaneous reconstruction result: slices along the vertical direction from −3 mm to 4 mm, and slices along the horizontal direction from −70 mm to 10 mm.

Fig. 9. Three-dimensional refractive index distribution of different frames.

Table 1. The verification results of the GRU model on the simulation dataset

5. Experimental validation

Finally, to further evaluate the generalization ability of the proposed network, the GRU model was tested using BOS images captured in actual experiments. The thermal plume above a candle flame distorts the background image. The experimental setup is shown in Fig. 10. Fifteen AVT Guppy F-125B cameras were distributed over a 160° arc surrounding the thermal plume above the candle flame. Each camera uses a Sony ICX445 sensor with a resolution of 1292 × 964 pixels and was fitted with a lens with a focal length of 12 mm. The shutter time of the cameras was set to 500 µs. The images were captured at a frame rate of 30 Hz.

Fig. 10. Experimental setup. Fifteen cameras were arranged around the hot air flow above the candle flame. The cameras numbered with blue are for reconstruction, and the cameras numbered with red are for validation. The background plates were illuminated using light-emitting diode light sources.

Three background plates with a sinusoidal pattern were fixed approximately 750 mm from the candle, and the cameras were placed approximately 650 mm from the candle. The background plates were illuminated by LED light sources.

After calibration, the GRU model was used for BOS reconstruction in the actual experiment. A total of 150 projection frames were captured, each consisting of distorted images of the flow field from 15 views. By comparing the distorted and undistorted images, the displacement in the imaging plane can be calculated using a cross-correlation algorithm [53]. The cross-correlation algorithm aims to identify the window in the flow image that matches the interrogation window of the reference image. Specifically, the interrogation window slides over the search area of the flow image pixel by pixel, horizontally and vertically, and the cross-correlation coefficient of the corresponding areas is calculated. The cross-correlation coefficient is defined by the following equation [54]:

$$R(u,v) = \frac{\sum\limits_{x}\sum\limits_{y} f(x,y)\, g(x+u, y+v)}{\sqrt{\sum\limits_{x}\sum\limits_{y} f^2(x,y) \sum\limits_{x}\sum\limits_{y} g^2(x+u, y+v)}}$$
where $f({x,y})$ represents the grayscale distribution over the interrogation window in the reference image and $g({x + u,y + v})$ denotes that over the corresponding window in the disturbed image. The value of $R({u,v})$ lies between 0 and 1. The position of the maximum of $R({u,v})$ is the matched position, and the corresponding u and v are the measured displacement.
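A brute-force implementation of this window search, following Eq. (10), is sketched below; the window size, search radius, and synthetic images are illustrative choices.

```python
import numpy as np

# Brute-force search for the displacement of one interrogation window,
# following Eq. (10); window size and search radius are illustrative.
def match_window(ref, flow, x0, y0, win=16, search=4):
    f = ref[y0:y0 + win, x0:x0 + win]
    best_r, best_uv = -1.0, (0, 0)
    for v in range(-search, search + 1):
        for u in range(-search, search + 1):
            g = flow[y0 + v:y0 + v + win, x0 + u:x0 + u + win]
            r = np.sum(f * g) / np.sqrt(np.sum(f ** 2) * np.sum(g ** 2))
            if r > best_r:
                best_r, best_uv = r, (u, v)
    return best_uv, best_r

rng = np.random.default_rng(2)
ref = rng.random((64, 64))
flow = np.roll(ref, shift=(1, 2), axis=(0, 1))            # known shift: v = 1 px, u = 2 px
print(match_window(ref, flow, 24, 24))                    # recovers (2, 1) with R close to 1
```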

As shown in Fig. 11, 126 × 185 sample points were selected from each image. The probe region had dimensions of 120 × 120 × 192 mm and was divided into a grid of 100 × 100 × 160 cells.

Fig. 11. Displacements of camera (left) and the probe region of flow (right).

The GRU model employs the architecture shown in Fig. 2, with the ART reconstruction results used as the ground truth during training. This resulted in a dataset consisting of 2250 pairs of displacement maps and the corresponding ART reconstruction results. The dataset was split into training, validation, and testing sets at a ratio of 8:1:1. A smooth L1 loss function was selected during training. The proposed network was implemented using the PyTorch library and was trained on a PC with an NVIDIA RTX 4090 GPU. The Adam optimizer was used with ${\beta _1} = 0.9$ and ${\beta _2} = 0.999$ to optimize the proposed network. The initial learning rate was set to 1 × 10−3, and the learning rate was halved every 12 epochs. To prevent overfitting, training and validation were stopped when the change in the loss function value was less than 1 × 10−5 or when the maximum predefined number of iterations was reached. A total of 150 epochs were trained, requiring approximately 32 h to converge and evaluate the test set; the loss curve is shown in Fig. 12.

Fig. 12. Loss function variation curve.

To verify the accuracy of the method, 12 cameras (shown in blue in Fig. 10) were selected to participate in the GRU reconstruction, and the other three cameras (shown in red in Fig. 10) were used for validation. After obtaining the reconstruction results, the re-projected displacements of the other three cameras were calculated using ray tracing based on Eqs. (1) and (2). A comparison between the measured displacements calculated by the cross-correlation algorithm and the re-projected displacements calculated by ray tracing is shown in Fig. 13 (a). The correlation coefficient (CC) was used to indicate the similarity between the measured displacements and re-projected displacements. The CC between images A and B is expressed by the following equation:

$$CC\; = \;\frac{{{\mathop{\rm cov}} (A,B)}}{{\sqrt {{\mathop{\rm var}} (A){\mathop{\rm var}} (B)} }}$$
where ${\mathop{\rm cov}} (A,B)$ is the covariance between images A and B, and ${\mathop{\rm var}} (A)$ is the variance of image A. The CCs between the measured displacements and the reprojections of our results for the three validation cameras were 0.85, 0.90, and 0.83. As shown in Fig. 13(a), the measured and reprojected displacements are similar. Furthermore, we analyzed the measured and re-projected displacements in more detail, using the Euclidean distance to evaluate the error between the two. As shown in Fig. 13(b), the positions with larger errors are clearly visible; the errors produced by our method are mostly within 0.8 pixels, which is relatively small. The distribution of the reprojection errors is shown in Fig. 15. The average mean error (AME) and standard deviation (Std) of the reprojection errors along the u-axis are 0.051 and 0.081 pixels, respectively, while those along the v-axis are 0.029 and 0.052 pixels. The error histograms show that only a few errors are relatively large, and most are less than 0.05 pixels.
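For reference, Eq. (11) and the per-point Euclidean displacement error shown in Fig. 13(b) can be computed as follows; the two displacement fields below are random stand-ins for the measured and re-projected displacements.

```python
import numpy as np

# Eq. (11) and the per-point Euclidean displacement error of Fig. 13(b);
# the fields below are random stand-ins.
def cc(a, b):
    a, b = a.ravel(), b.ravel()
    cov = ((a - a.mean()) * (b - b.mean())).mean()
    return cov / np.sqrt(a.var() * b.var())

rng = np.random.default_rng(3)
meas_u, meas_v = rng.random((126, 185)), rng.random((126, 185))
reproj_u = meas_u + 0.05 * rng.standard_normal(meas_u.shape)
reproj_v = meas_v + 0.05 * rng.standard_normal(meas_v.shape)

print(cc(meas_u, reproj_u), cc(meas_v, reproj_v))                     # correlation coefficients
err = np.sqrt((meas_u - reproj_u) ** 2 + (meas_v - reproj_v) ** 2)    # Euclidean distance error (px)
print(err.mean(), err.std())
```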

Fig. 13. (a). Comparisons between measurement displacements and reprojection displacements: the first row is the reprojection displacements of ART method. The second row is the reprojection displacements of our method (the rendering scalar is consistent with Fig. 11). (b). The Euclidean distance error between measurement displacements and reprojection displacements.

The distributions of the horizontal and vertical slices of the instantaneous reconstruction results are shown in Fig. 14(a). The figure demonstrates that the proposed method can reconstruct the complex structure of a flow field. However, artifacts still exist in the slice images, concentrated primarily at the top of the thermal plume above the flame. This is because the training data for the GRU model are the ART reconstruction results, and ART has difficulty completely suppressing noise [55]; therefore, some noise is almost unavoidable. ART reconstruction results can exhibit petal-shaped artifacts in the surrounding areas, as shown in the red box in Fig. 14(b), and the number of petal-shaped artifacts is related to the number of projection angles in linear tomography [56]. Artifacts often have wide-ranging and evident effects. Compared with the ART reconstruction results, the artifacts in our reconstruction results span a smaller range and are closer to the background level, greatly reducing their impact and improving the reconstruction quality.

Fig. 14. (a). Slices of one instantaneous GRU reconstruction result: slices along the vertical direction from −16.8 mm to 0 mm, and slices along the horizontal direction from −72 mm to 24 mm. (b). Slices of ART reconstruction result along the vertical and the horizontal direction (artifacts are highlighted in red box)

Fig. 15. Distribution of errors between measurement displacements and reprojection displacements: (a) scatter plot of errors, (b) frequency of different errors along the u-axis direction, and (c) frequency of different errors along the v-axis direction.

Finally, we selected the reconstruction results for five different frames, as shown in Fig. 16. As mentioned above, the quantitative reprojection errors were calculated; detailed information is provided in Table 2. The reprojection errors for different frames are approximately the same, verifying the stability of the method. Additionally, it is worth noting that, compared to the ART algorithm (with 200 iterations), the proposed GRU model demonstrates outstanding advantages in computational efficiency. Table 3 lists the reconstruction time of the different methods: the GRU model requires an average of only 1.043 s per frame, whereas the ART method (200 iterations) requires an average of 120 min per frame.

Fig. 16. Three-dimensional refractive index distribution of different frames predicted by GRU model. The distributions of consecutive frames are shown in Visualization 1. The display of distribution under different viewing directions is shown in Visualization 2.

Table 2. The reprojection errors of the GRU model on the real dataset

Table 3. Time consumption of the GRU model and ART

6. Summary

In summary, we proposed a GRU-based BOS-CT model that enables fast three-dimensional reconstruction of real flow fields. First, the accuracy of the method was validated on methane combustion simulations by calculating the MRE of the reconstructed results. Additionally, the GRU model was used to reconstruct the refractive index of the hot air flow above a candle flame using a system composed of 12 cameras. The reprojection displacements of three additional cameras were obtained via ray tracing, and their agreement with the measured displacements, obtained by the cross-correlation algorithm, verified the reliability of the reconstruction. Our method establishes a correlation between the projection data and the three-dimensional distribution based on the physical nature of multidirectional projections, eliminating the need for the differentiation steps and ray tracing required in traditional reconstruction methods. The reconstruction time per frame was reduced from two hours to about 1 s, and radial artifacts were greatly reduced. However, in experimental scenarios where the actual flow-field distribution is unavailable, training the model still requires ART reconstruction results as ground truth. Therefore, the noise introduced by ART is unavoidable and becomes the main limitation on the prediction accuracy of the model. In future research, we will focus on overcoming the limitations of ART and improving the generalization capability of the model.

Funding

National Natural Science Foundation of China (62175110, 62221004); Ministry of Industry and Information Technology of the People's Republic of China (TSXK2022D004).

Disclosures

The authors declare no conflicts of interest.

Data availability

The data underlying the results presented are not publicly available at this time but may be obtained from the authors upon reasonable request.

References

1. G. S. Settles, Schlieren and shadowgraph techniques: visualizing phenomena in transparent media (Springer Science & Business Media, 2001).

2. L. Couch, D. A. Kalin, and T. McNeal, “Experimental investigation of image degradation created by a high-velocity flow field,” in Characterization, Propagation, and Simulation of Sources and Backgrounds, (SPIE, 1991), 417–423.

3. L. M. Weinstein, “Large-field high-brightness focusing schlieren system,” AIAA J. 31(7), 1250–1255 (1993). [CrossRef]  

4. G. S. Settles, “Visualizing full-scale ventilation airflows,” ASHRAE J. 39(7), 19 (1997).

5. S. B. Dalziel, G. O. Hughes, and B. R. Sutherland, “Whole-field density measurements by ‘synthetic schlieren’,” Exp. Fluids 28(4), 322–335 (2000). [CrossRef]

6. G. Meier, “Computerized background-oriented schlieren,” Exp. Fluids 33(1), 181–187 (2002). [CrossRef]  

7. G. S. Settles and M. J. Hargather, “A review of recent developments in schlieren and shadowgraph techniques,” Meas. Sci. Technol. 28(4), 042001 (2017). [CrossRef]  

8. M. Raffel, C. Tung, H. Richard, et al., “Background oriented stereoscopic schlieren (BOSS) for full scale helicopter vortex characterization,” in 9th international symposium on flow visualization, (2000), 23–24.

9. L. Venkatakrishnan and G. Meier, “Density measurements using the background oriented schlieren technique,” Exp. Fluids 37(2), 237–247 (2004). [CrossRef]  

10. B. Atcheson, I. Ihrke, D. Bradley, et al., “Imaging and 3D tomographic reconstruction of time-varying, inhomogeneous refractive index fields,” in SIGGRAPH Sketches (Citeseer, 2007), p. 32.

11. L. Venkatakrishnan and P. Suriyanarayanan, “Density field of supersonic separated flow past an afterbody nozzle using tomographic reconstruction of BOS data,” Exp. Fluids 47(3), 463–473 (2009). [CrossRef]  

12. F. Leopold, M. Ota, D. Klatt, et al., “Reconstruction of the unsteady supersonic flow around a spike using the colored background oriented schlieren technique,” J. Flow Control Measur. Visual. 01(02), 69–76 (2013). [CrossRef]  

13. F. Sourgen, J. Haertig, and C. Rey, “Comparison between background oriented schlieren measurements (BOS) and numerical simulations,” in 24th AIAA Aerodynamic Measurement Technology and Ground Testing Conference, (2004), 2602.

14. M. Gomez, S. J. Grauer, J. Ludwigsen, et al., “Megahertz-rate background-oriented schlieren tomography in post-detonation blasts,” Appl. Opt. 61(10), 2444–2458 (2022). [CrossRef]

15. B. F. Bathel, J. Weisberger, and S. B. Jones, “Development of tomographic background-oriented schlieren capability at NASA Langley research center,” in AIAA Aviation 2019 Forum, (2019), 3288.

16. M. Ota, F. Leopold, R. Noda, et al., “Improvement in spatial resolution of background-oriented schlieren technique by introducing a telecentric optical system and its application to supersonic flow,” Exp. Fluids 56(3), 48 (2015). [CrossRef]  

17. L. Venkatakrishnan, “Density measurements in an axisymmetric underexpanded jet by background-oriented schlieren technique,” AIAA J. 43(7), 1574–1579 (2005). [CrossRef]  

18. F. Sourgen, F. Leopold, and D. Klatt, “Reconstruction of the density field using the colored background oriented schlieren technique (CBOS),” Opt. Laser Eng. 50(1), 29–38 (2012). [CrossRef]  

19. M. Ota, “Quantitative 3D Density Measurement of Supersonic Flow by Colored Grid Background Oriented Schlieren (CGBOS) Technique,” Proc. 27th ICAS, 2010 (2010).

20. M. Ota, K. Hamada, H. Kato, et al., “Computed-tomographic density measurement of supersonic flow field by colored-grid background oriented schlieren (CGBOS) technique,” Measur. Sci. Technol. 22(10), 104011 (2011). [CrossRef]  

21. M. Ota, H. Kato, R. Sakamoto, et al., “Quantitative Measurement and Reconstruction of 3D Density Field by CGBOS (Colored Grid Background Oriented Schlieren) Technique,” in 28th International Symposium on Shock Waves: Vol 1, (Springer, 2012), 641–646.

22. H. Richard and M. Raffel, “Principle and applications of the background oriented schlieren (BOS) method,” Measur. Sci. Technol. 12(9), 1576–1585 (2001). [CrossRef]  

23. G. A. Atkinson and E. R. Hancock, “Two-dimensional BRDF estimation from polarisation,” Comput. Vision Image Understand. 111(2), 126–141 (2008). [CrossRef]  

24. F. Nicolas, V. Todoroff, A. Plyer, et al., “A direct approach for instantaneous 3D density field reconstruction from background-oriented schlieren (BOS) measurements,” Exp. Fluids 57(1), 13–21 (2016). [CrossRef]  

25. H. Cai, Y. Song, Y. Ji, et al., “Direct background-oriented schlieren tomography using radial basis functions,” Opt. Express 30(11), 19100–19120 (2022). [CrossRef]  

26. M. Akamine, S. Teramoto, and K. Okamoto, “Formulation and demonstrations of three-dimensional background-oriented schlieren using a mirror for near-wall density measurements,” Exp. Fluids 64(7), 134 (2023). [CrossRef]  

27. Y. Jin, W. Zhang, Y. Song, et al., “Three-dimensional rapid flame chemiluminescence tomography via deep learning,” Opt. Express 27(19), 27308–27334 (2019). [CrossRef]  

28. J. Lei, H. Mu, Q. Liu, et al., “Data-driven reconstruction method for electrical capacitance tomography,” Neurocomputing 273, 333–345 (2018). [CrossRef]  

29. T. Yu, W. Cai, and Y. Liu, “Rapid tomographic reconstruction based on machine learning for time-resolved combustion diagnostics,” Rev. Sci. Instrum. 89(4), 043101 (2018). [CrossRef]

30. J. Huang, H. Liu, and W. Cai, “Online in situ prediction of 3-D flame evolution from its history 2-D projections via deep learning,” J. Fluid Mech. 875, R2 (2019). [CrossRef]  

31. Y. Bengio, P. Simard, and P. Frasconi, “Learning long-term dependencies with gradient descent is difficult,” IEEE Trans. Neural Netw. 5(2), 157–166 (1994). [CrossRef]  

32. T. Mikolov, I. Sutskever, K. Chen, et al., “Distributed representations of words and phrases and their compositionality,” Adv. Neural Inform. Process. Syst. 26 (2013).

33. H. Sak, A. Senior, K. Rao, et al., “Fast and accurate recurrent neural network acoustic models for speech recognition,” arXiv, arXiv:1507.06947 (2015). [CrossRef]  

34. Y. Miao, M. Gowayyed, and F. Metze, “EESEN: End-to-end speech recognition using deep RNN models and WFST-based decoding,” in 2015 IEEE workshop on automatic speech recognition and understanding (ASRU), (IEEE, 2015), 167–174.

35. K. Cho, B. Van Merriënboer, C. Gulcehre, et al., “Learning phrase representations using RNN encoder-decoder for statistical machine translation,” arXiv, arXiv:1406.1078 (2014). [CrossRef]  

36. N. Srivastava, E. Mansimov, and R. Salakhudinov, “Unsupervised learning of video representations using lstms,” in International conference on machine learning, (PMLR, 2015), 843–852.

37. J. Donahue, L. Anne Hendricks, S. Guadarrama, et al., “Long-term recurrent convolutional networks for visual recognition and description,” in Proceedings of the IEEE conference on computer vision and pattern recognition, (2015), 2625–2634.

38. R. Pascanu, T. Mikolov, and Y. Bengio, “On the difficulty of training recurrent neural networks,” in International conference on machine learning, (Pmlr, 2013), 1310–1318.

39. J. Chung, C. Gulcehre, K. Cho, et al., “Empirical evaluation of gated recurrent neural networks on sequence modeling,” arXiv, arXiv:1412.3555 (2014). [CrossRef]  

40. S. Hochreiter and J. Schmidhuber, “Long short-term memory,” Neural Comput. 9(8), 1735–1780 (1997). [CrossRef]  

41. R. Cahuantzi, X. Chen, and S. Güttel, “A comparison of LSTM and GRU networks for learning symbolic sequences,” in Science and Information Conference, (Springer, 2023), 771–785.

42. C. B. Choy, D. Xu, J. Gwak, et al., “3d-r2n2: A unified approach for single and multi-view 3d object reconstruction,” in Computer Vision–ECCV 2016: 14th European Conference, Amsterdam, The Netherlands, October 11-14, 2016, Proceedings, Part VIII 14, (Springer, 2016), 628–644.

43. T. Le, G. Bui, and Y. Duan, “A multi-view recurrent neural network for 3D mesh segmentation,” Comput. Graphics 66, 103–112 (2017). [CrossRef]  

44. J. Sun, Y. Xie, L. Chen, et al., “NeuralRecon: Real-time coherent 3D reconstruction from monocular video,” in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, (2021), 15598–15607.

45. X. Zuo, N. Yang, N. Merrill, et al., “Incremental Dense Reconstruction from Monocular Video with Guided Sparse Feature Volume Fusion,” IEEE Robot. Autom. Lett. 8(6), 3876–3883 (2023). [CrossRef]  

46. M. Raffel, H. Richard, and G. Meier, “On the applicability of background oriented optical tomography for large scale aerodynamic investigations,” Exp. Fluids 28(5), 477–481 (2000). [CrossRef]  

47. H. Richard, M. Raffel, M. Rein, et al., “Demonstration of the applicability of a background oriented schlieren (BOS) method,” Laser Techniques for Fluid Mechanics, 145–156 (2000).

48. K. He, X. Zhang, S. Ren, et al., “Deep residual learning for image recognition,” in Proceedings of the IEEE conference on computer vision and pattern recognition, (2016), 770–778.

49. S. Ioffe and C. Szegedy, “Batch normalization: Accelerating deep network training by reducing internal covariate shift,” in International conference on machine learning, (pmlr, 2015), 448–456.

50. S. J. Grauer, A. Unterberger, A. Rittler, et al., “Instantaneous 3D flame imaging by background-oriented schlieren tomography,” Combust. Flame 196, 284–299 (2018). [CrossRef]  

51. R. Girshick, “Fast r-cnn,” in Proceedings of the IEEE international conference on computer vision, (2015), 1440–1448.

52. Z. Wang, A. C. Bovik, H. R. Sheikh, et al., “Image quality assessment: from error visibility to structural similarity,” IEEE Trans. Image Process 13(4), 600–612 (2004). [CrossRef]  

53. F. Scarano, “Iterative image deformation methods in PIV,” Measur. Sci. Technol. 13(1), R1–R19 (2002). [CrossRef]  

54. G.-M. Guo and H. Liu, “Density and temperature reconstruction of a flame-induced distorted flow field based on background-oriented schlieren (BOS) technique,” Chin. Phys. B 26(6), 064701 (2017). [CrossRef]  

55. T. Yu and W. Cai, “Benchmark evaluation of inversion algorithms for tomographic absorption spectroscopy,” Appl. Opt. 56(8), 2183–2194 (2017). [CrossRef]  

56. C. Wei, K. K. Schwarm, D. I. Pineda, et al., “Physics-trained neural network for sparse-view volumetric laser absorption imaging of species and temperature in reacting flows,” Opt. Express 29(14), 22553–22566 (2021). [CrossRef]  

Supplementary Material (2)

Visualization 1: Three-dimensional refractive index distribution of consecutive frames.
Visualization 2: The display of distribution under different viewing directions.
