Performance enhancement of diffuse fluorescence tomography based on an extended Kalman filtering-long short term memory neural network correction model

Lingxiu Xing; Limin Zhang; Limin Zhang; Wenjing Sun; Zhuanxia He; Yanqi Zhang; Feng Gao; Feng Gao

doi:10.1364/BOE.514041

1. Introduction

Diffuse fluorescence tomography (DFT) with the merits of high sensitivity, non-invasiveness and non-radiation [1] demonstrates great potential for tumor diagnosis [2] drug development [3] and treatment evaluation [4]. However, due to the strong scattering of photons in biological tissue and incomplete surface measurements, DFT reconstruction is an ill-posed inverse problem [5], and the reconstructed images suffer from low spatial resolution, accuracy, as well as susceptibility to noise and model errors.

To alleviate the ill-posedness of DFT inverse problem, various strategies have been explored in an attempt to improve the quality of DFT reconstruction. The most commonly used strategy is known as regularization including Tikhonov regularization (L2 norm), Lp (0 < p ≤ 1) norm and total variation applied independently or jointly. Tikhonov regularization [6,7] is one of the most popular approaches to solve the inverse problem of DFT, in which a L2-norm constraint term is added into the data-fitting term to improve the stability, while it tends to cause artifacts and over-smoothed image boundary. As a sparsity constraint, L1 regularization [8,9] using prior knowledge of the sparse distribution of fluorescent sources is another effective strategy. However, there are some difficulties, such as over-sparseness, incomplete reconstruction of the fluorescent target, and lack of detailed information on the boundary. Apart from these, the quality of the DFT reconstruction can be improved by incorporating structural prior information obtained from XCT or MRI to the DFT reconstruction process, such as Helmholtz regularization [10], weighed segments regularization [11], and Laplace regularization [12].

Nevertheless, the above-mentioned methods usually lack a basis to adequately tackle the model mismatch due to not considering the measurement errors. In contrast, non-linear filtering scheme can handle measurement noise and also account for inaccuracies in forward models through an appropriate assignment of a process noise, and thereby provides a more rigorous framework to obtain estimation and error properties [13]. In diffuse optical tomography (DOT) field, Raveendran [14] et al applied ensemble Kalman filtering to retrieve DOT image by introducing the concept of pseudo-time, where the iterative solving process of image reconstruction was regarded as a dynamic changing process. The numerical simulation results showed that the position and shape of the targets could be reconstructed more accurately, compared with the regularized Gauss-Newton algorithm.

Baez [15] et al employed extended Kalman filtering (EKF) algorithm to estimate DOT image, and the results showed that the EKF-based method can effectively improve noise robustness compared with the NIRFAST solver (LM algorithm). In our group, Zhang [16] et al proposed a regularization-based EKF algorithm for DOT reconstruction, where Tikhonov regularization was incorporated to EKF process, resulting in superior imaging accuracy and noise robustness, especially under the circumstance of low absorption contrast and high noise level, compared with the conventional algebraic reconstruction technique (ART) and L2 regularization. Briefly, the strength of nonlinear filtering method is that it takes into account the influence of measurement noise on the filtering results during the iteration process, and thus the accuracy and noise robustness of image reconstruction can be improved to some extent. However, a primary disadvantage is that EKF tends to be time consuming due to multiple iterative updates and matrix inversions. In addition, similar to other traditional reconstruction strategies, EKF-based algorithm cannot substantially improve imaging quality, due to the inherent ill-posed nature of inverse problem.

Nowadays, neural networks have been introduced to the field of optical tomography reconstruction. Generally, the neural network-based methods can be categorized into end-to-end network and post-processing methods. The former directly establishes a nonlinear mapping relation from surface measurements to interior fluorescence distributions. For instance, Wang [17] et al proposed a stacked auto-encoder neural network to retrieve DFT image, and the numerical results demonstrated that the positions and shapes of the targets can be retrieved more accurately than the traditional ART algorithm. Li [18] et al proposed a framework of graph convolution network with ResNet-like architectures to achieve DFT image reconstruction with fewer parameters and higher speed. Zhang [19] et al proposed a 3D fusion dual-sampling deep learning network model in DFT to achieve ultra-high spatial resolution reconstruction. Though end-to-end network can enormously alleviate the ill-posedness of the inverse problem and improve image reconstruction quality, it turns the image reconstruction into a purely data-driven process, heavily relying on huge data to train the black box, and departing the physical relationship between measurement and reconstruction [20]. The alternative is post-processing method that generally first utilizes a conventional algorithm to obtain low-quality image, and then leverages neural network learning to correct the initial reconstruction image. Moreover, post-processing method can typically learn their mappings from less data, compared with purely data-driven model requiring a massive data set to learn a desirable mapping. Long [20] proposed a two-stage fluorescence tomographic reconstruction algorithm, where Tikhonov regularization was employed to obtain preliminary fluorophore distributions, sequentially the distributions of fluorophores were refined by convolutional neural networks (CNN). The results showed that the proposed method achieved more depth localization and less ambiguous reconstruction than L1 regularization. In photoacoustic tomography (PAT), Antholzer [21] et al applied filtered back projection algorithm to yield a low-quality image containing severe under-sampling artefacts, then CNN network was employed to map the intermediate reconstruction to an artefact-free final image. The results demonstrated that the proposed approach reconstructed images with a quality comparable to state-of-the-art iterative approaches for PAT from sparse data. Currently, the research on post-processing algorithms in the field of DFT is limited, and most studies are mainly in simulation stage.

As we mentioned above, EKF has been exploited for DOT image reconstruction and demonstrated its superiority over traditional deterministic methods. However, to our best knowledge, EKF has not been introduced to reconstruct DFT image. Herein, a model-derived deep-learning method, specifically, a post-processing method was proposed, where a semi-iteration EKF-based reconstruction was implemented to obtain the iterative process parameters, then LSTM neural network correct model was further employed to predict the optimal fluorescence distribution. It harnessed both the merits of EKF incorporating prior information and measurement errors to the model as well as long short term memory (LSTM) network with the good ability to mine crucial information from time series data via network learning. To verify the effectiveness of the proposed SEKF-LSTM algorithm for DFT imaging, a series of numerical simulations were conducted firstly, then the well-trained network model was applied to phantom and in vivo experiments, and the experimental results were quantitatively evaluated and compared with the EKF and semi-iteration EKF algorithms.

The rest of this work is organized as follows. In Section 2, the mathematical framework of DFT, EKF, and LSTM correction model are presented, and four metrics are introduced to quantitively assess the image quality. The methods and results of the simulation phantom and in vivo experiments are presented in Section 3 and Section 4, respectively. Finally, Section 5gives the conclusions.

2. Method

2.1 DFT technique

2.1.1 Forward problem

For continuous-wave DFT imaging, the light transport in biological tissue can be commonly described by using a set of the coupled diffusion equations as follows [22]:

(1)$$\left\{ {\begin{array}{{l}} {[{\nabla \cdot{\kappa_x}({\boldsymbol r})\nabla - {\mu_{ax}}({\boldsymbol r})c} ]{\Phi _x}({\boldsymbol r},{{\boldsymbol r}_s}) ={-} \delta ({\boldsymbol r} - {{\boldsymbol r}_s})}\\ {[{\nabla \cdot{\kappa_m}({\boldsymbol r})\nabla - {\mu_{am}}({\boldsymbol r})c} ]{\Phi _m}({\boldsymbol r},{{\boldsymbol r}_s}) ={-} c{\Phi _x}({\boldsymbol r},{{\boldsymbol r}_s})\eta {\mu_{af}}({\boldsymbol r})} \end{array}} \right.$$

where subscripts x and m denote the excitation and emission wavelengths, respectively; ${\mu _{ax}}({\boldsymbol r})$ and ${\mu _{am}}({\boldsymbol r})$ represent the absorption coefficients of excitation and fluorescence, respectively; ${\Phi _\upsilon }({\boldsymbol r},{{\boldsymbol r}_s})(\upsilon \in [x,m])$ is the photon density; $\kappa ({\boldsymbol r}) = c/3({\mu _a}({\boldsymbol r}) + \mu _s^{\prime}({\boldsymbol r}))$ is the diffusion coefficient, where $\mu _s^{\prime}({\boldsymbol r})$ is the reduced scattering coefficient, c is the speed of light in medium; $\eta {\mu _{af}}({\boldsymbol r})$ is the fluorescence yield, where $\eta$ is the quantum efficiency of fluorescence agent; $\nabla$ is the gradient operator; $\delta$ is the Dirac function. These quantities are usually the functions of the position vector ${\boldsymbol r}$.

Usually, the above equation can be resolved by combining with Robin boundary condition:

(2)$${\Phi _\upsilon }({\boldsymbol r},{{\boldsymbol r}_s}) + 2\gamma \kappa ({\boldsymbol r})\vec{{\boldsymbol n}} \cdot \nabla {\Phi _\upsilon }({\boldsymbol r},{{\boldsymbol r}_s})|{_{{\boldsymbol r} \in \partial \Omega }} = 0$$

where $\vec{\boldsymbol n}$ is the outward unit normal vector to tissue boundary $\partial \Omega $; $\gamma = {{(1 + {R_f})} / {(1 - {R_f})}}$ with ${R_f} \approx{-} 1.4399{n_f}^{ - 2} + 0.7099{n_f}^{ - 1} + 0.6681 + 0.0636{n_f}$ is the efficient reflection coefficient, where ${n_f}$ is the refractive index of the tissue to the air(${n_f} = 1.4$). The numerical solution of the above equation is usually obtained based on finite element method (FEM).

2.1.2 Inverse problem

To mitigate the influence of heterogeneous background optical properties as well as the errors between different measurement channels, normalized Born ratio method is used to reconstruct the fluorescence yields, which is written as follows:

(3)$${I_{nb}}({{\boldsymbol r}_d},{{\boldsymbol r}_s}) = \frac{1}{{{I_x}({{\boldsymbol r}_d},{{\boldsymbol r}_s})}}\int_V {cG({{\boldsymbol r}_d},{\boldsymbol r})} {\Phi _x}({\boldsymbol r},{{\boldsymbol r}_s})\eta {\mu _{af}}({\boldsymbol r})dV$$

where ${I_{nb}}({{\boldsymbol r}_d},{{\boldsymbol r}_s})$ is the Born ratio of the emission and excitation flux measured at a detector position ${{\boldsymbol r}_d}$ with regard to the excitation source at ${{\boldsymbol r}_s}$; $G({{\boldsymbol r}_d},{\boldsymbol r})$ is the density at ${{\boldsymbol r}_d}$ for a source at ${\boldsymbol r}$; ${I_x}({{\boldsymbol r}_d},{{\boldsymbol r}_s})$ is the calculated excitation flux at ${{\boldsymbol r}_d}$ for a source at ${{\boldsymbol r}_s}$; V is the imaging domain.

Based on the FEM [23], Eq. (3) can be discretized into a matrix equation:

(4)$${I_{nb}}({\boldsymbol r}) = W({\boldsymbol r})X({\boldsymbol r})$$

where $W({\boldsymbol r})$ is a M × N weight matrix with M and N representing the numbers of measurement data and finite element nodes, respectively; X represents the fluorescence yield $\eta {\mu _{af}}({\boldsymbol r})$ to be reconstructed.

2.2 Extended Kalman filtering

2.2.1 State space model

The state space model of extended Kalman filtering consists of state and observation equations. In this work, the fluorescence yield is used as the state variable $X[k]$ to establish the state equation, and the ${I_{nb}}({\boldsymbol r})$ obtained from the measurement is applied to establish the observation equation, described by Eqs. (5) and (6), respectively.

(5)$$X[k ]= X[{k\textrm{ } - 1} ]+ S[k ]$$

(6)$${I_{nb}}[k ]= WX[k ]+ D[k ]$$

Where $S\sim N(0,Q)$ and $D\sim N(0,R)$ denote the process noise and measurement noise, respectively, and both are zero-mean Gaussian white noise; Q and R represent the corresponding noise covariance, respectively. Herein, $W$ is referred to as measurement matrix that is consistent with the weight matrix in Eq. (4).

2.2.2 DFT reconstruction based on an EKF model

The EKF solution to the above equation is given by a prediction-update procedure. In the prediction stage, the fluorescence yield ${\hat{X}^ - }[k|k - 1]$ and error covariance matrix ${\hat{P}^ - }[k|k - 1]$ at the kth step are predicted based on the one step ahead predictions $\hat{X}[k - 1]$ and $\hat{P}[k - 1]$, respectively. The estimation process can be formulated as follows:

(7)$${\hat{X}^ - }[k|k - 1] = \hat{X}[k - 1]$$

(8)$${\hat{P}^ - }[k|k - 1] = E\hat{P}[k - 1]{E^T} + Q$$

where E is an identity matrix.

In the update stage, the Kalman gain $G[k]$ at the kth step is calculated based on ${\hat{P}^ - }[k|k - 1]$ and measurement matrix W, as shown in Eq. (9). The fluorescence yield $\hat{X}[k]$ for step k is updated using the predicted ${\hat{X}^ - }[k|k - 1]$, $G[k]$, and the difference $\alpha [k]$ between the observed value and the predicted value, as shown in Eq. (11). The error covariance matrix $\hat{P}[k]$ for step k is updated using the predicted ${\hat{P}^ - }[k|k - 1]$, $G[k]$, and W, as shown in Eq. (12).

(9)$$G[k] = {\hat{P}^ - }[k|k - 1]{W^T}[k]{({W[k]{{\hat{P}}^ - }[k|k - 1]{W^T}[k] + R} )^{ - 1}}$$

(10)$$\alpha [k] = ({{I_{nb}}[k] - W[k] \cdot {{\hat{X}}^ - }[k|k - 1]} )$$

(11)$$\hat{X}[k] = {\hat{X}^ - }[k|k - 1] + G[k] \cdot \alpha [k]$$

(12)$$\hat{P}[k] = (I - G[k] \cdot W[k]) \cdot {\hat{P}^ - }[k|k - 1]$$

The above equations demonstrate that the fluorescence yield predicted at the next step is associated with the updated fluorescence yield ${\hat{X}^ - }[k|k - 1]$, the Kalman gain $G[k]$ and $\alpha [k]$ at the current step. Therefore, the three factors are used as the inputs to the LSTM correction model, and the output is the actual fluorescence yield $Y[k]$, as illustrated in Fig. 1. Accordingly, a mapping between the input and output can be expressed as:

(13)$$Y[k] = g({\hat{X}^ - }[k|k - 1],G[k],\alpha [k])$$

Fig. 1. The schematic diagram of the SEKF-LSTM method.

Abstract

1. Introduction

2. Method

2.1 DFT technique

2.1.1 Forward problem

2.1.2 Inverse problem

2.2 Extended Kalman filtering

2.2.1 State space model

2.2.2 DFT reconstruction based on an EKF model

2.3 LSTM correction model

2.4 Evaluation metrics

3. Experimental methods

3.1 Simulation experiments

3.1.1 Training dataset settings

3.1.2 Simulation scenarios settings

3.2 Phantom experiments

3.3 In vivo experiments

4. Results

4.1 Simulation results

4.1.1 Different target sizes

4.1.2 Different target fluorescence yields

4.2 Phantom experiment results

4.3 In vivo experiment results

5. Discussion and conclusion

Funding

Disclosures

Data availability

References

Data availability

Cited By

Figures (15)

Equations (24)

Biomedical Optics Express