Optica Publishing Group

RETRACTED: Unified joint reconstruction approach for random illumination microscopy

Open Access

Abstract

Random illumination microscopy (RIM) using uncontrolled speckle patterns has shown the capacity to surpass Abbe’s diffraction barrier, providing the possibility of designing inexpensive and versatile structured illumination microscopy (SIM) devices. In this paper, I first present a review of the state-of-the-art joint reconstruction methods in RIM, and then propose a unified joint reconstruction approach in which the performance of various regularization terms can be evaluated under the same model. The model hyperparameter is easier to tune and more robust than in previous methods, and the ℓ2,1 regularizer is shown to be a reasonable prior in most practical situations. Moreover, the degradation caused by out-of-focus light in conventional SIM can be easily removed in the RIM setup.

© 2020 Optical Society of America under the terms of the OSA Open Access Publishing Agreement

Retraction

This article has been retracted. Please see:
Penghuan Liu, "Unified joint reconstruction approach for random illumination microscopy: retraction," Biomed. Opt. Express 12, 1247-1247 (2021)
https://opg.optica.org/boe/abstract.cfm?uri=boe-12-3-1247

1. Introduction

The resolution of conventional optical microscopy is limited by the diffraction of waves. In the past twenty years, numerous super-resolution techniques have been proposed in fluorescence microscopy to surpass this limitation, such as stochastic optical reconstruction microscopy (STORM) [1] and stimulated emission depletion (STED) microscopy [2], enabling a resolution of approximately 10 to 100 nm.

In this paper, I focus on the technique of random illumination microscopy (following [3], we abbreviate it as RIM). RIM can be seen as a blind version of structured illumination microscopy (SIM), which achieves super-resolution by illuminating an object $\rho$ with a few structured patterns $I_m$. In fact, when RIM was first proposed, it was called blind-SIM [4]. In the linear regime, the measured dataset $\{ y_m\}_{m=1}^{M}$ is related to the sample $\rho$ by [5]:

$$y_m = h\ast (\rho I_m) + \epsilon_m$$
where $h$ is the point spread function (PSF) of the system and $\ast$ denotes the convolution operator. The product of the sample $\rho$ with structured patterns $I_m$ transfers otherwise unobservable high-frequency information of the sample into a region of lower frequency; therefore the high-frequency component can pass through the optical system [6]. As a wide-field imaging technique, standard SIM and RIM acquisitions are much faster than STORM and STED in a relatively large field of view scenario, and super-resolution imaging of living samples has been demonstrated for SIM and RIM [3,7].

The discretized form of (1), where each 2D quantity is displayed by a column vector, is:

$$\mathbf{y}_m = \mathbf{H} (\boldsymbol{\rho} \circ \mathbf{I}_m) + \boldsymbol{\epsilon}_m$$
in which $\mathbf {y}_m \in \mathbb {R}^{L}$ is the recorded raw image, $\mathbf {H} \in \mathbb {R}^{L\times N}$ is the discrete convolution matrix built from the discretized PSF and $\circ$ denotes the element-wise product. $\boldsymbol {\rho } \in \mathbb {R}^{N}$ denotes the discretized fluorescence density, $\mathbf {I}_m \in \mathbb {R}^{N}$ is the $m$-th illumination with homogeneous intensity mean $I_0$, and $\boldsymbol {\epsilon }_m \in \mathbb {R}^{L}$ is the noise in the imaging process.
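As a concrete illustration, a minimal Python sketch of the discretized forward model (2) follows; the Gaussian OTF standing in for the true PSF, the exponential-intensity speckle model, and all sizes are illustrative assumptions, not the settings of the actual experiments.

```python
import numpy as np

rng = np.random.default_rng(0)
n = 64                                   # image side; N = n * n pixels
rho = rng.random((n, n))                 # placeholder fluorescence density

# assumed low-pass OTF (a Gaussian stand-in for the true PSF of the system)
fx = np.fft.fftfreq(n)
FX, FY = np.meshgrid(fx, fx)
otf = np.exp(-(FX**2 + FY**2) / (2 * 0.05**2))

def H(v):
    """Apply the convolution matrix H via the FFT (periodic boundaries)."""
    return np.real(np.fft.ifft2(np.fft.fft2(v) * otf))

# fully developed speckle: exponentially distributed intensity, mean I0 = 1
M = 5
speckles = rng.exponential(scale=1.0, size=(M, n, n))
noise = 1e-3 * rng.standard_normal((M, n, n))          # epsilon_m
Y = np.stack([H(rho * I_m) for I_m in speckles]) + noise
```

Each slice of `Y` plays the role of one raw image $\mathbf{y}_m$: a speckle-modulated copy of the object, low-pass filtered by the PSF and corrupted by additive noise.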

In standard SIM, the object is illuminated by a set of harmonic patterns with designed spatial frequencies and phases. However, because of the dependence on perfect knowledge of the illumination, any distortion of the illumination induced within the investigated volume by the sample reduces the SR capacity of SIM and introduces strong artefacts [8,9]. One way to address this problem is to use uncontrolled speckle patterns as a substitute for the harmonic illumination in SIM. Compared with harmonic illumination, speckle patterns are easier to generate, while super-resolution is still attainable [4]. The spatial super-resolution or image quality of RIM may not be as good as that obtained by standard SIM with the same photon budget, but the study of this strategy is well worth the effort considering that control of the illumination may be difficult or even impossible in certain situations, such as in photoacoustic imaging or in fluorescence microscopy for some specific fluorochromes. Though more raw images are required in RIM than in standard SIM, its temporal resolution is as good as that of SIM thanks to an interleaved reconstruction strategy (i.e. the stacks of raw images forming neighboring super-resolved frames overlap). For two-color imaging and 3D imaging, the temporal resolution of RIM is better than that of SIM due to the simplicity of the experimental protocol [3].

We estimate the fluorescence image and the speckle patterns simultaneously in the joint reconstruction approach of RIM – so named because the quantity of interest (the fluorescence distribution) is obtained jointly with nuisance parameters (the unknown speckle patterns). By introducing an auxiliary variable $\mathbf {q}_m = \boldsymbol {\rho } \circ \mathbf {I}_m$, the image formation model (2) can be written in matrix form as:

$$\mathbf{Y} = \mathbf{H} \mathbf{Q} + \boldsymbol{\epsilon}$$
with $\mathbf {Y}=[\mathbf {y}_1,\ldots ,\mathbf {y}_M] \in \mathbb {R}^{L\times M}$ and $\mathbf {Q}=[\mathbf {q}_1,\ldots ,\mathbf {q}_M] \in \mathbb {R}^{N\times M}$. Now our task is to estimate matrix $\mathbf {Q}$ from the measurement matrix $\mathbf {Y}$. Once $\mathbf {Q}$ is obtained, $\boldsymbol {\rho }$ can be retrieved either by the mean of $\mathbf {q}_m$:
$$\boldsymbol{\rho} = \lim_{M\rightarrow \infty} \frac{1}{I_0} \bar{\mathbf{q}}$$
or by the standard deviation:
$$\boldsymbol{\rho} \propto \lim_{M\rightarrow \infty} \sqrt{\frac{1}{M}\sum_m\big(\mathbf{q}_m-\bar{\mathbf{q}}\big)^{2}}$$
where $\bar {\mathbf {q}} = \frac {1}{M} \sum _{m=1}^{M} \mathbf {q}_m$ and $(\cdot )^{2}$ means the element-wise square. The relation (5) holds because speckle patterns are second-order stationary.
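The two estimators can be sketched numerically; the exponential-intensity speckle model with $I_0 = 1$ (for which the intensity standard deviation equals $I_0$, so (5) holds with proportionality constant one) is an assumption used only for illustration.

```python
import numpy as np

rng = np.random.default_rng(1)
N, M, I0 = 4, 200_000, 1.0
rho = np.array([0.0, 0.5, 1.0, 2.0])        # toy fluorescence densities

# q_m = rho o I_m with fully developed speckle (exponential intensity, mean I0)
I = rng.exponential(I0, size=(N, M))
Q = rho[:, None] * I                        # N x M matrix of auxiliary variables

rho_mean = Q.mean(axis=1) / I0              # estimator (4): mean of q_m
rho_std = Q.std(axis=1)                     # estimator (5): std of q_m (here equal to rho)
```

For large $M$ both estimates approach `rho`, which is the asymptotic statement of (4) and (5).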

Several reconstruction methods have been proposed for RIM, as shown in [4,10–13]. In reference [13], a marginal approach is proposed where the estimator depends on the statistics of the speckle patterns, but not on their specific values. The super-resolution capacity of RIM has been demonstrated to be as good as that of classic SIM by taking advantage of the second-order statistics of the data in the asymptotic condition in a standard epi-illumination geometry. However, the computational complexity of the methods presented in [13] is $\mathcal {O}(N^{3})$, which is exceedingly high for realistic-size images. On the other hand, the methods shown in [4,10,11] share a similar framework, i.e. they try to reconstruct the object by minimizing a data fidelity term plus a regularizer:

$$\arg \min_\mathbf{Q} \;\; \frac{1}{2}\lVert \mathbf{Y}-\mathbf{H}\mathbf{Q} \rVert_F^{2} + \mu \Phi(\mathbf{Q})$$
where $\lVert \cdot \rVert _F$ denotes the Frobenius norm and $\Phi (\mathbf {Q})$ is the regularizer term that enforces a priori knowledge of $\mathbf {Q}$, such as the positivity constraint [4], positivity and sparsity constraint [10], or joint sparsity constraint [11]. A major difficulty to solve (6) is that the regularization functions are generally not differentiable. The state-of-the-art algorithms for solving the optimization problem of Eq. (6) belong to the iterative shrinkage/thresholding (IST) family [14], or their variants, such as FISTA [15] and TwIST [16].

In joint reconstruction approach of RIM, the super-resolution is induced by the regularizer term, while the data fidelity term yields no super-resolution information if only the first-order statistics of speckle are used [10]. Although various regularization terms have been proposed in previous articles, there are still unclear points in the existing methods:

  • It is not evident why the joint sparsity regularizer helps retrieve super-resolution, although it is widely used [11,17].
  • The hyperparameter that corresponds to the regularizer in model (6) is not easy to tune.
  • The two estimators (4) and (5) are chosen rather arbitrarily, and the difference between their performances remains to be explored.

This paper shows that the joint sparsity of matrix $\mathbf {Q}$ is related to the sparsity of the object $\boldsymbol {\rho }$ itself given that the speckle patterns are second-order stationary, and is thus a reasonable prior. To solve the hyperparameter tuning problem of form (6), we first transform it to a constrained form and then solve the constrained minimization problem with the help of an indicator function and a primal-dual splitting optimization algorithm. Simulation results show that the hyperparameter in the new model is more robust than $\mu$ in form (6). Finally, we demonstrate that the estimator (5) can help remove the background signal which is inevitable in real experiments.

2. Problem formulation

To tackle the hyperparameter tuning problem in [4,10,11], the unconstrained minimization model (6) is transformed to a constrained one:

$$\arg \min_\mathbb{Q} \;\Phi(\mathbf{Q}) \qquad \textrm{s.t.} \quad \lVert\mathbf{Y}-\mathbf{H}\mathbf{Q} \rVert_F \leq \xi$$
Models (7) and (6) are equivalent in the sense that, for any $\xi \geq 0$, the solution of (7) is either the null vector or a minimizer of (6) for some $\mu >0$ (see [18]). However, $\xi$ in (7) is much easier to set because it has a clear meaning: it is proportional to the standard deviation of the noise, whereas tuning the hyperparameter $\mu$ in (6) is a nontrivial task. (Note that the hyperparameter $\mu$ does not influence the performance of the positivity constraint presented in [4].) The approach taken to solve the constrained problem (7) is to first transform it into an unconstrained problem with the help of an indicator function and then solve the resulting optimization problem by the primal-dual splitting method [19].

To introduce the prior information, the $q$-th power of the $\ell _{p,q}$ norm of $\mathbf {Q}$ is chosen as the regularizer, leading to the constrained problem:

$$\arg \min_{\mathbf{Q}} \quad \lVert \mathbf{Q} \rVert_{pq}^{q} \qquad \textrm{subject to}\quad \big\lVert \mathbf{H}\mathbf{Q} - \mathbf{Y} \big\rVert_F \leq \xi$$
where $\lVert \mathbf {Q} \rVert _{pq}$ denotes the $\ell _{p,q}$ norm of matrix $\mathbf {Q}$:
$$\lVert\mathbf{Q} \rVert_{pq} = \Big( \sum_{n=1}^{N} \lVert \mathbf{q}^{n} \rVert_p^{q} \Big)^{1/q}, \quad p \geq 1,\; 0 < q \leq 1$$
with $\mathbf {q}^{n}$ the $n$-th row of $\mathbf {Q}$.
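A direct transcription of definition (9), with the row-wise $\ell_p$ norms combined by an $\ell_q$ norm; the helper name is assumed for illustration.

```python
import numpy as np

def lpq_norm(Q, p=2, q=1):
    """l_{p,q} norm of Q: take the l_p norm of each row q^n, then
    combine the resulting vector of row norms with an l_q norm (Eq. (9))."""
    row_norms = np.linalg.norm(Q, ord=p, axis=1)
    return float((row_norms ** q).sum() ** (1.0 / q))
```

For example, with $p=2$, $q=1$ (the $\ell_{2,1}$ norm favored later in the paper), the value is simply the sum of the Euclidean norms of the rows.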

2.1 Relationship between joint sparsity and prior information of object $\rho$

In this section, the relation between the joint sparsity of $\mathbf {Q}$ and the prior of object $\rho$ is analyzed. For the $n$-th row of matrix $\mathbf {Q}$, its $\ell _p$ norm is given by:

$$\lVert\mathbf{q}^{n}\rVert_p = \Big[\lvert \rho_nI_{1n} \rvert^{p} + \cdots +\lvert \rho_n I_{Mn} \rvert^{p} \Big]^{1/p}$$
when $p=1$ and $M$ is large, the law of large numbers gives $\frac{1}{M}\sum_m \lvert I_{mn}\rvert \to I_0$, so:
$$\lVert \mathbf{q}^{n} \rVert_1 = \rho_n \big(\lvert I_{1n}\rvert + \cdots + \lvert I_{Mn}\rvert\big) \approx MI_0\rho_n$$
when $p=2$, similarly $\frac{1}{M}\sum_m I_{mn}^{2} \to \kappa$, so:
$$\lVert \mathbf{q}^{n} \rVert_2 = \rho_n \big(I_{1n}^{2} + \cdots + I_{Mn}^{2}\big)^{1/2} \approx \sqrt{M \kappa}\,\rho_n$$
where $\kappa$ is a constant number for the fully developed speckle patterns [20, Chapter 7]. Therefore, the sparsity of $\lVert \mathbf {q}^{n} \rVert _p$ is equivalent to the sparsity of $\rho _n$ when $p=1$ or $2$ given that the speckle patterns are positive with homogeneous mean and second-order stationary.
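A quick Monte Carlo check of the two asymptotic relations above, under an assumed exponential-intensity speckle model (for which $\kappa = \mathbb{E}[I^2] = 2I_0^{2}$):

```python
import numpy as np

rng = np.random.default_rng(2)
M, I0 = 100_000, 1.0
rho = np.array([0.0, 1.0, 2.0])                # toy densities, one per row of Q

I = rng.exponential(I0, size=(rho.size, M))    # speckle rows: E[I] = I0, E[I^2] = 2*I0^2
Q = rho[:, None] * I

l1 = np.abs(Q).sum(axis=1) / (M * I0)          # row l1 norm / (M I0)       -> rho_n
kappa = 2 * I0**2
l2 = np.sqrt((Q**2).sum(axis=1) / (M * kappa)) # row l2 norm / sqrt(M kappa) -> rho_n
```

Both rescaled row norms recover `rho` up to Monte Carlo error, confirming that row sparsity of $\mathbf{Q}$ mirrors the sparsity of $\boldsymbol{\rho}$.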

2.2 Equivalently unconstrained form

To simplify the notation, let us define:

$$\mathcal{H} = \mathbb {1}_M \otimes \mathbf{H} ; \quad \boldsymbol{\mathfrak{y}} = \left[\mathbf{y}_1^{T}, \mathbf{y}_2^{T}, \ldots, \mathbf{y}_M^{T} \right]^{T}; \quad \boldsymbol{\mathfrak{q}} = \left[\mathbf{q}_1^{T}, \mathbf{q}_2^{T}, \ldots, \mathbf{q}_M^{T} \right]^{T}$$
where $\mathbb {1}_M \in \mathbb {R}^{M\times M}$ is the identity matrix and $\otimes$ denotes the Kronecker product. Let us partition $\boldsymbol{\mathfrak{q}}$ into $N$ groups $\{ \boldsymbol{\mathfrak{q}}_{\mathcal {G}_1},\ldots ,\boldsymbol{\mathfrak{q}}_{\mathcal {G}_N} \}$ with $\boldsymbol{\mathfrak{q}}_{\mathcal {G}_n} \in \mathbb {R}^{M}$ corresponding to the $n$-th row $\mathbf {q}^{n}$ in matrix $\mathbf {Q}$. Then, the $\ell _{p,q}$ norm of matrix $\mathbf {Q}$ is equivalent to the group $\ell _{\mathcal {G},p,q}$ norm of vector $\boldsymbol{\mathfrak{q}}$:
$$\lVert \boldsymbol{\mathfrak{q}} \rVert_{\mathcal{G} pq} = \Big( \sum_{n=1}^{N} \lVert \boldsymbol{\mathfrak{q}}_{\mathcal{G}_n} \rVert_p^{q} \Big)^{1/q}$$

Now, the problem (8) can be expressed in vector form:

$$\arg \min_{\boldsymbol{\mathfrak{q}}} \; \lVert \boldsymbol{\mathfrak{q}} \rVert_{\mathcal{G} pq}^{q} \qquad \textrm{s.t.}\;\; \lVert \mathcal{H}\boldsymbol{\mathfrak{q}} - \boldsymbol{\mathfrak{y}} \rVert \leq \xi$$

Inspired by the so-called C-SALSA algorithm [21], I first transform problem (15) into an unconstrained optimization problem by introducing an indicator function. The feasible set $\mathbf {E}(\xi ,\mathcal {H},\boldsymbol{\mathfrak{y}})$ is defined as:

$$\mathbf{E}(\xi,\mathcal{H},\boldsymbol{\mathfrak{y}}) =\big\{ \boldsymbol{\mathfrak{q}} \in \mathbb{R}^{MN} \mid \lVert \mathcal{H} \boldsymbol{\mathfrak{q}} - \boldsymbol{\mathfrak{y}} \rVert_2 \leq \xi \big\}$$
which may be unbounded in some directions. Then, problem (15) can be written as an unconstrained problem:
$$\arg \min_{\boldsymbol{\mathfrak{q}}} \; \lVert \boldsymbol{\mathfrak{q}} \rVert_{\mathcal{G} pq}^{q} + \iota_{\mathbf{E}(\xi,\mathbb{1},\boldsymbol{\mathfrak{y}})}(\mathcal{H}\boldsymbol{\mathfrak{q}})$$
where $\iota _{\mathcal {S}} : \mathbb {R}^{MN} \rightarrow \mathbb {R} \cup \{\infty \}$ is the indicator function of the set $\mathcal {S} \subset \mathbb {R}^{MN}$:
$$\iota_\mathcal{S}(\mathbf{s}) = \begin{cases} 0, & \mathbf{s} \in \mathcal{S} \\ \infty, & \mathbf{s} \not\in \mathcal{S} \end{cases}$$

2.3 Primal-dual splitting method

The optimization problem of (17) can be tackled using proximal-splitting algorithms, such as the alternating direction method of multipliers (ADMM) [21] or the primal-dual method proposed in [19]. In this paper I choose the latter. To solve (17) with the primal-dual algorithm, two auxiliary variables $\boldsymbol{\mathfrak{d}}$ and $\boldsymbol{\mathfrak{r}}$ are introduced, with $\boldsymbol{\mathfrak{d}} = \boldsymbol{\mathfrak{q}}$ and $\boldsymbol{\mathfrak{r}}=\mathcal {H} \boldsymbol{\mathfrak{q}}$. Then, (17) can be rewritten as:

$$\begin{aligned} \arg \min_{\boldsymbol{\mathfrak{q}}} \; f(\boldsymbol{\mathfrak{q}}) + g_1(\boldsymbol{\mathfrak{d}}) + g_2( \boldsymbol{\mathfrak{r}})\\ \textrm{with} \qquad f(\boldsymbol{\mathfrak{q}}) = 0, \quad g_1(\boldsymbol{\mathfrak{d}}) = \lVert \boldsymbol{\mathfrak{d}} \rVert_{\mathcal{G} pq}^{q}, \quad g_2( \boldsymbol{\mathfrak{r}}) = \iota_{\mathbf{E}(\xi,\mathbb {1},\boldsymbol{\mathfrak{y}})}(\boldsymbol{\mathfrak{r}}) \end{aligned}$$

Equation (19) is a particular case considered in [19], and the associated primal-dual algorithm is presented as follows:

[Algorithm 1: the primal-dual iteration, shown as an image (boe-11-9-5147-i001) in the original.]

$\textrm {prox}_f(\mathbf {x})$ in the iterations denotes the proximal operator of the function $f$, whose definition is given by [14]:

$$\textrm{prox}_{\lambda f}(\mathbf{x}) = \arg \min_{\mathbf{y}} f(\mathbf{y}) + \frac{1}{2\lambda}\lVert \mathbf{x} - \mathbf{y}\rVert^{2}$$
The proximal operator for $f^{*}$, the conjugate function of $f$, can be obtained from the relation:
$$\textrm{prox}_{\lambda f^{*}}(\mathbf{x}) = \mathbf{x} - \lambda\, \textrm{prox}_{f/ \lambda }(\mathbf{x}/\lambda)$$
For the indicator function $g_2$, its proximal operator can be written analytically [21] as:
$$\textrm{prox}_{\iota_{\mathbf{E}(\xi,\mathbb {1},\boldsymbol{\mathfrak{y}})}/\beta}(\mathbf{x}) = \boldsymbol{\mathfrak{y}} + \begin{cases} \xi \frac{ \mathbf{x}-\boldsymbol{\mathfrak{y}} }{ \lVert \mathbf{x}-\boldsymbol{\mathfrak{y}} \rVert_2 }, & \lVert \mathbf{x}-\boldsymbol{\mathfrak{y}} \rVert_2 > \xi \\ \mathbf{x}-\boldsymbol{\mathfrak{y}}, & \lVert \mathbf{x}-\boldsymbol{\mathfrak{y}} \rVert_2 \leq \xi \end{cases}$$
The proximal operator for $\ell _{\mathcal {G},p,q}$ norm of different $(p,q)$ pairs is shown in Appendix A.
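The projection defining the proximal operator of the indicator of $\mathbf{E}(\xi,\mathbb{1},\boldsymbol{\mathfrak{y}})$ can be sketched as follows (the function name is illustrative):

```python
import numpy as np

def prox_indicator_ball(x, y, xi):
    """Proximal operator of the indicator of E(xi, 1, y): the Euclidean
    projection of x onto the l2-ball of radius xi centred at y.  Points
    inside the ball are left untouched; points outside are pulled back
    radially onto the boundary."""
    r = x - y
    nrm = np.linalg.norm(r)
    if nrm <= xi:
        return x.copy()
    return y + xi * r / nrm
```

Note that, as the closed-form expression above shows, the scaling $1/\beta$ has no effect: the proximal operator of an indicator function is always a plain projection.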

Following [19, Theorem 5.2], the convergence of the primal-dual iteration is guaranteed if $q=1$ and the parameters $(\tau ,\sigma ,\theta )$ in Algorithm 1 satisfy:

$$\tau \sigma \big\lVert \mathbb {1}_{MN} + \mathcal{H}^{*}\mathcal{H} \big\rVert_{op} \leq 1 , \qquad \theta \in ] 0,2 [$$
where $\mathcal {H}^{*}$ denotes the conjugate transpose of matrix $\mathcal {H}$ and $\lVert \cdot \rVert _{op}$ is the operator norm of the corresponding matrix. From the definition of the operator norm, we have:
$$\big\lVert \mathbb {1}_{MN} + \mathcal{H}^{*} \mathcal{H} \big\rVert_{op} \leq \lVert \mathbb {1}_{MN}\rVert_{op} + \lVert \mathcal{H}^{*} \mathcal{H} \rVert_{op}$$
with:
$$\lVert \mathcal{H}^{*} \mathcal{H} \rVert_{op} = \lVert \mathcal{H} \rVert_{op}^{2} = \lVert \mathbb {1}_M \rVert_{op}^{2} \lVert \mathbf{H} \rVert_{op}^{2}$$
Since $\mathbf {H}$ is a low-pass convolution operator with symmetric boundary conditions, we have $\lVert \mathbf {H} \rVert _{op} = 1$, so:
$$\big\lVert \mathbb {1}_{MN} +\mathcal{H}^{*} \mathcal{H} \big\rVert_{op} \leq 2$$
Consequently, the primal-dual algorithm will converge as long as $\tau \sigma \leq \frac {1}{2}$ and $\theta \in ]0,2[\,$ in the case that $q=1$. When $0< q<1$, the algorithm is no longer guaranteed to reach the global minimum since the $\ell _{p,q}$ term is not convex.

The computational burden of the primal-dual algorithm lies mainly in the products $\mathcal {H}\boldsymbol{\mathfrak{q}}$ and $\mathcal {H}^{*}\boldsymbol{\mathfrak{r}}$. Taking advantage of the fast Fourier transform (FFT) algorithm, the computational complexity of each primal-dual iteration is $\mathcal {O}(MN\log N)$.
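Since Algorithm 1 appears only as an image in the original, a hedged sketch of a Condat-style primal-dual iteration for problem (19) is given below; the callables, their signatures, and the fixed iteration count are illustrative assumptions, not the authors' exact implementation. The dual steps use the Moreau identity to express the proximal operators of the conjugate functions through those of $g_1$ and $g_2$.

```python
import numpy as np

def primal_dual(y, H, Ht, prox_g1, project_ball, tau=0.35, sigma=1.0,
                theta=1.0, n_iter=500):
    """Sketch of primal-dual splitting for min_q g1(q) + iota_E(H q), f = 0.
    H / Ht apply the forward operator and its adjoint; prox_g1(z, t) is the
    prox of t*g1; project_ball projects onto the residual ball E(xi, 1, y)."""
    q = np.zeros_like(Ht(y))
    d = np.zeros_like(q)               # dual variable attached to g1 (operator = identity)
    r = np.zeros_like(y)               # dual variable attached to g2 (operator = H)
    for _ in range(n_iter):
        q_new = q - tau * (d + Ht(r))          # primal step (prox of f = 0 is the identity)
        bar = 2.0 * q_new - q                  # extrapolated primal point
        # dual ascent steps; prox of the conjugates via the Moreau identity
        d_tmp = d + sigma * bar
        d_new = d_tmp - sigma * prox_g1(d_tmp / sigma, 1.0 / sigma)
        r_tmp = r + sigma * H(bar)
        r_new = r_tmp - sigma * project_ball(r_tmp / sigma)
        # over-relaxation with parameter theta
        q, d, r = (q + theta * (q_new - q),
                   d + theta * (d_new - d),
                   r + theta * (r_new - r))
    return q
```

With $\mathcal{H}$ replaced by the identity and $g_1 = \ell_1$, this reduces to $\min_q \lVert q\rVert_1$ s.t. $\lVert q - y\rVert \leq \xi$, a convenient sanity check; the step sizes satisfy the condition $\tau\sigma\lVert \mathbb{1} + \mathcal{H}^{*}\mathcal{H}\rVert_{op} \leq 1$ of the convergence analysis above.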

3. Simulation results and experiments

To study the numerical performance of the proposed $\ell _{p,q}$ norm model, a 2D ‘star-like’ simulated target, whose fluorescence density in polar coordinates is given by $\rho (r,\theta ) \propto [1+\cos (40\theta )]$, is used as the true object. The top-left quarter of the object is shown in Fig. 1. One advantage of this object is that its spatial frequencies increase when approaching the star center, making it easy to visualize the resolution improvement. The point spread function is chosen as:

$$h(r,\theta) = \Big(\frac{J_1(NAk_0r)}{k_0r}\Big)^{2}\frac{k_0^{2}}{\pi}$$
where $J_1$ is the first-order Bessel function of the first kind, NA is the objective numerical aperture set to 1.49 and $k_0=\frac {2\pi }{\lambda }$ is the free-space wavenumber, with $\lambda$ the emission and excitation wavelength. The radius $r$ from the center of the object down to which conventional wide-field microscopy can resolve the pattern can easily be deduced from the relation:
$$\frac{2\pi r}{40} = \frac{\lambda}{2NA}$$
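The PSF and this cutoff radius can be evaluated numerically; working in units of the wavelength ($\lambda = 1$) is an assumption made only for illustration.

```python
import numpy as np
from scipy.special import j1     # first-order Bessel function of the first kind

lam = 1.0                        # express lengths in units of the wavelength
NA = 1.49
k0 = 2.0 * np.pi / lam

def psf(r):
    """Airy-pattern PSF h(r) = (J1(NA k0 r) / (k0 r))^2 k0^2 / pi, with the
    r -> 0 limit (NA/2)^2 k0^2 / pi (from J1(x) ~ x/2) filled in by hand."""
    r = np.asarray(r, dtype=float)
    rs = np.where(r > 0, r, 1.0)                        # dummy value where r == 0
    vals = (j1(NA * k0 * rs) / (k0 * rs)) ** 2 * k0**2 / np.pi
    return np.where(r > 0, vals, (NA / 2.0) ** 2 * k0**2 / np.pi)

# radius inside which the 40-branch star is unresolved by wide-field imaging:
# 2 pi r / 40 = lam / (2 NA)  =>  r = 10 lam / (pi NA)
r_limit = 10.0 * lam / (np.pi * NA)
```

For NA = 1.49 this gives $r \approx 2.14\lambda$, consistent with the green circle drawn in Fig. 1.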


Fig. 1. Reconstructed object with 300 speckle patterns and 40 dB Gaussian noise by previously reported methods. The green solid lines (resp. red dashed lines) correspond to the spatial frequencies transmitted by the OTF support (resp. twice the OTF support); the distance unit in the images is the wavelength $\lambda$.


The sampling step in the object should be finer than $\lambda / 8NA$ to observe an SR factor of two. In the simulations, a sampling step of $\lambda / 20$ is adopted so that aliasing does not destroy the attainable SR. For the sampling rate in the raw images, no information is lost as long as it is higher than the Nyquist rate $4NA/\lambda$. In the simulations performed in this section, I set the sampling rate for the raw images to be the same as that of the object.

The speckle patterns are generated through the same optical device as used for the collection of raw images, unless otherwise stated. Under this condition, the frequency support of the speckle has the same shape as the OTF of the system for the unapodized pupil [20, Section 7.7]. The boundary conditions of the object $\boldsymbol {\rho }$ are assumed to be periodic; thus, the convolution matrix $\mathbf {H}$ has a block-circulant with circulant-blocks (BCCB) structure [22], and the matrix-vector product $\mathbf {H}\mathbf {v}$ can be obtained with the fast Fourier transform (FFT) algorithm.
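The BCCB structure can be verified on a tiny grid by comparing the FFT-based product with an explicitly built dense matrix; the grid size and the random PSF are illustrative.

```python
import numpy as np

n = 8                                          # tiny grid so the dense matrix is affordable
rng = np.random.default_rng(3)
psf = rng.random((n, n))
psf /= psf.sum()                               # placeholder normalized PSF

otf = np.fft.fft2(psf)                         # the eigenvalues of the BCCB matrix H

def H_fft(v):
    """Product H v in O(N log N) via the FFT instead of the dense matrix."""
    return np.real(np.fft.ifft2(np.fft.fft2(v.reshape(n, n)) * otf)).ravel()

# dense BCCB matrix built explicitly: column (i, j) is the PSF circularly
# shifted to pixel (i, j), i.e. the response to an impulse at that pixel
cols = [np.roll(psf, (i, j), axis=(0, 1)).ravel()
        for i in range(n) for j in range(n)]
H_dense = np.column_stack(cols)

v = rng.random(n * n)
```

`H_dense @ v` and `H_fft(v)` agree to machine precision, while the FFT route avoids ever forming the $N \times N$ matrix.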

First, numerical simulations are performed with 300 speckle patterns. The low-resolution raw images are corrupted with Gaussian white noise, with a corresponding SNR of 40 dB. In the primal-dual algorithm, I set $\theta = \sigma =1$ and $\tau = 0.35$, with $\boldsymbol{\mathfrak{q}}_0, \boldsymbol{\mathfrak{d}}_0,\boldsymbol{\mathfrak{r}}_0$ initialized with zeros. $\xi$ is set to its true value $\xi _{\textrm {real}} = \sqrt {MN}\nu$, where $\nu$ is the standard deviation of the noise, unless otherwise stated.

The Wiener deconvolution of the mean of the raw images $\bar {\mathbf {y}} = \frac {1}{M} \sum _m \mathbf {y}_m$ is shown in Fig. 1(b). As expected, we see no super-resolution (patterns inside the green solid line) in the Wiener deconvolution of the wide-field image. The reconstructed image obtained by the method presented in [4], which uses only the positivity constraint, is shown in Fig. 1(c). It retrieves partial super-resolution information; however, the modulation contrast in the super-resolution part is relatively low, coinciding with the results reported in [4]. Figures 1(d) and 1(e) are obtained using the $\ell _{2,0}$ norm regularizer with the M-SBL algorithm as in [11] and the $\ell _1 + \ell _2$ norm plus positivity regularizer with the PPDS algorithm presented in [10], respectively. The image reconstructed by the M-SBL algorithm does not scale well, and we see strong bias in the low-resolution part. I stop the M-SBL iterations after a fixed number of iterations (i.e., 20), as indicated in reference [11]. The computational complexity of the M-SBL algorithm is in fact $\mathcal {O}(N^{3})$, as high as that of the marginal approach [13] plotted in Fig. 1(f). The reconstruction by the marginal approach agrees with its theoretical predictions; however, the computational burden makes it unrealistic for realistic-size images. Possible ways to reduce the computational burden of the marginal approach are beyond the scope of this paper.

3.1 Reconstruction with different $\{p,q\}$ pairs

Figure 2 shows the reconstruction results obtained by minimizing the constrained $\ell _{p,q}$ model with different $(p,q)$ pairs. To measure the quality of the reconstructed images, the normalized radially averaged power spectrum (RAPS) of the error images is plotted, which is defined as:

$$\begin{aligned} f(r) = \frac{\int_{-\pi}^{\pi} \big\lvert\widetilde{\hat{\rho}}(\mathbf{u})-\widetilde{\rho^{*}}(\mathbf{u}) \big\rvert^{2} d\theta}{\int_{-\pi}^{\pi} \lvert\widetilde{\rho^{*}}(\mathbf{u}) \rvert^{2} d\theta}\\ \qquad \textrm{with} \quad \mathbf{u} =\left[ \begin{array}{c} r\cos \theta\\ r\sin\theta \end{array}\right],\; r>0,\;\theta \in ({-}\pi,\pi] \end{aligned}$$
where $\hat {\rho }$ and $\rho ^{*}$ denote the estimated and true objects, respectively, and $\widetilde {(\cdot )}$ denotes the 2D Fourier transform. The normalized RAPS of the errors obtained with the different regularizers is displayed in Fig. 3. The $\ell _{2,1}$ regularizer has the lowest error power over almost the entire spectrum. The strong error power in the high-frequency part with the $\ell _{2,1/2}$ and $\ell _{2,2/3}$ regularizers is likely caused by the binarizing effect in the reconstructed images.
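A discrete version of the normalized RAPS can be sketched as follows; the radial binning choices are assumptions made for illustration.

```python
import numpy as np

def normalized_raps(est, true, n_bins=30):
    """Normalized radially averaged power spectrum of the error image:
    error power accumulated in each radial frequency bin, divided by the
    power of the true object in the same bin."""
    E = np.abs(np.fft.fftshift(np.fft.fft2(est - true))) ** 2   # error power
    T = np.abs(np.fft.fftshift(np.fft.fft2(true))) ** 2         # reference power
    n = true.shape[0]
    yy, xx = np.indices(true.shape)
    rad = np.hypot(xx - n // 2, yy - n // 2)                    # radial frequency map
    bins = np.linspace(0.0, rad.max() + 1e-9, n_bins + 1)
    idx = np.clip(np.digitize(rad.ravel(), bins) - 1, 0, n_bins - 1)
    num = np.bincount(idx, weights=E.ravel(), minlength=n_bins)
    den = np.bincount(idx, weights=T.ravel(), minlength=n_bins)
    return num / np.maximum(den, 1e-30)
```

A perfect reconstruction gives an identically zero curve; larger values at high radii indicate loss of super-resolved detail.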


Fig. 2. Reconstructed object with 300 speckle patterns and 40 dB Gaussian noise by minimizing the $\ell_{p,q}$ norm of the unconstrained form. Estimator (5) is marked by ‘std’, and (4) by ‘mean’ respectively.



Fig. 3. Normalized RAPS of the reconstructed objects obtained by averaging $\mathbf{q}_m$ in RIM with different regularizers.


As for the two estimators (4) and (5): on the one hand, we see no difference in terms of super-resolution when the $\ell _{2,1}$ prior is selected (see Figs. 2(b) and 2(e)). On the other hand, the standard deviation of the Wiener deconvolutions yields partial super-resolution, as shown in Fig. 2(f), even though their mean yields no super-resolution [10]. The differences between the two estimators are discussed further in Section 3.2.

3.2 Two estimators and background removal

The real images recorded by a microscope are always blurred by an out-of-focus background. A more accurate model than (2) for the imaging process is:

$$\mathbf{y}_m = \mathbf{H}\mathbf{q}_m + \boldsymbol{\epsilon}_m + \mathbf{b}$$
with $\mathbf {b} \in \mathbb {R}^{L}$ denoting the background. Consequently, the reconstructed $m$-th column of $\mathbf {Q}$ is in fact $\hat {\mathbf {q}}_m =\mathbf {q}_m^{\perp } + \mathbf {H}^{+}(\boldsymbol {\epsilon }_m + \mathbf {b})$, with $\mathbf {q}_m^{\perp }$ denoting the estimate of $\mathbf {q}_m$ in the absence of noise and $\mathbf {H}^{+}$ the pseudo-inverse of $\mathbf {H}$. If we continue estimating $\boldsymbol {\rho }$ by averaging $\hat {\mathbf {q}}_m$, then the estimated object will be blurred by $\mathbf {H}^{+}\mathbf {b}$:
$$\hat{\boldsymbol{\rho}} = \frac{1}{MI_0} \sum_m \hat{\mathbf{q}}_m = \boldsymbol{\rho}^{\perp} + \frac{1}{I_0}\mathbf{H}^{+}\mathbf{b}$$

The nonmodulated background signal dramatically degrades the image quality in conventional SIM [23]. Instead of modelling the background with a smooth function [24] or carrying out background subtraction heuristically [25], a natural strategy that is much easier in the RIM setup is to estimate the object by Eq. (5), leveraging the fact that the speckle patterns form a second-order stationary random process. Unlike the ensemble mean, the empirical variance of $\hat {\mathbf {q}}_m$ is not corrupted by the background, so (5) holds even when a strong background signal is present.

To verify this point, simulation results using 300 speckle patterns and 40 dB Gaussian noise with a fixed background (Cameraman) are shown in Fig. 4. The images in Figs. 4(b) and 4(c) are obtained by minimizing the constrained $\ell _{2,1}$ regularizer. As expected, both the Wiener deconvolution of the wide-field image and the mean of $\hat {\mathbf {q}}_m$ are blurred by the background, while the standard deviation of $\hat {\mathbf {q}}_m$ (Fig. 4(c)) is rather clean.


Fig. 4. Reconstructed object with 300 speckle patterns and 40 dB Gaussian noise with a fixed (lena) background.


3.3 Influence of the hyperparameter

The hyperparameter $\xi$, which reflects the standard deviation of the additive noise, was assumed to be known in the previous simulations. In this section, its influence on the estimator is explored by considering the cases in which it is not correctly set. The reconstruction results with 300 speckle patterns and 40 dB white noise using different $\xi$ values are shown in the first row of Fig. 5. When $\xi$ is equal to 5 times its true value, we partially lose the super-resolution. However, if $\xi$ is chosen to be much lower than its true value, as shown in Figs. 5(b) and 5(c), then no evident visual differences are observed in comparison with the situation in which it is correctly set (Fig. 2(b)). In contrast, the reconstructed image quality obtained by solving the optimization problem of the form (6) using the FISTA algorithm [26,27] is more sensitive to the hyperparameter $\mu$. When $\mu$ is set relatively high, we observe strong artifacts in the low-frequency component of the image.


Fig. 5. Reconstructed images using the $\ell_{2,1}$ regularizer with estimator (5) and different hyperparameters. All simulations are performed using the same measured dataset $\mathbf{Y}$ generated by 300 speckle patterns and 40 dB Gaussian noise. The images presented in the first row are obtained by solving (7), while the images in the second row correspond to (6).


3.4 Necessary number of speckle patterns

The $\ell _{p,q}$ norm prior is based on the statistical properties of the speckle patterns and is reasonable only when $M$ is relatively large (Eqs. (11) and (12)). In real experiments, more illumination patterns mean a longer data acquisition time and thus a lower temporal resolution. How many speckle patterns are required to achieve reasonable performance? To answer this question, simulations are performed under 40 dB Gaussian noise with various numbers of illuminations and the $\ell _{2,1}$ prior, as shown in Fig. 6. Clearly, as the number of illuminations increases, the quality of the reconstruction improves; when the number of illuminations is low, strong artifacts are observed.

Figure 6(d) shows the mean square error (MSE) of $5$ simulations for each number of speckle patterns as a cloud of points, with the MSE of a reconstructed image given by $\textrm {MSE} =\frac {1}{N} \lVert \boldsymbol {\rho }-\hat {\boldsymbol {\rho }}\rVert _2^{2}$. The solid line indicates the average MSE for each illumination number $M$. We see that both the MSE values and their variability decrease as the number of frames increases. This provides practical guidance for choosing the number of frames needed to satisfy a target MSE value. Based on the simulation results, I suggest that the number of speckle patterns should be no less than $80$ in real experiments with the $\ell _{2,1}$ prior. Furthermore, Fig. 6(d) indicates that the MSE of estimator (4) is generally smaller than that of estimator (5). This is not surprising considering that the sample mean converges faster than the sample standard deviation (see Appendix B).


Fig. 6. Reconstruction using various numbers of speckle patterns and 40 dB Gaussian noise with $\ell_{2,1}$ prior.


3.5 Applying Poisson noise

In the previous simulations, we did not consider the shot noise caused by the random arrival of photons. For a given photon, the probability of its arrival within a given time period is governed by a Poisson distribution. Our analysis suggests that the $\ell _{p,q}$ regularizer and the estimators (4) and (5) remain valid after introducing Poisson noise as long as the photon count rates are not too low, which is true in most practical situations. Even at low photon count rates, the influence of Poisson noise on the $\ell _{1,1}$ prior together with estimator (4) can be neglected (see details in Appendix C).

Figure 7 shows the reconstructions using 300 speckle patterns under a mixture of Poisson and Gaussian noise; the SNR of the added Gaussian noise is 20 dB. Figure 7(a) shows that the $\ell _{2,1}$ regularizer performs well, as expected, when the average photon number for each pixel in one raw image is set to 100. In the low-photon-count case (20 photons per pixel on average), the image quality and resolution obtained with the $\ell _{2,1}$ prior are dramatically degraded (Figs. 7(b) and 7(c)), while the reconstruction with the $\ell _{1,1}$ prior together with estimator (4) remains robust (Fig. 7(e)) in comparison with the reconstruction that ignores the Poisson statistics (Fig. 7(f)).


Fig. 7. Reconstructions under Poisson noise with 300 speckle patterns and various average photon counts per pixel.


3.6 Reconstructions from experimental data

To provide a more convincing illustration of the super-resolution capacity of RIM, the processing of experimental datasets is presented in this section. The raw images are obtained with an objective of $NA=1.49$ and $100\times$ magnification. The excitation wavelength is $405$ nm for the Argolight sample and $488$ nm for the bead and podosome samples, while the collection wavelength is $520$ nm. The PSF is simulated using the ICY plug-in PSF Generator with the Gibson & Lanni 3D optical model. The spatial sampling step is set slightly below $\frac {\lambda }{4NA}$, i.e., the sampling rate is slightly above the Nyquist rate.

Reconstructed images obtained by minimizing the constrained $\ell _{2,1}$ criterion are displayed in Figs. 8 and 9. The Argolight sample shown in Fig. 8 is a calibrated slide in which, from left to right, the spacing between the two middle lines becomes narrower. The data shown in Fig. 9 are obtained from a fluorescent bead sample with $100$ nm bead diameter and from a podosome sample.

Fig. 8. Processing of real data obtained from Argolight sample using 100 speckle patterns.

Fig. 9. Reconstructed images of beads and podosome samples by the $\ell _{2,1}$ prior and estimator (5). The raw images of beads contain 100 speckle patterns, whereas the reconstruction of the podosome uses 80 speckle patterns.

The line section extracted from the Argolight reconstructions (indicated by the white dashed line in Fig. 8(a)) and normalized to $[0,1]$ in Fig. 8(d) reveals the superior resolution of RIM. Zoomed-in views of a small part of the images of the beads and podosome samples (marked by a green square in Fig. 9) are shown in Fig. 10. The reconstructions of the corresponding subimages obtained by the marginal approach are also plotted in Fig. 10. To remove the out-of-focus information in the raw images, I slightly adapt the objective function introduced in Ref. [13] and reconstruct $\boldsymbol {\rho }$ from the second-order statistics only (that is, the mean values are discarded) by minimizing:

$$\begin{aligned} D_R(\boldsymbol{\rho}) & = D_{KL}\big(\mathcal{N} (0,\hat{\boldsymbol{\Gamma}}_y)\|\mathcal{N} (0,\boldsymbol{\Gamma}_y)\big) \\ & = \frac{1}{2}\log \lvert \boldsymbol{\Gamma}_y \rvert+ \frac{1}{2}\textrm{Tr} \big(\boldsymbol{\Gamma}_y^{-1}\hat{\boldsymbol{\Gamma}}_y\big) + K \end{aligned}$$
where $\boldsymbol {\Gamma }_y$ indicates the theoretical covariance of $\mathbf {y}_m$ (respectively $\hat {\boldsymbol {\Gamma }}_y$, the empirical covariance) and $K$ is a constant number. The gradient of $D_R(\boldsymbol {\rho })$ is:
$$\nabla D_R(\boldsymbol{\rho}) = \Big(\Big(\boldsymbol{\Omega}^{T}\big(\boldsymbol{\Gamma}_y-\hat{\boldsymbol{\Gamma}}_y\big)\boldsymbol{\Omega}\Big)\circ \boldsymbol{\Gamma}_{\textrm{s}}\Big)\boldsymbol{\rho}$$
with $\circ$ denoting the Hadamard (i.e., element-wise) product, $\boldsymbol {\Omega } = \boldsymbol {\Gamma }_y^{-1}\mathbf {H}$, and $\boldsymbol {\Gamma }_{\textrm {s}}$ the covariance of the speckle patterns. The L-BFGS algorithm [28,29] is used to optimize the marginal criterion (32). Clearly, finer details are recovered after introducing the $\ell _{2,1}$ regularizer in comparison with the Wiener deconvolution of the wide-field images (Figs. 10(b) and 10(h)). The image obtained by estimator (5) is less blurred by the background signal than that obtained by estimator (4) (Figs. 10(d) and 10(j)), which is consistent with the previous simulations. The high-frequency structures are corroborated by the marginal approach, which rests on a firm theoretical foundation (Figs. 10(f) and 10(l)).
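As a sanity check, the gradient formula (33) can be verified against finite differences on a toy problem. The covariance model below, $\boldsymbol{\Gamma}_y = \mathbf{H}\,\mathrm{diag}(\boldsymbol{\rho})\,\boldsymbol{\Gamma}_{\textrm{s}}\,\mathrm{diag}(\boldsymbol{\rho})\,\mathbf{H}^{T} + \sigma^2\mathbf{I}$, follows from the forward model $\mathbf{y}_m = \mathbf{H}(\boldsymbol{\rho}\circ\mathbf{I}_m) + \boldsymbol{\epsilon}_m$; all matrices are small random stand-ins (the real $\mathbf{H}$ is a convolution matrix), not the experimental operators.

```python
import numpy as np

rng = np.random.default_rng(0)
N, sigma2 = 6, 0.05

# Random stand-ins for the operators and data covariance
H = rng.normal(size=(N, N))
A = rng.normal(size=(N, N))
Gamma_s = A @ A.T / N                              # speckle covariance (SPD)
Gamma_hat = np.cov(rng.normal(size=(N, 10 * N)))   # empirical covariance stand-in
rho = rng.uniform(0.5, 1.5, N)

def gamma_y(rho):
    """Covariance implied by y_m = H (rho .* I_m) + noise."""
    D = np.diag(rho)
    return H @ D @ Gamma_s @ D @ H.T + sigma2 * np.eye(N)

def D_R(rho):
    """Marginal criterion (32), dropping the constant K."""
    G = gamma_y(rho)
    _, logdet = np.linalg.slogdet(G)
    return 0.5 * logdet + 0.5 * np.trace(np.linalg.solve(G, Gamma_hat))

def grad_D_R(rho):
    """Gradient formula (33)."""
    G = gamma_y(rho)
    Omega = np.linalg.solve(G, H)                  # Gamma_y^{-1} H
    M = Omega.T @ (G - Gamma_hat) @ Omega
    return (M * Gamma_s) @ rho                     # Hadamard product with Gamma_s

# Central finite differences should agree with the analytical gradient
eps = 1e-6
g = grad_D_R(rho)
g_fd = np.array([(D_R(rho + eps * e) - D_R(rho - eps * e)) / (2 * eps)
                 for e in np.eye(N)])
```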

Fig. 10. Partial enlargement of the beads and podosome reconstructions obtained by different methods. (a,g) Averaging raw images $\bar{y}$. (b,h) Wiener deconvolution of $\bar{y}$. (c,i) The standard deviation of the Wiener deconvolutions $\hat{q}_m$. (d,j) $\ell_{2,1}$ prior together with estimator (4). (e,k) $\ell_{2,1}$ prior together with estimator (5). (f,l) Marginal approach by minimizing (32).

4. Conclusion

RIM can achieve super-resolution imaging with low toxicity and a temporal resolution comparable to that of standard SIM. In this paper, a unified joint reconstruction approach for RIM based on constrained $\ell _{p,q}$ norm minimization of the data is proposed. Mathematical analysis shows that the joint sparsity of the matrix $\mathbf {Q}$ implies sparsity of the object itself. The analysis takes the statistical priors of the speckle patterns into consideration, i.e., their ensemble average is homogeneous or they form a second-order stationary random process, as shown in Eqs. (11) and (12). Therefore, it is the sparsity of the object together with the statistical prior on the speckle patterns that induces super-resolution in the $\ell _{p,q}$ norm reconstruction strategy.

Among the $\ell _{p,q}$ family of priors, numerical simulations show that the $\ell _{2,1}$ regularizer is superior in terms of both error and super-resolution compared with the $\ell _{1,1}$ term. Note that this conclusion is drawn under fully developed speckle illumination. With other illumination statistics, or in the very low photon count regime, the $\ell _{2,1}$ regularizer is no longer a good choice since the illumination statistics change. When $q<1$, super-resolution information still appears, even though the associated primal-dual splitting method cannot be guaranteed to reach the global minimum because the corresponding regularizer is no longer a convex function. The binarizing effect in the reconstructions is quite evident for $q<1$, as shown in Fig. 2.

The hyperparameter involved in this model is proportional to the standard deviation of the noise and is thus easy to tune. We believe that the robust performance of the constrained form (7) can inspire other biomedical imaging inverse problems sharing the same form as (6), such as optical diffraction tomography and magnetic resonance imaging [30,31]. In applications such as diffraction tomography microscopy, a positivity constraint and total variation (TV) regularization are preferable. The TV norm of the object in our imaging setup is equivalent to the composition of the $\ell _{2,1}$ regularizer with a linear operator applied to $\mathbf {Q}$ (see Appendix D). The associated primal-dual splitting method can minimize this kind of objective function without major changes and without inverting the linear operator.

Normally, the inevitable background signal in experiments causes artifacts in the reconstruction and reduces the super-resolution of standard SIM. We have shown, in both numerical studies and on experimental data, that the estimator based on the standard deviation of $\mathbf {q}_m$ cancels out the background, at the cost of an acceptable increase in MSE compared with estimator (4) for the same number of speckle patterns. Only 2D super-resolution problems are considered in this paper; the application to 3D imaging of thick samples in the RIM setup deserves further exploration [3,8,32,33].

A. Proximal operator for the joint sparsity prior

This section focuses on the proximal operator of $g_1 = \lVert \boldsymbol{\mathfrak{d}} \rVert _{\mathcal {G} pq}^{q}$, whose definition is given by:

$$\begin{aligned} \textrm{prox}_{\lambda g_1}(\boldsymbol{\mathfrak{x}}) & = \arg \min_{\boldsymbol{\mathfrak{d}}} \; \lVert \boldsymbol{\mathfrak{d}} \rVert_{\mathcal{G} pq}^{q} + \frac{1}{2\lambda} \lVert \boldsymbol{\mathfrak{d}} - \boldsymbol{\mathfrak{x}}\rVert_2^{2}\\ & = \arg \min_{\boldsymbol{\mathfrak{d}}} \; \sum_{n=1}^{N} \Big ( \lVert \boldsymbol{\mathfrak{d}}_{\mathcal{G}_n} \rVert_p^{q} + \frac{1}{2\lambda} \lVert \boldsymbol{\mathfrak{d}}_{\mathcal{G}_n} - \boldsymbol{\mathfrak{x}}_{\mathcal{G}_n}\rVert_2^{2} \Big) \end{aligned}$$

Since the partitions of $\boldsymbol{\mathfrak{d}}_{\mathcal {G}_n}$ do not overlap, problem (34) can be decoupled into $N$ independent subproblems. Each subproblem is given below:

$$\arg \min_{\boldsymbol{\mathfrak{d}}_{\mathcal{G}_n}} \; \lVert \boldsymbol{\mathfrak{d}}_{\mathcal{G}_n} \rVert_p^{q} + \frac{1}{2\lambda} \lVert \boldsymbol{\mathfrak{d}}_{\mathcal{G}_n} - \boldsymbol{\mathfrak{x}}_{\mathcal{G}_n}\rVert_2^{2}$$

For specific $(p,q)$ pairs, the minimization problem (35) has the following analytical formulas [34]:

  • for $p=1$ and $q=1$:
    $$\boldsymbol{\mathfrak{d}}_{\mathcal{G}_n} = \textrm{sign}(\boldsymbol{\mathfrak{x}}_{\mathcal{G}_n}) \circ \max \Big\{ \lvert \boldsymbol{\mathfrak{x}}_{\mathcal{G}_n} \rvert - \lambda , 0 \Big\}$$
  • for $p=2$ and $q=1$:
    $$\boldsymbol{\mathfrak{d}}_{\mathcal{G}_n} = \max \Big\{1-\frac{\lambda}{ \lVert \boldsymbol{\mathfrak{x}}_{\mathcal{G}_n} \rVert_2}, 0 \Big\} \boldsymbol{\mathfrak{x}}_{\mathcal{G}_n}$$
  • for $p=2$ and $q = 1/2$:
    $$\boldsymbol{\mathfrak{d}}_{\mathcal{G}_n} = \begin{cases} \frac{16 \lVert \boldsymbol{\mathfrak{x}}_{\mathcal{G}_n} \rVert_2^{3/2}\omega_n}{3\sqrt{3}\lambda+ 16 \lVert\boldsymbol{\mathfrak{x}}_{\mathcal{G}_n} \rVert_2^{3/2}\omega_n} \boldsymbol{\mathfrak{x}}_{\mathcal{G}_n}, & \lVert \boldsymbol{\mathfrak{x}}_{\mathcal{G}_n} \rVert_2 > \frac{3}{2}(\lambda)^{2/3} \\ \boldsymbol{0} \;\textrm{or} \; \frac{16 \lVert\boldsymbol{\mathfrak{x}}_{\mathcal{G}_n}\rVert_2^{3/2}\omega_n}{3\sqrt{3}\lambda+ 16 \lVert\boldsymbol{\mathfrak{x}}_{\mathcal{G}_n} \rVert_2^{3/2}\omega_n} \boldsymbol{\mathfrak{x}}_{\mathcal{G}_n}, & \lVert \boldsymbol{\mathfrak{x}}_{\mathcal{G}_n} \rVert_2 = \frac{3}{2}(\lambda)^{2/3} \\ \boldsymbol{0}, & \lVert \boldsymbol{\mathfrak{x}}_{\mathcal{G}_n} \rVert_2 < \frac{3}{2}(\lambda)^{2/3} \end{cases}$$
    with:
    $$\omega_n = \cos^{3}(\frac{\pi}{3}-\frac{\psi_n}{3}), \qquad \psi_n = \arccos \bigg(\frac{\lambda}{4} \Big(\frac{3}{\lVert\boldsymbol{\mathfrak{x}}_{\mathcal{G}_n} \rVert_2}\Big)^{3/2} \bigg)$$
  • for $p=2$ and $q = 2/3$:
    $$\boldsymbol{\mathfrak{d}}_{\mathcal{G}_n} = \begin{cases} \frac{3 \eta^{4}}{2\lambda+3 \eta^{4}}\boldsymbol{\mathfrak{x}}_{\mathcal{G}_n}, & \lVert \boldsymbol{\mathfrak{x}}_{\mathcal{G}_n}\rVert_2 > 2(\frac{2}{3}\lambda)^{3/4}\\ 0 \; \textrm{or}\; \frac{3 \eta^{4}}{2\lambda+3 \eta^{4}}\boldsymbol{\mathfrak{x}}_{\mathcal{G}_n}, & \lVert \boldsymbol{\mathfrak{x}}_{\mathcal{G}_n}\rVert_2 = 2(\frac{2}{3}\lambda)^{3/4} \\ 0, & \lVert \boldsymbol{\mathfrak{x}}_{\mathcal{G}_n}\rVert_2 < 2(\frac{2}{3}\lambda)^{3/4} \end{cases}$$
    with
    $$ \begin{aligned} & \eta = \frac{1}{2} \Bigg(\lvert a \rvert + \sqrt{\frac{2\lVert \boldsymbol{\mathfrak{x}}_{\mathcal{G}_n}\rVert_2}{\lvert a \rvert}-a^{2}} \Bigg),\qquad a = \frac{2}{\sqrt{3}}\big(2\lambda \big)^{1/4} \Big(\cosh\big( \frac{\phi(\boldsymbol{\mathfrak{x}}_{\mathcal{G}_n})}{3}\big) \Big)^{1/2} \\ & \phi(\boldsymbol{\mathfrak{x}}_{\mathcal{G}_n}) = \textrm{arccosh}\Big( \frac{27\lVert\boldsymbol{\mathfrak{x}}_{\mathcal{G}_n} \rVert_2^{2}}{16 \big(2\lambda \big)^{3/2}}\Big) \end{aligned} $$
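The two convex cases above can be verified directly: whenever the minimizer of (35) is nonzero, it must satisfy the stationarity condition $\boldsymbol{\mathfrak{d}} + \lambda\,\boldsymbol{\mathfrak{d}}/\lVert\boldsymbol{\mathfrak{d}}\rVert_2 = \boldsymbol{\mathfrak{x}}$ for $p=2$, $q=1$. A minimal NumPy sketch of Eqs. (36) and (37) (test values are arbitrary):

```python
import numpy as np

def prox_l11(x, lam):
    # Eq. (36): component-wise soft thresholding (p = q = 1)
    return np.sign(x) * np.maximum(np.abs(x) - lam, 0.0)

def prox_l21(x, lam):
    # Eq. (37): group soft thresholding (p = 2, q = 1)
    nrm = np.linalg.norm(x)
    return np.zeros_like(x) if nrm <= lam else (1.0 - lam / nrm) * x

rng = np.random.default_rng(1)
x, lam = rng.normal(size=5), 0.4
d = prox_l21(x, lam)   # shrinks the whole group toward zero
```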

B. Statistical analysis about estimators (4) and (5)

The intensity of speckle patterns follows an exponential probability distribution [35, Chapter 3]:

$$p(I) = \frac{1}{I_0} \exp\big(-\frac{I}{I_0}\big)$$
for $I\geq 0$. The moments of this distribution are:
$$E[I^{q}] = (I_0)^{q} q!$$
Then we have:
$$E[I] = (I_0) \qquad \textrm{Var}(I) = E[I^{2}] - (E[I])^{2} = (I_0)^{2}$$
Consider the sample mean divided by $I_0$ as a new random variable: $S_m = \frac {1}{MI_0} \sum _{m=1}^{M} I_m$. Since $I_1, I_2, \ldots , I_M$ are independent and identically distributed, the central limit theorem gives:
$$S_m \sim \mathcal{N}(1, 1/M)$$
where $\mathcal {N}(1, 1/M)$ denotes the normal distribution with mean $1$ and variance $1/M$. Similarly, we define the sample variance as a new random variable:
$$V_m =\frac{1}{M} \sum_{m=1}^{M} (I_m)^{2} - (E[I])^{2} = \frac{1}{M} \sum_{m=1}^{M} (I_m)^{2} - (I_0)^{2}$$
From Eq. (41), we have:
$$E[I^{2}] = 2 (I_0)^{2} \qquad \textrm{Var} (I^{2}) = E[I^{4}] - (E[I^{2}])^{2} = 20 (I_0)^{4}$$
Combining Eqs. (44) and (45) with the central limit theorem gives:
$$V_m \sim \mathcal{N}\Big( (I_0)^{2}, \frac{20(I_0)^{4}}{M} \Big)$$
In estimator (5), what really matters is $\sqrt {V_m/(I_0)^{2}}$. Let’s define $X_m =V_m/(I_0)^{2}$, then
$$X_m \sim \mathcal{N}\Big( 1, 20/M \Big)$$
The second order Taylor approximation around $X_m = E[X_m]$ shows that:
$$\begin{aligned} E\Big[\sqrt{X_m}\Big] \approx \sqrt{E[X_m]} -\frac{1}{8} \Big(E[X_m]\Big)^{-3/2} \textrm{Var} (X_m) = 1- \frac{5}{2M}\\ \textrm{Var}\Big[\sqrt{X_m}\Big] \approx \bigg( \frac{1}{2} \Big(E[X_m] \Big)^{-1/2} \bigg)^{2} \textrm{Var}(X_m ) = \frac{5}{M} \end{aligned}$$
Since $\textrm {Var}\big[\sqrt {X_m}\big] \approx 5/M > \textrm {Var} \big (S_m\big ) = 1/M$, the MSE of estimator (5) is larger than that of estimator (4), as shown in Fig. 6(d).
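These variance predictions are easy to confirm by Monte Carlo simulation of exponential speckle intensities; the values of $I_0$, $M$, and the number of trials below are arbitrary choices.

```python
import numpy as np

rng = np.random.default_rng(0)
I0, M, trials = 2.0, 500, 20_000

I = rng.exponential(I0, size=(trials, M))   # fully developed speckle intensities
S = I.mean(axis=1) / I0                     # S_m: sample mean / I0, Eq. (43)
V = (I**2).mean(axis=1) - I0**2             # V_m: Eq. (44)
sqrtX = np.sqrt(V / I0**2)                  # the quantity used by estimator (5)
```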

C. Validity of $\ell _{p,q}$ regularizer under Poisson noise

The data recorded by the detectors obey Poisson statistics. If we ignore the influence of the point spread function and define:

$$\mathbf{k}_m = \mathcal{P}(\mathbf{q}_m)$$
where $\mathcal {P}(\mathbf {q}_m)$ denotes a realization of the Poisson process with mean $\mathbf {q}_m$, then the validity of the $\ell _{p,q}$ regularizer depends on the statistics of $\mathbf {k}_m$. For brevity of expression, we remove the subscript $m$ and write $k_{r} = k(\mathbf {r}), \; q_{r} = q(\mathbf {r})$, in which $\mathbf {r} \in \{1,\ldots ,N\}$ denotes the spatial index, and $k_r =\mathcal {P}(q_r)$. Since the $q_r$ are themselves random variables, the Poisson variables $k_{r}$ are called doubly stochastic Poisson random variables [36].

Since $E[k_r|q_r] = q_r$, it follows directly that

$$\begin{aligned} E[k_r] &=\int E[k_r|q_r] \,p(q_r) \,dq_r\\ &=\int q_r\,p(q_r) \,dq_r = E[q_r]. \end{aligned}$$
Therefore, the mean of the Poisson vector $\mathbf {k}$ equals the mean of its parameters, i.e., $E[\mathbf {k}] = E[\mathbf {q}]$, so (11) and estimator (4) still hold for random vector $\mathbf {k}$.

Similarly, one can obtain the second-order statistics of $\mathbf {k}$:

$$ E[k_r^{2}|q_r] = \textrm{Var} (k_r|q_r) + (E[k_r|q_r])^{2} = q_r + q_r^{2}. $$
Therefore,
$$E[k_r^{2}] = \int (q_r + q_r^{2}) p(q_r) dq_r = E[q_r] + E[q_{r}^{2}]$$
Substituting (50) into (12), we finally obtain:
$$\begin{aligned} \lim_{M\to\infty} \lVert \mathbf{k}^{n} \rVert_2 & =\bigg( \rho_n^{2} \big(I_{1n}^{2} + \cdots + I_{Mn}^{2}\big) + \rho_n \big(\lvert I_{1n}\rvert + \cdots + \lvert I_{Mn}\rvert \big) \bigg)^{1/2} \\ & = \sqrt{ M\kappa \rho_n^{2}+ M I_0\rho_n} \end{aligned}$$

As the second-order moments depend quadratically on the random vector $\mathbf {q}$, the changes introduced by the Poisson statistics can be neglected except when $\mathbf {q}$ is small, i.e., in low photon count cases. Even in the low-photon situation, the above analysis shows that the $\ell _{1,1}$ regularizer together with estimator (4) remains valid.
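A quick Monte Carlo check of these moment relations for doubly stochastic Poisson variables; the exponential mean of 5 photons and the sample size are arbitrary choices.

```python
import numpy as np

rng = np.random.default_rng(0)
n = 200_000
q = rng.exponential(5.0, n)   # q_r: speckle-modulated intensities (mean 5 photons)
k = rng.poisson(q)            # doubly stochastic Poisson realizations, Eq. (49)

# Sample moments approximate E[k_r] = E[q_r] and E[k_r^2] = E[q_r] + E[q_r^2]
first_moment = k.mean()
second_moment = (k**2).mean()
```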

D. TV norm as the composition of the $\ell _{2,1}$ function with a linear operator of $\mathbf {Q}$

In some biomedical imaging applications, such as diffraction tomography microscopy, the TV norm of the object is preferable. Given an $N_1\times N_2$ object $\rho$, the isotropic TV is defined as:

$$TV(\rho) = \sum_{n_1 =1}^{N_1} \sum_{n_2 =1}^{N_2} \sqrt{(\rho[n_1+1,n_2]-\rho[n_1,n_2])^{2}+(\rho[n_1,n_2+1]-\rho[n_1,n_2])^{2}}$$
According to the definitions of the TV norm and the $\ell _{\mathcal {G},p,q}$ norm, one can easily verify that the TV norm of $\boldsymbol {\rho }$ can be seen as the $\ell _{\mathcal {G},2,1}$ norm of $\mathbf {C}\boldsymbol {\rho }$, with $\mathbf {C} = (\mathbf {C}_1;\mathbf {C}_2) \in \mathbb {R}^{2N \times N}$ and $\mathbf {C}_1, \mathbf {C}_2$ the first-order horizontal and vertical finite difference operators, and the $n$-th group given by $(\mathbf {C}\boldsymbol {\rho })_{\mathcal {G}_n} = [(\mathbf {C}_1\boldsymbol {\rho })_n; (\mathbf {C}_2\boldsymbol {\rho })_n] \in \mathbb {R}^{2}$. Hence
$$\lVert \boldsymbol{\rho} \rVert_{TV} = \lVert\mathbf{C}\mathcal{A}\boldsymbol{\mathfrak{q}}\lVert_{\mathcal{G} 21}$$
with
$$\mathcal{A} = \left[ \begin{array}{ccc} \frac{1}{MI_0}\mathbb {1}_N,\ldots,\frac{1}{MI_0}\mathbb {1}_N \end{array} \right] \in \mathbb{R}^{N\times MN}$$
where $\mathbb {1}_N$ is the $N\times N$ identity matrix. Thanks to the flexibility of the associated primal-dual splitting algorithm, the linear composite term can be handled easily in this case.
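The identity $TV(\boldsymbol{\rho}) = \lVert\mathbf{C}\boldsymbol{\rho}\rVert_{\mathcal{G}21}$ can be checked numerically. The sketch below builds explicit difference matrices $\mathbf{C}_1, \mathbf{C}_2$ with periodic boundary conditions, a choice the text leaves unspecified, and compares against the TV computed directly from Eq. (52) with the same boundary convention.

```python
import numpy as np

rng = np.random.default_rng(2)
N1, N2 = 5, 4
rho = rng.normal(size=(N1, N2))

def diff_mat(n):
    # 1-D circular forward-difference matrix (periodic boundary assumed)
    return np.roll(np.eye(n), -1, axis=0) - np.eye(n)

C1 = np.kron(diff_mat(N1), np.eye(N2))   # rho[n1+1,n2] - rho[n1,n2] on vec(rho)
C2 = np.kron(np.eye(N1), diff_mat(N2))   # rho[n1,n2+1] - rho[n1,n2]

v = rho.ravel()                           # row-major vectorization of rho
# l_{G,2,1} norm of C rho with groups G_n = [(C1 rho)_n; (C2 rho)_n]
l_G21 = np.sqrt((C1 @ v)**2 + (C2 @ v)**2).sum()

# Isotropic TV of Eq. (52), with the same periodic boundary convention
tv = np.sqrt((np.roll(rho, -1, 0) - rho)**2
             + (np.roll(rho, -1, 1) - rho)**2).sum()
```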

Funding

China Scholarship Council (201404490041); Le GdR 720 ISIS (Information, Signal, Image et ViSion).

Acknowledgments

This work was mainly done during my PhD studies at École Centrale de Nantes, France. I would like to thank Jérôme Idier, Sébastien Bourguignon at École Centrale de Nantes, Laurent Mugnier from ONERA (Châtillon) and the anonymous reviewers for their valuable comments. I also thank Simon Labouesse, Thomas Mangeat for sharing the raw data corresponding to Figs. 8 and 9.

Disclosures

The author declares no conflicts of interest.

References

1. M. J. Rust, M. Bates, and X. Zhuang, “Sub-diffraction-limit imaging by stochastic optical reconstruction microscopy (STORM),” Nat. Methods 3(10), 793–796 (2006). [CrossRef]  

2. S. W. Hell and J. Wichmann, “Breaking the diffraction resolution limit by stimulated emission: stimulated-emission-depletion fluorescence microscopy,” Opt. Lett. 19(11), 780–782 (1994). [CrossRef]  

3. T. Mangeat, S. Labouesse, M. Allain, R. Poincloux, A. Bouissou, S. Cantaloube, E. Courtais, E. Vega, T. Li, A. Guenole, et al., “Super-resolved live-cell imaging using random illumination microscopy,” bioRxiv (2020).

4. E. Mudry, K. Belkebir, J. Girard, J. Savatier, E. Le Moal, C. Nicoletti, M. Allain, and A. Sentenac, “Structured illumination microscopy using unknown speckle patterns,” Nat. Photonics 6(5), 312–315 (2012). [CrossRef]  

5. J. Goodman, Introduction to Fourier Optics (Roberts & Company Publishers, 2005).

6. M. G. L. Gustafsson, “Surpassing the lateral resolution limit by a factor of two using structured illumination microscopy,” J. Microsc. 198(2), 82–87 (2000). [CrossRef]  

7. P. Kner, B. B. Chhun, E. R. Griffis, L. Winoto, and M. G. Gustafsson, “Super-resolution video microscopy of live cells by structured illumination,” Nat. Methods 6(5), 339–342 (2009). [CrossRef]  

8. A. Jost, E. Tolstik, P. Feldmann, K. Wicker, A. Sentenac, and R. Heintzmann, “Optical sectioning and high resolution in single-slice structured illumination microscopy by thick slice blind-sim reconstruction,” PLoS One 10(7), e0132174 (2015). [CrossRef]  

9. R. Ayuk, H. Giovannini, A. Jost, E. Mudry, J. Girard, T. Mangeat, N. Sandeau, R. Heintzmann, K. Wicker, K. Belkebir, and A. Sentenac, “Structured illumination fluorescence microscopy with distorted excitations using a filtered blind-SIM algorithm,” Opt. Lett. 38(22), 4723–4726 (2013). [CrossRef]  

10. S. Labouesse, A. Negash, J. Idier, S. Bourguignon, T. Mangeat, P. Liu, A. Sentenac, and M. Allain, “Joint reconstruction strategy for structured illumination microscopy with unknown illuminations,” IEEE Trans. on Image Process. 26(5), 2480–2493 (2017). [CrossRef]  

11. J. Min, J. Jang, D. Keum, S.-W. Ryu, C. Choi, K.-H. Jeong, and J. C. Ye, “Fluorescent microscopy beyond diffraction limits using speckle illumination and joint support recovery,” Sci. Rep. 3(1), 2075 (2013). [CrossRef]  

12. L.-H. Yeh, L. Tian, and L. Waller, “Structured illumination microscopy with unknown patterns and a statistical prior,” Biomed. Opt. Express 8(2), 695–711 (2017). [CrossRef]  

13. J. Idier, S. Labouesse, M. Allain, P. Liu, S. Bourguignon, and A. Sentenac, “On the super-resolution capacity of imagers using unknown speckle illuminations,” IEEE Trans. Comput. Imaging 4(1), 87–98 (2018). [CrossRef]  

14. P. L. Combettes and J.-C. Pesquet, “Proximal splitting methods in signal processing,” in Fixed-point Algorithms for Inverse Problems in Science and Engineering, (Springer, 2011), pp. 185–212.

15. A. Beck and M. Teboulle, “A fast iterative shrinkage-thresholding algorithm for linear inverse problems,” SIAM J. Imaging Sci. 2(1), 183–202 (2009). [CrossRef]  

16. J. M. Bioucas-Dias and M. A. T. Figueiredo, “A new TwIST: two-step iterative shrinkage/thresholding algorithms for image restoration,” IEEE Trans. on Image Process. 16(12), 2992–3004 (2007). [CrossRef]  

17. T. W. Murray, M. Haltmeier, T. Berer, E. Leiss-Holzinger, and P. Burgholzer, “Super-resolution photoacoustic microscopy using blind structured illumination,” Optica 4(1), 17–22 (2017). [CrossRef]  

18. R. T. Rockafellar, Convex Analysis (Princeton University Press, 2015).

19. L. Condat, “A primal–dual splitting method for convex optimization involving lipschitzian, proximable and linear composite terms,” J. Optim. Theory Appl. 158(2), 460–479 (2013). [CrossRef]  

20. J. W. Goodman, Statistical Optics (John Wiley & Sons, 2015).

21. M. V. Afonso, J. M. Bioucas-Dias, and M. A. Figueiredo, “An augmented Lagrangian approach to the constrained optimization formulation of imaging inverse problems,” IEEE Trans. on Image Process. 20(3), 681–695 (2011). [CrossRef]  

22. P. C. Hansen, J. G. Nagy, and D. P. O’leary, Deblurring Images: Matrices, Spectra, and Filtering (SIAM, 2006).

23. J. Demmerle, C. Innocent, A. J. North, G. Ball, M. Müller, E. Miron, A. Matsuda, I. M. Dobbie, Y. Markaki, and L. Schermelleh, “Strategic and practical guidelines for successful structured illumination microscopy,” Nat. Protoc. 12(5), 988–1010 (2017). [CrossRef]  

24. F. Orieux, E. Sepulveda, V. Loriette, B. Dubertret, and J. C. Olivomarin, “Bayesian estimation for optimized structured illumination microscopy,” IEEE Trans. on Image Process. 21(2), 601–614 (2012). [CrossRef]  

25. A. Lal, C. Shan, and P. Xi, “Structured illumination microscopy image reconstruction algorithm,” IEEE J. Select. Topics Quantum Electron. 22(4), 50–63 (2016). [CrossRef]  

26. A. Beck and M. Teboulle, “A fast iterative shrinkage-thresholding algorithm for linear inverse problems,” SIAM J. Imaging Sci. 2(1), 183–202 (2009). [CrossRef]  

27. T. W. Murray, M. Haltmeier, T. Berer, E. Leiss-Holzinger, and P. Burgholzer, “Super-resolution photoacoustic microscopy using blind structured illumination,” Optica 4(1), 17–22 (2017). [CrossRef]  

28. D. C. Liu and J. Nocedal, “On the limited memory BFGS method for large scale optimization,” Math. Programming 45(1-3), 503–528 (1989). [CrossRef]  

29. M. Schmidt, “minFunc: unconstrained differentiable multivariate optimization in Matlab,” (2005).

30. E. Soubies, F. Soulez, M. T. McCann, T. Pham, L. Donati, T. Debarre, D. Sage, and M. Unser, “Pocket guide to solve inverse problems with GlobalBioIm,” Inverse Prob. 35(10), 104006 (2019). [CrossRef]  

31. T. Pham, E. Soubies, A. B. Ayoub, J. Lim, D. Psaltis, and M. Unser, “Three-dimensional optical diffraction tomography with Lippmann-Schwinger model,” IEEE Trans. Comput. Imaging 6, 727–738 (2020). [CrossRef]  

32. L. Yeh, S. Chowdhury, N. A. Repina, and L. Waller, “Speckle-structured illumination for 3D phase and fluorescence computational microscopy,” Biomed. Opt. Express 10(7), 3635–3653 (2019). [CrossRef]  

33. A. Negash, T. Mangeat, P. C. Chaumet, K. Belkebir, H. Giovannini, and A. Sentenac, “Numerical approach for reducing out-of-focus light in bright-field fluorescence microscopy and superresolution speckle microscopy,” J. Opt. Soc. Am. A 36(12), 2025–2029 (2019). [CrossRef]  

34. Y. Hu, C. Li, K. Meng, J. Qin, and X. Yang, “Group sparse optimization via lp,q regularization,” Journal of Machine Learning Research 18, 1–52 (2017).

35. J. W. Goodman, Speckle Phenomena in Optics: Theory and Applications (Roberts and Company Publishers, 2007).

36. H. H. Barrett and K. J. Myers, Foundations of Image Science (John Wiley & Sons, 2013).



Figures

Fig. 1. Reconstructed object with 300 speckle patterns and 40 dB Gaussian noise by previously reported methods. The green solid lines (resp. red dashed lines) correspond to spatial frequencies transmitted by the OTF support (resp. 2 times the OTF support), and the distance unit in the images represents the wavelength $\lambda$.
Fig. 2. Reconstructed object with 300 speckle patterns and 40 dB Gaussian noise by minimizing the $\ell_{p,q}$ norm of the unconstrained form. Estimator (5) is marked by ‘std’ and (4) by ‘mean’, respectively.
Fig. 3. Normalized RAPS of the reconstructed objects obtained by averaging $\mathbf{q}_m$ in RIM with different regularizers.
Fig. 4. Reconstructed object with 300 speckle patterns and 40 dB Gaussian noise with a fixed (lena) background.
Fig. 5. Reconstructed images using the $\ell_{2,1}$ regularizer with estimator (5) and different hyperparameters. All simulations are performed on the same measured dataset $\mathbf{Y}$ generated with 300 speckle patterns and 40 dB Gaussian noise. The images in the first line are obtained by solving (7), while those in the second line correspond to (6).
Fig. 6. Reconstruction using various numbers of speckle patterns and 40 dB Gaussian noise with the $\ell_{2,1}$ prior.
