From Fienup’s phase retrieval techniques to regularized inversion for in-line holography: tutorial

Fabien Momey; Loïc Denis; Thomas Olivier; Corinne Fournier

doi:10.1364/JOSAA.36.000D62

1. INTRODUCTION

The imaging of samples at macro, micro, and nano scales is important in many fields of research, from physics (fluid mechanics, materials) to biology (cells, bacteria, viruses). Imaging remains challenging, and advanced techniques are still emerging. These new techniques often exploit physical principles involved in light–matter interactions from a variety of radiation sources (x-rays, electrons, visible or near-visible light, lasers). In this context, imaging techniques aim to record the perturbation of the incident electromagnetic wave by the sample of interest. The perturbed wave depends on both the absorption and phase-shift properties of the sample. Thus, being able to precisely measure the shape of this complex wave—the amplitude and phase of the electromagnetic field in space and time—is crucial to fully quantitatively characterize the sample.

Holography, invented by Gabor [1], is a technique to reconstruct the complex wave (amplitude and phase) due to the diffraction of light when a scattering sample is illuminated by a coherent source. This principle gave rise to a wide variety of imaging techniques (off-axis holography, phase shifting, in-line holography, diffractive tomography, x-ray diffractive microscopy, etc.) [2]. Since only intensity measurements of light are available, the problem of retrieving the phase on the sensor has fueled many studies. In contrast to interferometric setups such as off-axis or phase-shifting holography, iterative phase retrieval techniques are algorithms that estimate the phase of light in the plane of in-line holograms.

Digital hologram reconstruction is typically performed by backpropagating the hologram from the sensor plane to the object plane. In the absence of phase information with in-line holograms, the reconstruction suffers from artifacts called the twin image. To suppress the twin image, phase retrieval techniques were introduced in the 1970s and 1980s, and since then, improved algorithms have regularly been described in the literature. Most phase retrieval techniques are still derived from the methods of alternating projections initially proposed by Gerchberg and Saxton [3] and popularized and extended by Fienup [4,5]. This class of methods is still widely used today [6–13], with improvements to enforce a priori knowledge (support of the objects, admissible values domain, sparsity constraints) [14–16].

Inverse problems approaches take a different point of view from phase retrieval techniques: rather than recovering the phase on the sensor plane (which does not completely solve the sample reconstruction problem), they focus on the reconstruction of the complex-valued transmittance in the object plane. Because of measurement noise and hologram truncation at the borders of the sensor, backpropagating the complex amplitude in the sensor plane does not lead to perfect sample reconstruction, i.e., restoring the sensor phase and reconstructing the complex wave in the sample plane are not strictly equivalent problems.

Figure 1 gives an overview of the different ingredients of the inverse problems methodology in the context of holographic imaging. The “direct model” relates a description of the “parameters of interest” (i.e., the sample) and the “measurements” (i.e., the hologram), taking into account the “instrumental parameters” (i.e., the imaging setup). The sample may be described as a complex-valued transmittance plane [17], a volume of transmittance planes [18], or as a collection of objects [19]. Depending on the application, the sample is absorbing, and/or translucent (it induces a phase shift due to a refractive index difference with respect to the surrounding medium). It is described by the spatial distribution of the absorption, phase shift, or complex refractive index within a plane or a volume. When individual objects are considered, their 3D location, size, and complex refractive index are the parameters that characterize the objects. Any appropriate optical model can then be used to describe how an incident illumination wave gets diffracted by the sample and propagates to the sensor: diffraction by a plane (Rayleigh–Sommerfeld, Fresnel approximation, angular spectrum) [20], diffraction by a volume (Born approximation, Rytov approximation) [21], diffraction by a sphere (Mie scattering) [22], or free-space propagation. Multiple holograms can be recorded to improve information diversity by including lateral or axial shifts of the sensor [16,23], multiple illumination wavelengths [24], or multiple illumination angles [25,26]. When a suitable model of the observations has been defined, a “reconstruction” (or “estimation”) method can be selected. If the sample is described by a spatial distribution within a plane or a volume, the problem amounts to an “image or volume reconstruction.” If a collection of objects is to be identified, the geometrical and optical parameters of each object have to be estimated (“parameter estimation”). “Image or volume reconstruction” problems are generally solved by regularized inversion [27–37], i.e., by solving a minimization problem whose objective function is defined by a data-fitting term that penalizes the deviation of the model from the data. Regularization terms are added to favor reconstructions with desirable properties (e.g., smooth images, sharp edges, zero-valued background). Additional (hard) constraints are also enforced to guarantee that the reconstruction fulfills basic physical properties such as the positivity of the absorption or the sign of the phase shift induced by the objects (e.g., thin objects with an optical index larger than that of the surrounding medium induce a positive phase shift). Depending on the type of cost function to minimize [quadratic, nonlinear but differentiable (smooth), non-differentiable (non-smooth), non-convex], different strategies can be used. “Parameter estimation” of a collection of objects generally follows a different path because the reduced number of unknowns makes the regularization unnecessary. Rather than considering a fixed discretization of space, the 3D location of each object is generally sought in a continuous domain (i.e., subpixel location). Objects are often detected and characterized one after another, and the measurements are described by the superimposition of the diffraction patterns due to each object [19,38]. Nonlinear least-squares minimization techniques are applied to estimate the geometrical and optical parameters of each object. In the case where the data contain some signatures that cannot be fitted by the model (outliers), e.g., out-of-field objects, the estimations can be improved by using a robust signal processing approach such as the reweighted least squares [39].

Fig. 1. Hologram reconstruction based on inverse problems approaches: the direct model connects the sample of interest to the measurements (the holograms); inverting this model leads to a reconstruction and/or estimation of the optical parameters of the sample. The case chosen as an illustration in this paper is highlighted in yellow.

Download Full Size | PDF

The hologram formation model depends on “instumental parameters” such as the wavelength and coherence of the source, the sample-to-sensor distance, or the sensor response (fill-factor and spectral sensitivity). The model of the data not only underlies the reconstruction algorithm, but also provides a way to calibrate the experimental parameters.

The goal of this paper is to illustrate the inverse problems methodology and to draw connections with the popular phase retrieval techniques based on Fienup’s approach. We selected a problem that can be addressed using either approach: the reconstruction of an in-line hologram of a non-absorbing plane sample that induces spatially varying phase shifts. We discuss the advantages provided by the inverse problems framework and interpret Fienup’s as an instance of a proximal gradient algorithm applied to a specific cost function.

The paper is organized as follows: next, we present the phase retrieval problem in the context of in-line digital holography. We then present the standard alternating projections strategy proposed by Gerchberg and Saxton, Fienup, and subsequent variants. In Section 4, we present the inverse problems methodology applied to the reconstruction of in-line holograms of phase-only samples. We validate and compare the algorithms in Section 5, first in numerical simulations, then on experimental in-line holograms of microscopic beads. Finally, we discuss the possible extensions of inverse problems approaches for phase retrieval.

2. THE INVERSE PROBLEM OF IMAGE RECONSTRUCTION IN IN-LINE HOLOGRAPHY

The object of interest, i.e., the sample, can be modeled as a two-dimensional (2D) complex transmittance plane $ \underline t ({\boldsymbol r}) $ [$ {\boldsymbol r} = (x,y) $ is the vector of 2D spatial coordinates, and all complex-valued variables are underlined in this paper]. A transmittance $ \underline t ({\boldsymbol r}) = 1 $, $ \forall \,\,{\boldsymbol r} $ means a fully transparent plane that leaves any illumination wave unperturbed after passing through it. Consequently, the presence of scattering objects in this plane will be revealed by a certain deviation $ \underline o ({\boldsymbol r}) $ from a unit transmittance:

(1)$$\underline t ({\boldsymbol r}) = 1 + \underline o ({\boldsymbol r}) = \rho ({\boldsymbol r}){e^{i\varphi ({\boldsymbol r})}}.$$

$ \underline o ({\boldsymbol r}) $ is the quantity of interest. It models the distribution of the phase $ \varphi ({\boldsymbol r}) $ and amplitude $ \rho ({\boldsymbol r}) $, directly linked to the complex refractive index difference between the observed objects (beads, droplets, cells, etc.) and the medium.

Digital in-line holography (DIH) (cf. Fig. 2) consists of illuminating a sample plane with a coherent (generally plane) wave $ {\underline a _0}({\boldsymbol r}) $ of wavelength $ \lambda $, and recording on a digital sensor the intensity $ I({\boldsymbol r}) $ of the total diffracted wave $ {\underline a _z}({\boldsymbol r}) $ at a distance $ z $ from the sample plane (the sensor plane being parallel to the sample plane). From the Huygens–Fresnel principle, the physical data acquisition process is written as

(2)$$\begin{split}\!\!\!\!\!I({\boldsymbol r})=\left|\iint_{\mathbb R^{2}}\underline h_z ({\boldsymbol r}-{\boldsymbol r^{\prime}})\underline a_0({\boldsymbol r^\prime})\underline t ({\boldsymbol r}^\prime)\text{d}{{\boldsymbol r}^\prime} \right|^2 = |\underline h_z {\mathop *\limits_{\boldsymbol r}}(\underline a_0 \cdot \underline t)|^2 ,\end{split}$$

where $ * $ is a convolution operator and $ {\underline h _z} $ is the propagation kernel. Depending on the approximations considered, the Rayleigh–Sommerfeld and the Fresnel kernels are used as $ {\underline h _z} $ [20]. In this paper, we use the Fresnel kernel (paraxial approximation), which is written as

(3)$${\underline h _z}({\boldsymbol r}) = \frac{1}{{i\lambda z}}\exp \left( {\frac{{i\pi }}{{\lambda z}} {\parallel}{\boldsymbol r}{\parallel}^2} \right).$$

Fig. 2. In-line holography principle: a plane wave illuminates the sample plane with normal incidence. The presence of inhomogeneity in the spatial distribution of the complex-valued refractive index, in the sample plane, distorts the illumination wave. A diffraction pattern is recorded on the sensor plane.

Download Full Size | PDF

The backpropagation kernel $ {\underline h _{ - z}} $ can be used to recover the original complex wave from a diffracted (i.e., propagated) wave.

Considering Eq. (1), and under the assumption that the incident wave is plane ($ {\underline a _0}({\boldsymbol r}) = {\underline a _0} $), Eq. (2) becomes

(4)$$\begin{split}I({\boldsymbol r}) = |\underline a_0|^2 |1+ \underline h_z {\mathop *\limits_{\boldsymbol r}} \underline o|^2 = | \underline a_0|^2 {\underbrace {(1+2 \mathfrak{Re} (\underline h_z {\mathop *\limits_{\boldsymbol r}} \underline o)+|\underline h_z {\mathop *\limits_{\boldsymbol r}} \underline o|^2)}_{\text{model}\, m (\underline o)}} .\end{split}$$

The intensity $ I({\boldsymbol r}) $ is recorded by a digital sensor. Therefore, effective intensity measurements correspond to sampled (and noisy) intensity values that can be collected in the form of a column vector $ {\boldsymbol d} = ( {{d_q}} )_{q = 1 \ldots M}^{T} $ with $ M $ data measurements. This also leads to considering the targeted physical quantity $ \underline o $ as a vector $ \underline {\boldsymbol o} = ( {{x_q}} )_{q = 1 \ldots N}^{^{T}} $ of $ N $ unknowns. In imaging systems, $ N $ is often equal to $ M $, as we aim to recover an image of the same size as the hologram, i.e., the field of view. In the context of inverse approaches, the unknown field of view of $ \underline {\boldsymbol o} $ can be extended, leading to $ N \gt M $ (cf. Section 6). We denote $ {\mathbb D} $ and $ {\mathbb O} $, respectively, as the feasible domains of $ {\boldsymbol d} $ and $ \underline {\boldsymbol o} $, i.e., the domains of valid values for each quantity. From Eq. (4), $ {\mathbb D} \subset {{\mathbb R}^M} $ and $ {\mathbb O} \subset {{\mathbb C}^N} $. As discussed in Section 1, in order to ensure physically relevant properties of the object of interest, the feasible domain $ {\mathbb O} $ can be restricted, for example, to impose a positive absorption or the sign of the phase shift. Such properties are applied in the regularized inverse approach developed in our case study in Section 4.B.

The algebraic process that “transforms” $ \underline {\boldsymbol o} $ into $ {\boldsymbol d} $ is modeled by the mapping $ {\boldsymbol m} $, which goes from $ {\mathbb D} $ to $ {\mathbb O} $, and is a discretized version of the physical—analytic—formulation $ m $ in Eq. (4). Finally, we get the following algebraic formulation of the image formation model:

(5)$${\boldsymbol d} = c {\boldsymbol m}(\underline {\boldsymbol o} ) + {\boldsymbol \eta } = c |\textbf{1} + {\underline{\textbf{H}} _z} \underline{\boldsymbol o} {|^2} + {\boldsymbol \eta },$$

where the square modulus operator $ |{\boldsymbol x}{|^2} $ on a vector $ {\boldsymbol x} $ is applied component-wise, leading to an $ M $-dimensional vector, and $ \textbf{1} $ is a vector of 1 of size $ M $. $ {\boldsymbol \eta } = ( {{\eta _q}} )_{q = 1 \ldots M}^{T} $ corresponds to a vector of $ M $ noise values to model electronic noise in the measurement system, and mathematical approximations involved in the model, etc. $ {\underline{\textbf{H}} _z} $ is the complex-valued convolution operator that calculates the discrete propagation of the object $ \underline {\boldsymbol o} :{\underline{\textbf{H}} _z} $ is a $ M \times N $ matrix, such that $ {[{\underline{\textbf{H}} _z} \underline{\boldsymbol o} ]_q} = {[{\underline{h}_z} * \underline{o}]_q} $ for all pixels $ q $. The Hermitian transpose of $ {\underline{\textbf{H}} _z} $, written $ \underline{\textbf{H}}_z^{\dagger} $, corresponds to the backpropagation operator $ {\underline{\textbf{H}}_{ - z}} $. Because of sampling and field truncation at the border of the images, the backpropagation does not exactly invert the propagation, yet the approximation $ \underline{\textbf{H}} _z^{\dagger} {\underline{\textbf{H}}_z} \approx \textbf{I} $ is often used to get an intuitive understanding of holographic reconstruction. Finally, $ c $ is a scalar scaling factor that accounts for the intensity of the incident wave $ |{\underline{a} _0}{|^2} $ as well as the detector gain and quantum efficiency.

Retrieving the unknown image of the sample $ \underline{\boldsymbol o} $, denoted $ {\underline{\boldsymbol o} ^*} $, from the observed hologram $ {\boldsymbol d} $, based on the hologram formation model in Eq. (5), constitutes the inverse problem of holographic reconstruction.

3. FIENUP’S ALTERNATING PROJECTIONS STRATEGIES

The standard alternating projections strategy, still widely used to date, was first proposed by Fienup [4] as the error-reduction (ER) algorithm. It is an upgraded variant of the initial method proposed by Gerchberg and Saxton [3] from two-plane intensity measurements.

In Ref. [5], Fienup has proposed several variants of the ER algorithm, such as the basic input–output (BIO) or the hybrid input–output (HIO), involving a parameter $ \beta $ in the “projection on data” step that relaxes the strict projection on the feasible domain. These variants are known to be more efficient than the classical ER strategy (HIO is considered as the most efficient). In this paper, we focus on the classical approach ER, which we find the most intuitive way to understand the alternating projections strategy. We show in Section 4.C.3 that this algorithm can be reformulated to fit the inverse problems framework. Thus, our goal is not an exhaustive comparison of inverse approaches with all variants of the alternating projections strategy, but rather to give an “inverse problems” interpretation of Fienup’s strategy and show that inverse problems offer a framework that is accessible, yet more flexible and powerful than alternating projections.

Algorithm 1 summarizes the Fienup ER strategy. Details of implementation are given in Appendix A, in Algorithm 3.

Fienup’s ER algorithm requires only one intensity measurement image $ {\boldsymbol d} $ at the sensor plane at distance $ z $. This measurement image has to be normalized so that the background equals 1, to ensure that the retrieved transmittance plane $ \underline{\boldsymbol o} $ is normalized the same way ($ \underline{\boldsymbol o} = \textbf{1} $ stands for a fully transparent plane; see Section 2), and that appropriate physical constraints can be applied. The normalized data image, denoted $ {\boldsymbol{ \bar d}} $, can be obtained, for example, by dividing $ {\boldsymbol d} $ by its mean or its median (almost valid in the case of a low-density distribution of objects).

Algorithm 1. Fienup’s ER algorithm [4]

View Table | View all tables in this article

The principle is to iteratively alternate the following steps starting from an estimate of the object $ {\underline{\boldsymbol o} ^{(i)}} $ at the sample plane to get a new estimate $ {\underline{\boldsymbol o} ^{(i + 1)}} $:

• The estimate $ {\underline{\boldsymbol o} ^{(i)}} $ is propagated to the sensor plane to get a simulated complex wave $ \underline{\boldsymbol a} _z^{(i + {1}/{2})} $ corresponding to the total diffracted wave in this plane.
• At the sensor plane, a “projection on data” step forces the amplitude of $ \underline{\textbf{a}}_z^{(i + {1}/{2})} $ to match the square root of the normalized measurement vector $ {\boldsymbol{\bar d}} $ (projection on a non-convex set), to get a modified wave $ \underline{\boldsymbol a} _z^{(i + 1)} $.
• The modified wave $ \underline{\boldsymbol a} _z^{(i + 1)} $ is backpropagated to the object plane to get a new estimate $ {\underline{\boldsymbol o} ^{(i + {1}/{2})}} $.
• At the sample plane, $ \mathfrak{Re}({\underline{\boldsymbol o} ^{(i + {1}/{2})}}) $ and $ \mathfrak{Jm}({\underline {\boldsymbol o} ^{(i + {1}/{2})}}) $ are forced to be positive or negative quantities (depending on the hypothesis concerning the object of interest), and/or forced to be zero outside a restricted support. This step, which leads to the new estimate $ {\underline {\boldsymbol o} ^{(i + 1)}} $, can be summed up as a projection step $ {{\cal P}_{\mathbb O}}({\underline {\boldsymbol o} ^{(i + {1}/{2})}}) $ on the feasible domain $ {\mathbb O} \subset {{\mathbb C}^N} $ (usually a convex set).

This approach is based on the approximation $ \underline{\textbf{H}} _z^\dagger {\underline{\textbf{H}}_z} \approx \textbf{I} $ that makes it possible to go back and forth between the sample plane and the sensor plane. Since $ \underline{\textbf{H}} _z^\dagger $ is not the exact inverse of $ {\underline{\textbf{H}} _z} $, there are strong limitations related to sampling and truncation effects (e.g., objects on the border of the field of view are inevitably distorted). If the algorithm is stopped after several iterations and the sensor plane constraint applied, the data are augmented by the phase information (i.e., the lost phase on the sensor plane is retrieved by the algorithm). Due to this restored phase, the twin-image phenomenon typical of in-line holography is strongly reduced.

The great popularity of this class of algorithms is due to the fact that the strategy is very intuitive as well as being easy to implement. Likewise, these algorithms have been proven to converge to a local minimum [40–42]. Such approaches have been exploited mainly in the fields of x-ray coherent diffraction imaging [7,43–46] and digital holographic microscopy [29,47,48].

As a result, research in this field is still active and new approaches are often proposed. In the past 3 decades, it has benefited from many practical improvements [6,10,11,13,49] and theoretical studies [9,41,50]. Recent approaches include new constraint enforcement strategies using sparsity constraints in the object or data spaces [51,52] or in the wavelets domain [16,53].

4. INVERSE PROBLEMS METHODOLOGY FOR THE RECONSTRUCTION OF IN-LINE HOLOGRAMS: A TUTORIAL

A. Linearized Image Formation Model

The first step of the inverse problems methodology is to derive a suitable image formation model, by analyzing the object and data characteristics. Here, we describe a linearized model based on two hypotheses: (i) the sample is a distribution of purely and weakly dephasing objects [$ \rho ({\boldsymbol r}) = 1 $; see Eq. (1)], such that

(6)$$\underline{t} ({\boldsymbol r}) = 1 + \underline{o} ({\boldsymbol r}) = {e^{i\varphi ({\boldsymbol r})}} \approx 1 + i\varphi ({\boldsymbol r}) = 1 + io({\boldsymbol r});$$

thus, we define a new unknown image $ o({\boldsymbol r}) $ that is real and stands for the phase image $ \varphi ({\boldsymbol r}) $. (ii) We neglect the second-order term in Eq. (4), leading to the following approximated real and linear image formation model:

(7)$$\tilde m (o)=1-2 \mathfrak{Jm}(\underline{h}_z)* o.$$

The new discrete image formation model is written as

(8)$${\boldsymbol d} = c {\boldsymbol{ \tilde m}}({\boldsymbol o}) + {\boldsymbol \eta }\quad \text{with}\quad {\boldsymbol{\tilde m}}({\boldsymbol o}) = \left( {\textbf{1} + {\textbf{G}_z}{\boldsymbol o}} \right),$$

where $ {\textbf{G}_z}= -2 \,{\mathfrak{Jm}} ({\underline{\textbf{H}}}_z)$ is a linear propagation operator. It is a real-valued convolution operator: $ {\textbf{G}_z} $ is still a $ M \times N $ matrix, such that $ {[{\textbf{G}_z} {\boldsymbol o}]_q} = {[{a_z}]_q} = {[ - 2\, {\mathfrak{Jm}} ({\underline{h} _z}) * o]_q} $ for all pixels $ q $. We note the convolution kernel $ g_z = -2\, \mathfrak{Jm} (\underline{h}_z)$. As it is real and even, the operator $ {\textbf{G}_z} $ is symmetric. Therefore, $ {\textbf{G}_z}^{T} = {\textbf{G}_z} $. The noise $ {\boldsymbol \eta } $ now also includes modeling errors due to the above-mentioned approximations. Model approximations are very often considered because of their computational interest in leading to tractable mathematical and numerical resolution strategies. The inverse problem is now fully real-valued (model $ {\boldsymbol{ \tilde m}} $ and unknown $ {\boldsymbol o} $) and constitutes a simple case study to pedagogically illustrate the basic ingredients and tools required to build a relevant inverse problems approach and draw a parallel to Fienup’s alternating projections approaches. It is of course also possible to consider nonlinear direct models and both the attenuation and phase shift induced by the sample. An example of such an approach can be found in [17].

B. Building a Regularized Inverse Approach

First, we recall the definition of the $ {l_2} $ and $ {l_1} $ norms applied to a vector $ {\boldsymbol x} $: $ ||{\boldsymbol x}{||_2} = \sqrt {\sum\nolimits_q x_q^2} $ and $ ||{\boldsymbol x}{||_1} = \sum\nolimits_q |{x_q}| $. The notation $ ||{\boldsymbol x}{||_{\textbf{W}}} $, where $ \textbf{W} $ is a positive definite matrix, corresponds to $ ||{\boldsymbol x}||_{\textbf{W}}^2 = {{\boldsymbol x}^{T}}\textbf{W}{\boldsymbol x} $.

1. Data Fidelity

An inverse approach consists of retrieving an optimal solution $ {{\boldsymbol o}^*} $ to Eq. (8), knowing an approximate numerical model $ {\boldsymbol{\tilde m}}({\boldsymbol o}) $ of the data formation process.

To do so, we define a calculable metric $ {{\cal J}_{\textsf{fid}}}({\boldsymbol o},{\boldsymbol d}) $ that evaluates the error between the data $ {\boldsymbol d} $ and the model $ {\boldsymbol{ \tilde m}}({\boldsymbol o}) $ obtained from a guessed $ {\boldsymbol o} $. This error term $ {{\cal J}_{\textsf{fid}}}({\boldsymbol o},{\boldsymbol d}) $, also called the data-fidelity term, acts as a penalization: the larger the error, the farther the guess $ {\boldsymbol o} $ is from the solution and the more it has to be corrected to reach a better estimate.

The most popular data-fidelity term is the weighted least-squares criterion, which derives from the assumption of Gaussian errors. The criterion corresponds to the $ \textbf{W} $-weighted squared $ {l_2} $ norm of the differences between $ {\boldsymbol d} $ and $ {\boldsymbol{ \tilde m}}({\boldsymbol o}) $, which is written as

(9)$$\begin{split}{{\cal J}_{\textsf{fid}}}(c,\boldsymbol{ o},{\boldsymbol d}) &= \left\| {c \boldsymbol{ \tilde m}({\boldsymbol o}) - {\boldsymbol d}} \right\|_{\textbf{W}}^2\\ &= {\left( {c \boldsymbol{ \tilde m}({\boldsymbol o}) - {\boldsymbol d}} \right)^{T}} \textbf{W} \left( {c \boldsymbol{ \tilde m}({\boldsymbol o}) - {\boldsymbol d}} \right)\\ &= \sum\limits_q {w_q}{\left( {{{\tilde m}_q}({\boldsymbol o}) - {d_q}} \right)^2} \quad (\text{if errors are uncorrelated,i.e.},\textbf{W}\,\text{is diagonal})\\ &= \sum\limits_q {w_q}{( {1 + {{[{g_z} * o]}_q} - {d_q}} )^2},\end{split}$$

where $ {\tilde m_q}({\boldsymbol o}) $ is the $ q $-th pixel of the model $ {\boldsymbol{ \tilde m}}({\boldsymbol o}) $, and $ {d_q} $ is the $ q $-th pixel of the hologram. $ {w_q} $ ($ q $-th element of the diagonal of matrix $ \textbf{W} $) is a weight associated with pixel $ q $ on the sensor. In imaging problems, this allows to avoid unmeasured pixel data $ {y_q} $ in the field of view, by setting to zero the corresponding weight $ {w_q} $. As discussed later (see Section 6), these unmeasured pixels could be those outside the sensor field of view, when out-of-field objects have to be taken into account because of their contribution to the recorded hologram. {$ \textbf{W} $ can be viewed from a statistical point of view: if the noise statistics is known a priori, $ \textbf{W} $ can be set to the inverse of the noise covariance matrix $ {\textbf{C}_{\boldsymbol \eta }} = {\mathbb E} [ {{\boldsymbol \eta } {{\boldsymbol \eta }^{T}}} ] $ (where $ {\mathbb E} [ \cdot ] $ stands for the expectation) [54]. In this case, the weighted least-squares criterion corresponds to the co-log-likelihood, where the noise follows Gaussian statistics. Under the additional assumption of an uncorrelated noise, $ \textbf{W} $ is a diagonal matrix, and the diagonal elements $ {w_q} $ correspond to the inverse of the variance at the pixel $ {y_q} $ [the larger the variance of the noise in a given pixel $ {y_q} $, the smaller the weight assigned to this pixel in the criterion Eq. (9)]. The maximum likelihood estimation theory suggests minimizing the weighted least squares under a Gaussian assumption [55].}

To solve the inverse problem, the objective is to find the solution $ {{\boldsymbol o}^*} $ that minimizes the criterion Eq. (9), thus the error between the data $ {\boldsymbol d} $ and the model $ {\boldsymbol{\tilde m}}({{\boldsymbol o}^*}) $. Such a minimization problem is written as

(10)$$\{ {{\boldsymbol o}^*},{c^*}\} = \mathop { \text{argmin}}\limits_{{\boldsymbol o},c} \quad {{\cal J}_{\textsf{fid}}}(c,{\boldsymbol o},{\boldsymbol d})$$

and is the standard formulation of an inverse problem. In this formulation of our particular problem, the optimal solution also depends on the scalar scaling factor $ c $. The optimal value for $ c $ can be obtained in closed form:

(11)$${c^*}({\boldsymbol o}) = \frac{{{\boldsymbol{ \tilde m}}{{({\boldsymbol o})}^{T}}\textbf{W}{\boldsymbol d}}}{{{\boldsymbol{ \tilde m}}{{({\boldsymbol o})}^{T}}\textbf{W}{\boldsymbol{ \tilde m}}({\boldsymbol o})}} .$$

If matrix $ \textbf{W} $ is diagonal, the expression of $ {c^*} $ takes a simpler form:

(12)$${c^*}({\boldsymbol o}) = \frac{{\sum\nolimits_q {{w_q}{{\tilde m}_q}({\boldsymbol o}){d_q}} }}{{\sum\nolimits_q {{w_q}{{\tilde m}_q}{{({\boldsymbol o})}^2}} }} .$$

We can then solve the following problem with regard to $ {\boldsymbol o} $:

(13)$${{\boldsymbol o}^*} = \mathop {\text{arg min}}\limits_{\boldsymbol o} \quad {{\cal J}_{\textsf{fid}}}({c^*}({\boldsymbol o}),{\boldsymbol o},{\boldsymbol d}).$$

We discuss how this minimization problem can be addressed in Section 4.C.2.

2. Constraints and Regularizations

The minimization problem Eq. (10) is not sufficient to obtain a satisfactory solution because measurements are noisy, and there are too many unknowns for a reliable estimation based only on data fitting. As a result, unsatisfactory reconstructions can be found that perfectly match the data, i.e., solutions of the problem where the noise is also fitted by the model.

These issues are the characteristics of what is called an ill-posed problem. To overcome these limitations, the resolution of the problem cannot be achieved by considering only the data-fidelity term Eq. (10). One has to find a solution $ {{\boldsymbol o}^*} $ that best fits a particular prerequisite on the targeted information: one has to enforce some prior knowledge about the unknown $ {\boldsymbol o} $.

Such prior knowledge can be injected in the minimization problem Eq. (10), taking the form of hard and/or soft constraints. The new minimization problem is written as

(14)$${{\boldsymbol o}^*} = \mathop {\text{arg min}}\limits_{{\boldsymbol o} \in {\mathbb O}} \quad {{\cal J}_{\textsf{fid}}}({c^*}({\boldsymbol o}),{\boldsymbol o},{\boldsymbol d}) + {{\cal J}_{\textsf{reg}}}({\boldsymbol o},{\boldsymbol \theta }),$$

where $ {\boldsymbol o} \in {\mathbb O} $ and $ {{\cal J}_{\textsf{reg}}}({\boldsymbol o},{\boldsymbol \theta }) $ correspond to the enforced prior knowledge, which take, respectively, the form of hard and soft constraints; see, e.g., [55,56].

$ {\boldsymbol o} \in {\mathbb O} $ imposes the feasible domain of values $ {\mathbb O} $ for the estimate $ {\boldsymbol o} $. In many inverse problems involving real quantities, a relevant constraint is to enforce positivity. This is, for example, the case in positron emission tomography (PET), where the quantity of interest is a photon count [57].

$ {{\cal J}_{\textsf{reg}}}({\boldsymbol o},{\boldsymbol \theta }) $ are soft constraints imposed as a combination of regularization terms, parameterized by the set of hyperparameters $ {\boldsymbol \theta } $ that balance each term with respect to the data-fidelity term. In general, standard regularizations aim to smooth the estimate $ {\boldsymbol o} $ to filter high frequencies due to noise. In the following, we present three popular regularizations in image processing. Figure 3 illustrates their behavior on reconstructions, using a regularized inverse problems approach, of the simulated hologram that will be used in Section 5 for further demonstrations. More details on the method, the simulation, and reconstruction parameters are also provided in Section 5.

Fig. 3. Illustration of the behavior of several regularization terms on reconstructions (in a region of interest) of a simulated hologram (a) using a regularized inverse problems approach. Details on the method, the simulation, and reconstruction parameters are given in Section 5 and Tables 1 and 2. (a) Data (in-line hologram). The yellow frame indicates the region of interest that is extracted from the field of view for visualization. (b) Ground truth phase. (c)–(f) Reconstructed images using the following regularizations: (c) squared $ {l_2} $ norm of the gradient $ {{\cal J}_{{{l}_{2}}\nabla }} $ [Eq. (15)], (d) edge-preserving smoothness $ {{\cal J}_{{{\text{TV}}_ \epsilon }}} $ [Eq. (16)], (e) $ {l_1} $ sparsity constraint $ {{\cal J}_{{l_1}}} $ [Eq. (17)], and (f) composition of the $ {l_1} $ sparsity constraint and edge-preserving smoothness $ {{\cal J}_{\textsf{reg}}} = {{\cal J}_{{l_1}}} + {{\cal J}_{{{\text{TV}}_ \epsilon }}} $ [Eq. (18)]. Red curves show a line profile passing through two particles of the image.

Download Full Size | PDF

A possible strategy is to minimize the squared $ {l_2} $ norm of the gradient image $ \nabla {\boldsymbol o} $:

(15)$${{\cal J}_{{l_2}\nabla }}({\boldsymbol o},\mu ) = \mu \mathop {\sum }\limits_q ||{\nabla _q}{\boldsymbol o}||_2^2,$$

where the gradient operator $ {\nabla _q} $ corresponds to the finite difference operator at pixel $ q $ (a 2D vector with the differences in $ x $ and $ y $ directions). However, this solution tends to oversmooth the sharp features of the estimate $ {\boldsymbol o} $ [cf. Fig. 3(c)] and is consequently not suitable for images with almost sharp edges. To overcome this limitation, more sophisticated regularizations can enforce piecewise smoothness, i.e., edge preservation. This is the targeted behavior of the very popular total variation (TV) [58], which enforces sparsity of the gradient image by minimizing the sum of the $ {l_2} $ norm of the gradient vector at each pixel. This regularization favors piecewise continuous images, i.e., sharp edges. Because it promotes the appearance of flat areas, which is not desired in cases where smooth variations are expected, it is often replaced by a generalized version, also known as “edge-preserving smoothness” [59]:

(16)$${{\cal J}_{{{\text{TV}}_ \epsilon }}}({\boldsymbol o},\mu , \epsilon ) = \mu \mathop {\sum }\limits_q \sqrt {||{\nabla _q}{\boldsymbol o}||_2^2 + { \epsilon ^2}} ,$$

where $ \epsilon $, which must be different from zero to ensure the differentiability of the criterion, tunes the regularization behavior. For values of the gradient norm much higher than $ \epsilon $, the regularization acts as TV (preservation of sharp edges), while for much lower values, the regularization is almost quadratic [it almost acts as the regularizer of Eq. (15)], and the estimate $ {\boldsymbol o} $ will be smoothed [cf. Fig. 3(d)]. Thus, this criterion is more flexible and can impose more natural constraints than TV.

Direct sparsity of the image can also be enforced. This strategy is very often used to favor a low-density distribution of objects in the image. This is implemented by minimizing the separable $ {l_1} $ norm of the image:

(17)$${{\cal J}_{{l_1}}}({\boldsymbol o},\mu ) = \mu {\left\| {\boldsymbol o} \right\|_1} = \mu \mathop {\sum }\limits_q |{o_q}|.$$

This type of regularization limits the size of the objects’ support [29] by favoring reconstructions with pixels at zero. Indeed, looking at Fig. 3(e), we clearly see that the objects of interest constitute the only signal reconstructed, while the remaining background is set to zero. Note that when $ {\boldsymbol o} $ is constrained to be nonnegative, the above equation is just $ \mu $ times the sum of the values in $ {\boldsymbol o} $, hence a linear term in $ {\boldsymbol o} $ (see Fig. 5).

These regularizations can be mixed to benefit from each of their advantages [cf. Fig. 3(f)], leading $ {{\cal J}_{\textsf{reg}}}({\boldsymbol o},{\boldsymbol \theta }) $ in Eq. (14) to be a combination of some of these terms. This increased flexibility comes at the cost of a more difficult tuning of the reconstruction algorithms because of the increase in the number of hyperparameters. For example, in Fig. 3(f) and reconstruction experiments in Section 5, we use a combination of a separable sparsity prior $ {{\cal J}_{{l_1}}} $ [Eq. (17)] and an edge-preserving regularization $ {{\cal J}_{{{\text{TV}}_ \epsilon }}} $ [Eq. (16)], leading to the following expression for the global regularization term $ {{\cal J}_{\textsf{reg}}}({\boldsymbol o},{\boldsymbol \theta }) $:

(18)$${{\cal J}_{\textsf{reg}}}({\boldsymbol o},{\boldsymbol \theta }) = {{\cal J}_{{l_1}}}({\boldsymbol o},{\mu _{{l_1}}}) + {{\cal J}_{{{\text{TV}}_ \epsilon }}}({\boldsymbol o},{\mu _{\text{TV}}},{ \epsilon _{\text{TV}}}),$$

with $ {\boldsymbol \theta } = \{ {\mu _{{l_1}}},{\mu _{\text{TV}}},{ \epsilon _{\text{TV}}}\} $, the set of all tuning hyperparameters.

C. Case Study: Solving a Standard Sparsity-Based Problem

1. Problem

Following the methodology presented in the previous section, we now focus on the resolution of the following criterion that corresponds to a popular sparsity-based approach:

(19)$$\begin{split}{{\boldsymbol o}^*} &= \mathop {\text{arg min}}\limits_{{\boldsymbol o} \ge \textbf{0}} \quad \underbrace {{{\cal J}_{\textsf{fid}}}({c^*}({\boldsymbol o}),{\boldsymbol o},{\boldsymbol d})}_{\text{smooth part}\,{\cal G}} + \underbrace {{{\cal J}_{{l_1}}}({\boldsymbol o},\mu )}_{\text{non} - \text{smooth part}\,{\cal H}}\\[-4pt] &= \mathop {\text{arg min}}\limits_{{\boldsymbol o} \ge \textbf{0}} \left\| {{c^*}({\boldsymbol o}) {\boldsymbol{ \tilde m}}({\boldsymbol o}) - {\boldsymbol d}} \right\|_{\textbf{W}}^2 + \mu {\left\| {\boldsymbol o} \right\|_1}.\end{split}$$

The criterion minimizes the sum of a least-squares data-fidelity term developed in Eq. (9) and an $ {l_1} $ regularization [cf. Eq. (17)], weighted by the hyperparameter $ \mu $, to enforce the sparsity of the solution in the spatial domain. A positivity constraint is imposed on the solution ($ {\boldsymbol o} \ge \textbf{0} $). The problem Eq. (19) constitutes our case study from which we must find suitable minimization strategies.

In the following, we introduce a simple yet efficient method to solve this inverse problem, named Iterative Shrinkage-Thresholding Algorithm (ISTA). This method belongs to the class of proximal gradient methods [60], whose general principle is depicted in Fig. 4. The key idea is to replace the original minimization problem by a sequence of simpler minimization problems involving a surrogate function that majorizes the original function. The cost function is decomposed into the sum of a smooth (i.e., differentiable) and a non-smooth (i.e., non-differentiable) term. The smooth term is replaced by a separable quadratic approximation. Minimizing this approximation corresponds to computing the proximal operator associated with the non-smooth term [60]. In the case of regularization by an $ {l_1} $ norm, under a positivity constraint, we recall in Fig. 5 that the proximal operator corresponds to a soft-thresholding operation with a clipping of negative values to zero. In the following, we also establish that there is a strong connection between Fienup’s alternating projections method and this strategy applied to a particular formulation of the minimization problem.

Fig. 4. Principle that underlies proximal gradient methods is the iterative minimization of a local approximation of the original cost function. If the parameter $ t $ is chosen small enough, the approximation is majorant, and minimizing the approximation improves the current solution until convergence. When the non-smooth component $ {\cal H} $ is separable (as is the case of the $ {\ell _1} $ norm), minimizing the local approximation is easily done [see Fig. (5)].

Download Full Size | PDF

Fig. 5. Computation of the proximal operator: an illustration with the case of the $ {l_1} $ norm under positivity constraints. Values that are smaller than $ \mu t $ are mapped to zero by the proximal operator. Larger values of the regularization weight $ \mu $ therefore lead to more values being set to zero, i.e., a sparser solution.

Download Full Size | PDF

Fig. 6. Phase image reconstructed from simulated data with different methods and regularization terms. (a) Data (in-line hologram). The yellow frames indicate the regions of interest that are extracted from the field of view for visualization. (b) Ground truth phase. (c) Evolution of the normalized cost criterion (minimum and maximum values ranged between zero and one) for each reconstruction. (d)–(i) Reconstructed images using (d) Fienup method, (e) Fienup method + soft-thresholding, (f) Fienup method + soft-thresholding, stopped at 10 iterations, (g) RI method using FISTA with soft-thresholding, (h) RI method using FISTA with soft-thresholding + edge-preserving, (i) RI method using FISTA with soft-thresholding + edge-preserving, stopped at 10 iterations. For each reconstruction, a positivity constraint is imposed on the solution. The values of the hyperparameters are indicated in Table 2. Black dots delimit the contour of the ground truth image. Red curves show line profiles passing through particles of the image.

Download Full Size | PDF

Fig. 7. Phase image reconstructed from experimental data with different methods and regularization terms. (a) Data (in-line hologram). The yellow frame indicates the region of interest that is extracted from the field of view for visualization. (b) Evolution of the normalized cost criterion (minimum and maximum values ranged between zero and one) for each reconstruction. (c)–(h) Reconstructed images using (c) Fienup method, (d) Fienup method + soft-thresholding, (e) Fienup method + soft-thresholding, stopped at 10 iterations, (f) RI method using FISTA with soft-thresholding, (g) RI method using FISTA with soft-thresholding + edge-preserving, (h) RI method using FISTA with soft-thresholding + edge-preserving, stopped at 10 iterations. For each reconstruction, a positivity constraint is imposed on the solution. The values of the hyperparameters are indicated in Table 2. Red curves show a line profile passing through two particles of the image.

Download Full Size | PDF

2. Optimization Strategy

The application of the proximal gradient framework to the minimization of Eq. (19) leads to the following iteration:

(20)$$\begin{split}{{\boldsymbol o}^{(i + 1)}} &= {{\cal T}_{\mu t}}\left( {{{\boldsymbol o}^{(i)}} - t \nabla {\cal G}({{\boldsymbol o}^{(i)}})} \right)\\[-3pt] &= {{\cal T}_{\mu t}}\left( {{{\boldsymbol o}^{(i)}} - t \nabla {{\cal J}_{\textsf{fid}}}({c^*}({{\boldsymbol o}^{(i)}}),{{\boldsymbol o}^{(i)}},{\boldsymbol d})} \right)\\[-3pt]& = {{\cal T}_{\mu t}}\left( {{{\boldsymbol o}^{(i)}} - 2 t {c^*}({{\boldsymbol o}^{(i)}}) {\textbf{G}_z}^{T}\textbf{W}({c^*}({{\boldsymbol o}^{(i)}}) \boldsymbol{ \tilde m}({{\boldsymbol o}^{(i)}}) - {\boldsymbol d})} \right),\end{split}$$

where the soft-thresholding operator $ {{\cal T}_{\mu t}} $ is applied to the current reconstruction, improved by a steepest gradient descent step ($ t $ is the step length). As proved in Fig. 5, the proximal operator for the $ {l_1} $ norm under positivity constraint is defined by

(21)$${{\cal T}_\alpha }{({\boldsymbol o})_q} = \text{max}(0 , {o_q} - \alpha ) .$$

This soft-thresholding sets to zero all values below a threshold $ \alpha $, which effectively denoises the reconstruction and tends to remove small fluctuations in the background. This projection step enforces sparsity directly in the spatial domain. It is also possible to derive proximal operators that enforce sparsity in “transformed” domains, such as the image gradient space (TV regularization; see [61]) or the wavelet domain (see [62]).

To ensure the convergence of the algorithm, the step length $ t $ has to be chosen adequately. For our problem, a suitable value depends on the maximum eigenvalue of $ \textbf{G}_z^{T}{\textbf{G}_z} $ (i.e., the Lipschitz constant of $ {{\cal J}_{\textsf{fid}}} $; see [60,61]). As $ {\textbf{G}_z} $ is a convolution operator, this is equivalent to the maximum squared modulus of the Fourier transform of the kernel $ {g_z} $. It is also possible to use back-tracking methods to reduce the step length $ t $ if it is found that the surrogate function does not majorize the original cost function for the current value of $ t $ [61].

In Ref. [61], the authors propose a strategy to accelerate the convergence rate of ISTA, leading to the popular algorithm named Fast Iterative Shrinkage-Thresholding Algorithm (FISTA). We detail the general steps of FISTA in Algorithm 2. Implementation details are given in Appendix A, Algorithm 4. In Algorithm 2, if the scalar factor $ s $ is kept to the value 1, we fall back to the simpler ISTA algorithm.

Algorithm 2. FISTA algorithm [61] for solving problem Eq. (19)

View Table | View all tables in this article

3. Alternating Projections Point of View

We analyze the alternating projections strategy under the ISTA formulation presented in Fig. 4. From steps 1 to 4 in Algorithm 1, we derive the equivalent iteration step of Fienup’s algorithm as follows:

(22)$${\underline {\boldsymbol o} ^{(i + 1)}} = {{\cal P}_{\mathbb O}}\left( {\underline{\textbf{H}} _z^\dagger \left( {\sqrt {\boldsymbol{ \bar d}} \odot \frac{{\underline {\boldsymbol a} _z^{(i + {1}/{2})}}}{{|\underline {\boldsymbol a} _z^{(i + {1}/{2})}|}} - \textbf{1}} \right)} \right),$$

with $ \underline {\boldsymbol a} _z^{(i + {1}/{2})} = \textbf{1} + {\underline{\textbf{H}} _z}{\underline {\boldsymbol o} ^{(i)}} $, and given that $ \underline{\textbf{H}} _z^\dagger = {\underline{\textbf{H}} _{ - z}} $. $ \odot $ stands for the Hadamard pixelwise product, and the division and modulus operators are also applied pixelwise. Since $ \underline{\textbf{H}} _z^\dagger {\underline{\textbf{H}} _z} \approx \textbf{I} $, we can rewrite Eq. (22) as follows:

(23) $$\begin{split}{\underline {\boldsymbol o} ^{(i + 1)}} \approx {{\cal P}_{\mathbb O}} \left[{{\underline {\boldsymbol o} }^{(i)}} - \underline{\textbf{H}} _z^\dagger \left( \frac{{\underline {\boldsymbol a} _z^{(i + {1}/{2})}}}{{|\underline {\boldsymbol a} _z^{(i + {1}/{2})}|}}\odot \left( {|\underline {\boldsymbol a} _z^{(i + {1}/{2})}| - \sqrt {\boldsymbol{\bar d}} } \right)\right)\right].\end{split}$$

The term in the square brackets on which the projector $ {{\cal P}_{\mathbb O}} $ is applied corresponds to a steepest gradient descent step that decreases the following data-fidelity criterion:

(24)$$\begin{split}{{\cal{J}}_{\textsf{fid}}}({\boldsymbol o},\bar{{\boldsymbol d}})=\left\| |\underline{{{{\boldsymbol a}}_{{\boldsymbol z}}}}(\underline{{\boldsymbol o}})|-\sqrt{{\bar{{\boldsymbol d}}}} \right\|_{2}^{2}=\left\| |\textbf{1}+{{\underline{\textbf{H}}}_{z}}\ \underline{{\boldsymbol o}}|-\sqrt{{\bar{{\boldsymbol d}}}} \right\|_{2}^{2}.\end{split}$$

Contrary to the previous data fidelity (weighted least squares), this term involves the square root of the observations $ {\boldsymbol{ \bar d}} $ and does not correspond to the co-log-likelihood under a Gaussian assumption. Deriving the gradient term $ \nabla {{\cal J}_{\textsf{fid}}} $ must be done carefully because $ {\underline {\boldsymbol o}} $ is complex-valued. Separating the real and imaginary parts, or alternatively using Wirtinger calculus, leads to the gradient expression in Eq. (23); see, e.g., [63].

The projector $ {{\cal P}_{\mathbb O}} $ is the orthogonal projection onto the convex set $ {\mathbb O} $, and corresponds in our case to forcing positive or negative values of the real and/or imaginary parts of $ \underline {\boldsymbol o} $, and/or to forcing to zero all pixels that are outside the support of the object. $ {{\cal P}_{\mathbb O}} $ is in fact the proximal operator associated with the indicator function $ {\iota _{\mathbb O}} $ of the domain $ {\mathbb O} $ [60]:

(25)$${\iota _{\mathbb O}}(\underline {\boldsymbol o} ) = \left\{ {\begin{array}{*{20}{l}}0&{ \text{if}\,\,\underline {\boldsymbol o} \in {\mathbb O}}\\{ + \infty }&{\text{otherwise}}\end{array}} \right..$$

Then, applying Fienup’s alternating projections strategy given in Eq. (23) is analogous to performing ISTA iterations with a fixed step length $ t = \frac{1}{2} $, derived from the following problem:

(26)$${{\underline{{\boldsymbol o}}}^{*}}=\underset{\underline{{\boldsymbol o}}}{\mathop{\rm{arg min}}} \ \underbrace{\left\| |\textbf{1}+{{\underline{\textbf{H}}}_{z}}\ \underline{{\boldsymbol o}}|-\sqrt{{\bar{{\boldsymbol d}}}} \right\|_{2}^{2}}_{\rm{smooth\,\, part}\,\mathcal{G}}+\underbrace{{{\iota }_{\mathbb{O}}}(\underline{{\boldsymbol o}})}_{\rm{non-smooth\,\, part}\,\mathcal{H}},$$

where the smooth part $ {\cal G} $ correponds to the data-fidelity term $ {{\cal J}_{\textsf{fid}}} $ in Eq. (24) and the non-smooth part $ {\cal H} $ is the indicator function $ {\iota _{\mathbb O}} $ in Eq. (25).

The above analysis is important since it demonstrates that the alternating projections method underlies a particular inverse problems approach, and thus falls within this rigorous framework. The same as with the method developed in Section 4.C.2, the accelerated FISTA algorithm can then be applied to this problem, and it is even possible to replace the standard projection $ {{\cal P}_{\mathbb O}} $ by, for example, a soft-thresholding proximal operator $ {{\cal T}_{\mu t}} $ to enforce sparsity of $ \underline {\boldsymbol o} $ (real or/and imaginary part), or other regularization terms (provided that an efficient computation of the associated proximal operator is possible).

This interpretation of the alternating projections strategy in the framework of inverse problems and convex optimization methods has already been noted in previous works. In Ref. [5], Fienup already analyzed his variants of the ER algorithm as particular steepest descent strategies. Levi and Stark [50] have established that this method can be analyzed as a nonconvex instance of the Projection Onto Convex Sets (POCS) algorithm. This was rigorously demonstrated by Bauschke et al. [41], since Fienup’s BIO and HIO are shown to be instances of Dykstra and Douglas–Rachford algorithms. In Ref. [12], alternating projections strategies are reformulated as proximity operators derived from the maximum likelihood point of view.

Conversely, this analysis makes it possible to interpret the ISTA strategy in Eq. (20) from the alternating projections point of view, since we can deduce that this algorithm performs a succession of a propagation step $ {\textbf{G}_z} $ of the image $ {\boldsymbol o} $ to get a model of the data $ {\boldsymbol{ \tilde m}}({\boldsymbol o}) $, followed by a pseudo-backpropagation step $ \textbf{G}_z^{\text{T}} $ of an error term [difference between $ {\boldsymbol{ \tilde m}}({\boldsymbol o}) $ and $ {\boldsymbol d} $] and finally a projection step $ {{\cal T}_{\mu t}} $ on constraints.

Table 1. Experimental and Simulation Parameters of In-Line Holograms Data

View Table | View all tables in this article

Table 2. Reconstruction Parameters for Simulations and Experiments in Figs. 3,6–8

View Table | View all tables in this article

Fig. 8. Reconstruction (detection) of an out-of-field particle using RI method and field-of-view extension. (a) Data in-line hologram highlighting (in yellow) a part of the out-of-field hologram. (b) Ground truth image. (c), (d) Reconstructions with two different sets of regularization weights values [cf. Table 2]. The red frames show the initial data field of view (pixels seen by the detector). The green frames are zooms on the out-of-field particle.

Download Full Size | PDF

With the inverse problems point of view, Fienup’s ER algorithm becomes much more flexible. We can, for example, avoid the data normalization step $ {\boldsymbol{ \bar d}} $ and inject the optimal parameter $ {c^*}({\boldsymbol o}) $. Taking the square root of the data is not necessarily a good solution since it does change the noise statistics. Moreover, the image formation model $ {\boldsymbol{\tilde m}} $ of our approach directly calculates an intensity image $ {\boldsymbol{\tilde m}}({\boldsymbol o}) $ that can match the raw intensity measurements $ {\boldsymbol d} $, with or without the approximations in Section 4.A. Therefore, the approach developed in Section 4.C.1 is to be preferred, as it can be considered as a similar but more sophisticated alternating projections strategy.

With the above interpretation, we want to demonstrate that the inverse problems framework, in some specific but adapted cases, can keep the intuitive nature and ease of implementation property that generally constitutes the argument for preferring the use of Fienup’s method. This is an original contribution of this paper compared with the previous works.

5. RESULTS

In this section, we present the reconstruction results obtained with the proposed inverse problems approach formulated in Eq. (19), using the ISTA iteration given in Eq. (20) and its acceleration with the FISTA algorithm Algorithm 2. We compare this method with Fienup’s alternating projections approach defined by the iteration Eq. (23) using two versions: (i) the standard way with positivity constraint enforcement and (ii) an upgraded way with a soft-thresholding step ($ {l_1} $ sparsity constraint). All the reconstruction strategies also enforce a positivity constraint. In the following interpretations of the results, our regularized inversion approach method is referred to as “the RI method,” while Fienup’s alternating projections approach is referred to as “the Fienup method.”

Fig. 9. Derivation of a reconstruction algorithm from a cost function.

Download Full Size | PDF

A. Data

We reconstruct two data sets: an experimental in-line hologram of polystyrene beads in immersion oil [cf. Fig. 7(a)], and a simulated in-line hologram of beads in similar conditions [cf. Fig. 6(a)]. In this simulation, the transmittance $ {\boldsymbol o} $ is modeled as 2D truncated Gaussian footprints (the Gaussians are truncated so that the footprint has a diameter of $ 6\sigma $, and zero outside.). The peak value of this footprint is set to a refractive index difference $ ({n_0} - {n_{\text{bead}}}) = 0.07 $. The in-line hologram is simulated under Fresnel approximation. Table 1 summarizes the experiment and simulation parameters.

Looking at the data in Fig. 6(a), we can see that a particle outside the field of view is also present. Its diffraction fringes are truncated at the top of the hologram. We intentionally included this out-of-field object to illustrate that the inverse approach framework allows the reconstruction of holograms in a wider field of view than that seen by the sensor. We discuss this particular point in Section 6.

In these conditions, the maximum phase shift induced by the particles is equal to $ \Delta \varphi = 2\pi {d_{\text{bead}}}({n_0} - {n_{\text{bead}}})/\lambda \approx 0.83\,\,\text{rad} $. Then, the hypotheses (weakly dephasing objects) made to derive the model proposed in Section 4.A are not valid. Still, in addition to the fact that the problem is simplified (only a real image has to be estimated), we show that this approximate model leads to good reconstructions.

B. Reconstructions

For each experiment, all the reconstruction parameters in Figs. 6 and 7 are summarized in Table 2.

Figure 6 shows the results of the reconstruction of the simulated hologram data, using the two methods presented above, with different parameters. For each reconstructed image in Figs. 6(d)–6(i), the values of hyperparameters are given in Table 2. First, we can observe that both reconstruction methods using a soft-thresholding step give a satisfying estimation of the objects’ support. We see that the Fienup method enforcing this sparsity constraint is clearly better than the standard approach with a positivity constraint alone. However, the simulated objects have a spatially extended support, i.e., they cannot be considered as point objects. Thus, the sparsity constraint tends to create tiny holes within the objects. The sparsity constraint alone is not well adapted to this particular hologram reconstruction problem. The RI method can be improved by adding an edge-preserving regularization in the form of the edge-preserving term presented in Eq. (16). The new minimization problem is written as

(27)$$\begin{split} {{{\boldsymbol o}}^{*}} &= \underset{{\boldsymbol o}\ge \textbf{0}}{\mathop{\rm{argmin}}}\,\underbrace{{{\mathcal{J}}_{\textsf{fid}}}({{c}^{*}}({\boldsymbol o}),{\boldsymbol o},{\boldsymbol d})+{{\mathcal{J}}_{{\rm{TV}}_{{\epsilon }}}}({\boldsymbol o},{{\mu }_{\rm{TV}}})}_{\rm{smooth\,part}\,\mathcal{G}}\\[-6pt]&\quad+\underbrace{{{\mathcal{J}}_{{{l}_{1}}}}({\boldsymbol o},{{\mu }_{{{l}_{1}}}})}_{\rm{non-smooth\,part}\,\mathcal{H}} \\[-6pt] &= \underset{{\boldsymbol o}\ge \textbf{0}}{\mathop{\rm{argmin}}}\,\left\| {{c}^{*}}({\boldsymbol o})\tilde{{\boldsymbol m}}({\boldsymbol o})-{\boldsymbol d} \right\|_{\textbf{W}}^{2}\\[-6pt]&\quad+{{\mu }_{\rm{TV}}} {\sum_{q}\,\sqrt{||{{\nabla }_{q}}{\boldsymbol o}||_{2}^{2}}+{{\epsilon }^{2}}}+{{\mu }_{{{l}_{1}}}}{{\left\| {\boldsymbol o} \right\|}_{1}}.\end{split} $$

As this additional regularization term is differentiable, it is included in the smooth part of the proximal gradient algorithm [see Fig. 4]. Thus, the development of the ISTA iteration yields (see Fig. 9)

(28)$$\begin{split}{{{\boldsymbol o}}^{(i+1)}}&={{\mathcal{T}}_{{{\mu }_{{{l}_{1}}}}t}}\left( {{{\boldsymbol o}}^{(i)}}-t\left( \nabla {{\mathcal{J}}_{\textsf{fid}}}({{c}^{*}}({{{\boldsymbol o}}^{(i)}}),{{{\boldsymbol o}}^{(i)}},{\boldsymbol d})+\nabla {{\mathcal{J}}_{\rm{TV}_{{\epsilon }}}}({{{\boldsymbol o}}^{(i)}},{{\mu }_{\text{TV}}}) \right) \right) \\[-4pt]& ={{\mathcal{T}}_{{{\mu }_{{{l}_{1}}}}t}}\left(\vphantom{\left( \frac{{{\nabla }_{q}}{{{\boldsymbol o}}^{(i)}}}{\sqrt{||{{\nabla }_{q}}{{{\boldsymbol o}}^{(i)}}||_{2}^{2}+{{\epsilon }^{2}}}} \right)} {{{\boldsymbol o}}^{(i)}}-2t{{c}^{*}}({{{\boldsymbol o}}^{(i)}})\textbf{G}_{z}^{\text{T}}\textbf{W}({{c}^{*}}\tilde{m}({{{\boldsymbol o}}^{(i)}})-{\boldsymbol d})\right.\\[-4pt]&\qquad-\left.t{{\mu }_{\text{TV}}} {\sum_{q}}\,\nabla _{q}^{{T}}\left( \frac{{{\nabla }_{q}}{{{\boldsymbol o}}^{(i)}}}{\sqrt{||{{\nabla }_{q}}{{{\boldsymbol o}}^{(i)}}||_{2}^{2}+{{\epsilon }^{2}}}} \right) \right).\end{split} $$

The reconstruction via this combination of regularizations is shown in Fig. 6(h), where a clear benefit of considering both a sparsity constraint and an edge-preserving smoothing constraint is observed, as already shown in Fig. 3(f). Using the inverse problems interpretation of the Fienup method, we could also adapt Fienup’s algorithm to include both regularizations.

In Fig. 6(c), we show the evolution over the iterations of the global (normalized) criterion of the RI and Fienup methods, respectively, the criteria Eqs. (27) and (26). We illustrate the convergence of the RI method with several optimization strategies: FISTA, ISTA, and VMLMB [64,65], a quasi-Newton gradient variable metric method with limited memory requirements and possibly bound constraints enforcement on the unknowns. Whatever the method, an empirical convergence of the criteria values is reached in less than 10 iterations. This is confirmed in the reconstructions Figs. 6(f) and 6(i), which show the same reconstructions as Figs. 6(e) and 6(h) after only 10 iterations. We see that both reconstructions (RI and Fienup) are almost the same as the solution obtained after 1000 iterations.

Figure 7 shows reconstructions of the experimental hologram data, using the two methods presented above with different parameters. The field of view has been cropped to $ 1080 \times 1080 $ images. For each reconstructed image in Figs. 7(c)–7(h), the values of the hyperparameters are given in Table 2. In Fig. 7(b), the evolution over the iterations of the global normalized criterion of the methods is shown.

The observations and analysis that can be performed on these reconstructions match those made on the simulated data: the objects of interest are clearly detected. Moreover, the estimations of their shape, diameter [see Fig. 7(h)], and phase shift are coherent with the expected values. Indeed, from the experimental parameters in Table 1, the maximum phase shift should be around 0.83 rad. Again we notice that the convergence of the criteria is almost reached in a few tens of iterations.

6. DISCUSSION

As pointed out in Section 5.A, the linear model derived in this paper is only an approximation. Nevertheless, we observe in both the simulated and experimental results that the shape of the beads is correctly retrieved and that the phase-shift values estimated remain close to the expected values (the difference is less than one order of magnitude). The values obtained on the experimental holograms even match the expected phase shift at the center of the beads ($ \Delta \varphi \approx 0.83\,\,\text{rad} $).

This illustrates that the regularizations and constraints introduced to solve the inverse problem balance the modeling errors and lead to a satisfying solution: the reconstruction is robust to measurement and modeling errors.

In Section 5.A, we mentioned that an out-of-field particle was considered in the simulation, leading to cropped diffraction fringes in the upper part of the data [cf. Fig. 8(a)]. The inverse problems framework makes it possible to account for this data truncation (a problem also frequently faced in x-ray computerized tomography [66]) and reconstruct a wider field of view. This can be done by two equivalent ways: (i) an extension of the size of both the reconstructed object plane and the measured hologram; (ii) a field extension of only the reconstructed object plane. In the first case, unmeasured pixels (i.e., pixels outside the field of view of the sensor) are given a weight $ {w_q} = 0 $ in Eq. (9). In the second case, the model is rewritten to include a truncation operator (i.e., a rectangular matrix obtained by chopping off rows corresponding to unmeasured pixels):

(29)$$\boldsymbol{\tilde{m}(o)}=\boldsymbol{T}\ \left( \textbf{1}+{{\boldsymbol{G}}_{z}}{\boldsymbol o} \right).$$

Algorithm 3. Detailed implementation of Algorithm 1

View Table | View all tables in this article

Algorithm 4. Detailed implementation of Algorithm 2

View Table | View all tables in this article

Figure 8 shows reconstructions of the simulated hologram with the RI method, where the object field of view is doubled compared to the data field of view. These reconstructions illustrate that the out-of-field particle can be detected. Since a major part of the diffraction pattern of this bead is missing, the reconstruction is not as good as for other beads. If a proper regularization is used, the bead can be detected (a “too large” sparsity regularization would tend to suppress it). Here, the signal-to-noise ratio and the approximate direct model lead to an imperfect reconstruction of the morphology of the bead (asymmetrical shape). Other “reconstruction” approaches based on a parametric model of the spherical beads coupled with an adequate inversion algorithm (e.g., continuous matching pursuit algorithm) lead to accurate estimations even out of the field of view [19].

These discussions give an overview of the refinements made possible by a properly built inverse approach to enhance the quality of reconstructions and extract the most relevant information from the data.

7. CONCLUSION

In this work, we have presented the inverse problems methodology applied to the reconstruction of the phase information from in-line intensity holograms. The main goal was to provide a tutorial for this reconstruction strategy, in comparison with the standard alternating projections method proposed by Fienup. To this end, we have presented the overall methodology for building an inverse approach dedicated to the targeted application, from the modeling of data formation to the choice of suitable constraints and regularizations. Deriving a reconstruction algorithm from a cost function is straightforward, for classical regularization terms, by following a decomposition into smooth and non-smooth components, as summarized in Fig. 9. We have shown that Fienup’s method is analogous to a particular formulation of this phase retrieval problem when solved by proximal gradient descent iterations, i.e., the ISTA algorithm. We have shown reconstructions from our proposed RI method compared with standard and upgraded versions of Fienup’s method, and for which we have provided algorithmic details of implementation. The analysis of the results has shown that the reconstruction quality can be quite similar when the Fienup method is refined using the inverse problems framework. We hope that we have convinced the reader that developing a reconstruction strategy following the inverse problems methodology can bridge a gap in reachable reconstruction quality, while allowing much more flexibility to extract relevant information from the data, at no cost in terms of implementation efforts and computational burden. We think that this unifying point of view on various approaches to hologram reconstruction will help the cross-fertilization of algorithmic ideas.

APPENDIX A: DETAILED IMPLEMENTATIONS OF ALGORITHM 1 AND ALGORITHM 2

In order to facilitate the derivation of the equations and algorithms, we used the formalism of linear algebra in the main body of the paper. While it simplifies the expressions, it also makes the implementation of the algorithms less straightforward. In this appendix, we rewrite the algorithms to explicitly define which operations are involved (pixel-by-pixel operation, 2D discrete convolution).

The image of the object plane $ o $ (or $\underline{o} $) that has to be reconstructed, as well as intermediate images such as the propagated wave $ {\underline{a} _z} $, the convolution kernels $ {g_z} $ (or $ {\underline{h} _z} $ and $ {\underline{h} _{ - z}} $), and others, are implemented as 2D arrays of size $ {N_x} \times {N_y} $. In the absence of field extension or pixel super-resolution, the number of pixels $ {N_x} \times {N_y} $ in the object plane is equal to the number of pixels in the measured data. We define each 2D array $ x $ as follows: $ x \in {{\mathbb T}^{{N_x} \times {N_y}}} $, $ {\mathbb T} $ being the real domain $ {\mathbb R} $ or the complex domain $ {\mathbb C} $. We keep the notation $ {x_q} $ to represent the $ q $-th pixel of image $ x $. A 2D discrete convolution $ h * x $ with a kernel $ h $ is typically computed in the Fourier space using fast Fourier transforms (FFTs) and adequate zero padding to prevent periodization artifacts.

In Algorithm 4, if the scalar factor $ s $ is kept to the value 1, we fall back to the simpler ISTA algorithm.

Funding

Agence Nationale de la Recherche (ANR-11-LABX-0063, ANR-11-IDEX-0007); Région Auvergne-Rhône-Alpes.

Acknowledgment

The authors would like to warmly thank the anonymous reviewers for their careful reading and their numerous comments and suggestions that helped to significantly improve the paper. The algorithmic tools (optimization strategies, models, regularizations) presented in this work have been implemented within the framework of the MATLAB library GlobalBioIm [67,68] (https://biomedical-imaging-group.github.io/GlobalBioIm/index.html).

Disclosures

The authors declare no conflicts of interest.

REFERENCES

1. D. Gabor, “A new microscopic principle,” Nature 161, 777–778 (1948). [CrossRef]

2. V. Micó, J. Zheng, J. Garcia, Z. Zalevsky, and P. Gao, “Resolution enhancement in quantitative phase microscopy,” Adv. Opt. Photon. 11, 135–214 (2019). [CrossRef]

3. R. Gerchberg and W. Saxton, “A practical algorithm for the determination of phase from image and diffraction plane pictures,” Optik 35, 237–246 (1972).

4. J. R. Fienup, “Reconstruction of an object from the modulus of its Fourier transform,” Opt. Lett. 3, 27–29 (1978). [CrossRef]

5. J. R. Fienup, “Phase retrieval algorithms: a comparison,” Appl. Opt. 21, 2758–2769 (1982). [CrossRef]

6. H. H. Bauschke, P. L. Combettes, and D. R. Luke, “Hybrid projection-reflection method for phase retrieval,” J. Opt. Soc. Am. A 20, 1025–1034 (2003). [CrossRef]

7. V. Elser, “Solution of the crystallographic phase problem by iterated projections,” Acta Crystallogr. Sec. A 59, 201–209 (2003). [CrossRef]

8. D. R. Luke, “Relaxed averaged alternating reflections for diffraction imaging,” Inverse Probl. 21, 37–50 (2005). [CrossRef]

9. S. Marchesini, “Invited article: a unified evaluation of iterative projection algorithms for phase retrieval,” Rev. Sci. Instrum. 78, 011301 (2007). [CrossRef]

10. R. A. Dilanian, G. J. Williams, L. W. Whitehead, D. J. Vine, A. G. Peele, E. Balaur, I. McNulty, H. M. Quiney, and K. A. Nugent, “Coherent diffractive imaging: a new statistically regularized amplitude constraint,” New J. Phys. 12, 093042 (2010). [CrossRef]

11. J. A. Rodriguez, R. Xu, C.-C. Chen, Y. Zou, and J. Miao, “Oversampling smoothness: an effective algorithm for phase retrieval of noisy diffraction intensities,” J. Appl. Crystallogr. 46, 312–318 (2013). [CrossRef]

12. F. Soulez, E. Thiébaut, A. Schutz, A. Ferrari, F. Courbin, and M. Unser, “Proximity operators for phase retrieval,” Appl. Opt. 55, 7412–7421 (2016). [CrossRef]

13. T. Latychevskaia and H.-W. Fink, “Solution to the twin image problem in holography,” Phys. Rev. Lett. 98, 233901 (2007). [CrossRef]

14. M. L. Moravec, J. K. Romberg, and R. G. Baraniuk, “Compressive phase retrieval,” Proc. SPIE 6701, 670120 (2007). [CrossRef]

15. S. Mukherjee and C. S. Seelamantula, “An iterative algorithm for phase retrieval with sparsity constraints: application to frequency domain optical coherence tomography,” in IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (2012), pp. 553–556.

16. Y. Rivenson, Y. Wu, H. Wang, Y. Zhang, A. Feizi, and A. Ozcan, “Sparsity-based multi-height phase recovery in holographic microscopy,” Sci. Rep. 6, 37862 (2016). [CrossRef]

17. F. Jolivet, F. Momey, L. Denis, L. Méès, N. Faure, N. Grosjean, F. Pinston, J.-L. Marié, and C. Fournier, “Regularized reconstruction of absorbing and phase objects from a single in-line hologram, application to fluid mechanics and micro-biology,” Opt. Express 26, 8923–8940 (2018). [CrossRef]

18. A. Berdeu, O. Flasseur, L. Méès, L. Denis, F. Momey, T. Olivier, N. Grosjean, and C. Fournier, “Reconstruction of in-line holograms: combining model-based and regularized inversion,” Opt. Express 27, 14951–14968 (2019). [CrossRef]

19. F. Soulez, L. Denis, E. Thiébaut, C. Fournier, and C. Goepfert, “Inverse problem approach in particle digital holography: out-of-field particle detection made possible,” J. Opt. Soc. Am. A 24, 3708–3716 (2007). [CrossRef]

20. J. W. Goodman, Introduction to Fourier Optics (Roberts and Company, 2005).

21. E. Wolf, “Three-dimensional structure determination of semi-transparent objects from holographic data,” Opt. Commun. 1, 153–156 (1969). [CrossRef]

22. G. Mie, “Beiträge zur Optik trüber Medien, speziell kolloidaler Metallösungen,” Ann. Phys. 330, 377–445 (1908). [CrossRef]

23. C. Fournier, F. Jolivet, L. Denis, N. Verrier, E. Thiebaut, C. Allier, and T. Fournel, “Pixel super-resolution in digital holography by regularized reconstruction,” Appl. Opt. 56, 69–77 (2017). [CrossRef]

24. O. Flasseur, F. Jolivet, F. Momey, L. Denis, and C. Fournier, “Improving color lensless microscopy reconstructions by self-calibration,” Proc. SPIE 10677, 106771A (2018). [CrossRef]

25. Y. Cotte, F. Toy, P. Jourdain, N. Pavillon, D. Boss, P. Magistretti, P. Marquet, and C. Depeursinge, “Marker-free phase nanoscopy,” Nat. Photonics 7, 113–117 (2013). [CrossRef]

26. J. Bailleul, B. Simon, M. Debailleul, L. Foucault, N. Verrier, and O. Haeberlé, “Tomographic diffractive microscopy: towards high-resolution 3-D real-time data acquisition, image reconstruction and display of unlabeled samples,” Opt. Commun. 422, 28–37 (2018). [CrossRef]

27. S. P. Hau-Riege, H. Szoke, H. N. Chapman, A. Szoke, S. Marchesini, A. Noy, H. He, M. Howells, U. Weierstall, and J. C. H. Spence, “SPEDEN: reconstructing single particles from their diffraction patterns,” Acta Crystallogr. Sec. A 60, 294–305 (2004). [CrossRef]

28. S. Sotthivirat and J. A. Fessler, “Penalized-likelihood image reconstruction for digital holography,” J. Opt. Soc. Am. A 21, 737–750 (2004). [CrossRef]

29. L. Denis, D. Lorenz, E. Thiébaut, C. Fournier, and D. Trede, “Inline hologram reconstruction with sparsity constraints,” Opt. Lett. 34, 3475–3477 (2009). [CrossRef]

30. D. J. Brady, K. Choi, D. L. Marks, R. Horisaki, and S. Lim, “Compressive holography,” Opt. Express 17, 13040–13049 (2009). [CrossRef]

31. Y. Rivenson, A. Stern, and B. Javidi, “Compressive Fresnel holography,” J. Display Technol. 6, 506–509 (2010). [CrossRef]

32. Y. Shechtman, A. Beck, and Y. C. Eldar, “GESPAR: efficient phase retrieval of sparse signals,” IEEE Trans. Signal Process. 62, 928–938 (2014). [CrossRef]

33. A. Repetti, E. Chouzenoux, and J. Pesquet, “A nonconvex regularized approach for phase retrieval,” in IEEE International Conference on Image Processing (ICIP) (2014), pp. 1753–1757.

34. A. Drémeau and F. Krzakala, “Phase recovery from a Bayesian point of view: the variational approach,” in IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (2015), pp. 3661–3665.

35. A. M. Tillmann, Y. C. Eldar, and J. Mairal, “Dictionary learning from phaseless measurements,” in IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (2016), pp. 4702–4706.

36. J. Song, C. L. Swisher, H. Im, S. Jeong, D. Pathania, Y. Iwamoto, M. Pivovarov, R. Weissleder, and H. Lee, “Sparsity-based pixel super resolution for lens-free digital in-line holography,” Sci. Rep. 6, 24681 (2016). [CrossRef]

37. A. Berdeu, F. Momey, B. Laperrousaz, T. Bordy, X. Gidrol, J.-M. Dinten, N. Picollet-D’hahan, and C. Allier, “Comparative study of fully three-dimensional reconstruction algorithms for lens-free microscopy,” Appl. Opt. 56, 3939–3951 (2017). [CrossRef]

38. F. Soulez, L. Denis, E. Thiébaut, C. Fournier, and C. Goepfert, “Inverse-problem approach for particle digital holography: accurate location based on local optimization,” J. Opt. Soc. Am. A 24, 1164–1171 (2007). [CrossRef]

39. O. Flasseur, L. Denis, C. Fournier, and E. Thiébaut, “Robust object characterization from lensless microscopy videos,” in 25th European Signal Processing Conference (EUSIPCO) (IEEE, 2017), pp. 1445–1449.

40. B. Liu and N. C. Gallagher, “Convergence of a spectrum shaping algorithm,” Appl. Opt. 13, 2470–2471 (1974). [CrossRef]

41. H. H. Bauschke, P. L. Combettes, and D. R. Luke, “Phase retrieval, error reduction algorithm, and Fienup variants: a view from convex optimization,” J. Opt. Soc. Am. A 19, 1334–1345 (2002). [CrossRef]

42. D. Noll and A. Rondepierre, “On local convergence of the method of alternating projections,” Found. Comput. Math. 16, 425–455 (2016). [CrossRef]

43. J. Miao, D. Sayre, and H. N. Chapman, “Phase retrieval from the magnitude of the Fourier transforms of nonperiodic objects,” J. Opt. Soc. Am. A 15, 1662–1669 (1998). [CrossRef]

44. J. Miao, P. Charalambous, J. Kirz, and D. Sayre, “Extending the methodology of x-ray crystallography to allow imaging of micrometre-sized non-crystalline specimens,” Nature 400, 342–344 (1999). [CrossRef]

45. Y. Shechtman, Y. C. Eldar, O. Cohen, H. N. Chapman, J. Miao, and M. Segev, “Phase retrieval with application to optical imaging: a contemporary overview,” IEEE Signal Process. Mag. 32(3), 87–109 (2015). [CrossRef]

46. H. N. Chapman and K. A. Nugent, “Coherent lensless x-ray imaging,” Nat. Photonics 4(12), 833–839 (2010). [CrossRef]

47. W. Bishara, T.-W. Su, A. F. Coskun, and A. Ozcan, “Lensfree on-chip microscopy over a wide field-of-view using pixel super-resolution,” Opt. Express 18, 11181–11191 (2010). [CrossRef]

48. Y. Wu and A. Ozcan, “Lensless digital holographic microscopy and its applications in biomedicine and environmental monitoring,” Methods 136, 4–16 (2018). [CrossRef]

49. T. Latychevskaia, “Iterative phase retrieval in coherent diffractive imaging: practical issues,” Appl. Opt. 57, 7187–7197 (2011). [CrossRef]

50. A. Levi and H. Stark, “Image restoration by the method of generalized projections with application to restoration from magnitude,” J. Opt. Soc. Am. A 1, 932–943 (1984). [CrossRef]

51. R. Horisaki, Y. Ogura, M. Aino, and J. Tanida, “Single-shot phase imaging with a coded aperture,” Opt. Lett. 39, 6466–6469 (2014). [CrossRef]

52. Z. Wang, Q. Dai, D. Ryu, K. He, R. Horstmeyer, and A. Katsaggelos, “Dictionary-based phase retrieval for space-time super resolution using lens-free on-chip holographic video,” in Imaging and Applied Optics (3D, AIO, COSI, IS, MATH, pcAOP) (Optical Society of America, 2017), paper CTu2B.3.

53. F. Eilenberger, S. Minardi, D. Pliakis, and T. Pertsch, “Digital holography from shadowgraphic phase estimates,” Opt. Lett. 37, 509–511 (2012). [CrossRef]

54. O. Flasseur, L. Denis, E. Thiébaut, T. Olivier, and C. Fournier, “ExPACO: detection of an extended pattern under nonstationary correlated noise by patch covariance modeling,” in IEEE European Signal Processing Conference (EUSIPCO), Coruna, Spain (2019).

55. A. Tarantola, Inverse Problem Theory and Methods for Model Parameter Estimation (SIAM, 2005).

56. A. Ribes and F. Schmitt, “Linear inverse problems in imaging,” IEEE Signal Process. Mag. 25(4), 84–99 (2008). [CrossRef]

57. J. A. Fessler, “Penalized weighted least-squares image reconstruction for positron emission tomography,” IEEE Trans. Med. Imag. 13, 290–300 (1994). [CrossRef]

58. L. Rudin, S. Osher, and E. Fatemi, “Nonlinear total variation based noise removal algorithms,” Physica D 60, 259–268 (1992). [CrossRef]

59. P. Charbonnier, L. Blanc-Féraud, G. Aubert, and M. Barlaud, “Deterministic edge-preserving regularization in computed imaging,” IEEE Trans. Image Process. 6, 298–311 (1997). [CrossRef]

60. N. Parikh and S. Boyd, “Proximal algorithms,” Found. Trends Optim. 1, 127–239 (2014). [CrossRef]

61. A. Beck and M. Teboulle, “Fast gradient-based algorithms for constrained total variation image denoising and deblurring problems,” IEEE Trans. Image Process. 18, 2419–2434 (2009). [CrossRef]

62. I. Daubechies, M. Defrise, and C. De Mol, “An iterative thresholding algorithm for linear inverse problems with a sparsity constraint,” Commun. Pure Appl. Math. 57, 1413–1457 (2004). [CrossRef]

63. G. Wang, G. B. Giannakis, and Y. C. Eldar, “Solving systems of random quadratic equations via truncated amplitude flow,” IEEE Trans. Inf. Theory 64, 773–794 (2017). [CrossRef]

64. J. Nocedal, “Updating quasi-Newton matrices with limited storage,” Math. Comp. 35, 773–782 (1980). [CrossRef]

65. E. Thiebaut, “Optimization issues in blind deconvolution algorithms,” Proc. SPIE 4847, 174–183 (2002). [CrossRef]

66. M. Defrise, F. Noo, R. Clackdoyle, and H. Kudo, “Truncated Hilbert transform and image reconstruction from limited tomographic data,” Inverse Probl. 22, 1037 (2006). [CrossRef]

67. M. Unser, E. Soubies, F. Soulez, M. McCann, and L. Donati, “GlobalBioIm: a unifying computational framework for solving inverse problems,” in Imaging and Applied Optics (3D, AIO, COSI, IS, MATH, pcAOP) (Optical Society of America, 2017), paper CTu1B.1.

68. E. Soubies, F. Soulez, M. McCann, T.-A. Pham, L. Donati, T. Debarre, D. Sage, and M. Unser, “Pocket guide to solve inverse problems with globalbioim,” Inverse Probl. 35, 104006 (2019). [CrossRef]

		Simulation	Experiment
Illumination wavelength (in nm)	$λ$	532
Propagation distance (in µm)	$z$	12.5	7.3
Refractive index of the medium	$n_{0}$	1.52
Refractive index of the beads	$n_{bead}$	1.59	1.59 (commercial data)
Diameter of the beads (in µm)	$d_{bead}$	1.0	1.0 (commercial data)
Magnification factor (microscope objective)	$m a g$	56.7
Size of the image (in pixels)	$N = M$	$512 \times 512$	$1920 \times 1080$
Pixel size (in µm)	$Δ_{x} = Δ_{y}$	4.4	2.2
Noise type	$η$	Gaussian i.i.d.	—
Signal-to-noise ratio	$⟨ d ⟩ / σ_{η}$	200	—

Input:
$d \in R^{N_{x} \times N_{y}}$ ;	{intensity measurements}
${\underline{h}}_{z} \in C^{N_{x} \times N_{y}}$ ;	{propagation kernel ${\underline{h}}_{z}$ }
${\underline{h}}_{- z} \in C^{N_{x} \times N_{y}}$ ;	{backpropagation kernel ${\underline{h}}_{- z}$ }
$Λ r m i n \in R^{N_{x} \times N_{y}}$ ;	{minimum bound constraint for each pixel of $R e (o)$ (feasible domain $O$ )}
$Λ r m a x \in R^{N_{x} \times N_{y}}$ ;	{maximum bound constraint for each pixel of $R e (o)$ (feasible domain $O$ )}
$Λ i m i n \in R^{N_{x} \times N_{y}}$ ;	{minimum bound constraint for each pixel of $J m (o)$ (feasible domain $O$ )}
$Λ i m a x \in R^{N_{x} \times N_{y}}$ ;	{maximum bound constraint for each pixel of $J m (o)$ (feasible domain $O$ )}
maxiter;	{maximum number of iterations}
Output:
$\underline{o} \in C^{N_{x} \times N_{y}}$ ;	{unknown deviation from unit transmittance $\underline{t}$ }
Allocations:
$\underline{a} \in C^{N_{x} \times N_{y}}$ ;	{array for storing the simulated diffracted wave $a_{z}$ }
begin
	{Step 0: initializations}
1 $d \leftarrow normalize (d)$ ;	{normalize the data hologram so that the background equals 1}
	{N.B.: for instance, divide $d$ by its mean or its median (almost valid in the case of a low-density distribution of objects).}
2 $\underline{o} \leftarrow {\underline{h}}_{- z} * (\sqrt{d} - 1)$ ;	{first guess: for instance, direct backpropagation of the data (or initialization with random values).}
	{N.B.: the square root $\sqrt{}$ is applied pixelwise.}
for $i \leftarrow 1 : m a x i t e r$ do
3 $\underline{a} \leftarrow 1 + {\underline{h}}_{z} * \underline{o}$ ;	{Step 1: propagation to the sensor plane}
for $q \leftarrow 1 : N_{x} \cdot N_{y}$ do	{Step 2: enforce the measured amplitude at sensor plane}
if ${\underline{a}}_{q} \neq 0$ then
4 ${\underline{a}}_{q} \leftarrow \sqrt{d_{q}} \cdot ({\underline{a}}_{q} / \| {\underline{a}}_{q} \|)$ ;
else
5 ${\underline{a}}_{q} \leftarrow 0$ ;
end
end
6 $\underline{o} \leftarrow {\underline{h}}_{- z} * (\underline{a} - 1);$	{Step 3: backpropagation to the sample plane}
for $q \leftarrow 1 : N_{x} \cdot N_{y}$ do	{Step 4: projection on the domain $O$ }
7 $R e {(\underline{o})}_{q} \leftarrow m i n (R e {(\underline{o})}_{q}, Λ {r m a x}_{q});$
8 $R e {(\underline{o})}_{q} \leftarrow m a x (R e {(\underline{o})}_{q}, Λ {r m i n}_{q});$
9 $J m {(\underline{o})}_{q} \leftarrow m i n (J m {(\underline{o})}_{q}, Λ {r m a x}_{q});$
10 $J m {(\underline{o})}_{q} \leftarrow m a x (J m {(\underline{o})}_{q}, Λ {r m i n}_{q});$
	{N.B.: the operators $R e ()$ and $J m ()$ , giving, respectively, the real and imaginary parts (arrays) of the array $\underline{o}$ , are applied pixelwise.}
end
end
end

Input:
$d \in R^{N_{x} \times N_{y}}$ ;	{intensity measurements}
$g_{z} \in R^{N_{x} \times N_{y}}$ ;	{propagation kernel $g_{z}$ }
$w \in R^{N_{x} \times N_{y}}$ ;	{confidence weights applied to each pixel data (these arrays’ values constitute the diagonal of the matrix $W$ in Eq. (9))}
$f l a g_{p o s}$ ;	{flag ( $t r u e$ or $f a l s e$ ) for enforcing a positivity constraint}
$μ$ ;	{hyperparameter value for the soft-thresholding (sparsity contraint)}
$t$ ;	{steepest gradient descent step length}
Output:
$o \in R^{N_{x} \times N_{y}}$ ;	{unknown deviation from unit transmittance $\underline{t}$ }
Allocations:
$o_{p r e v} \in R^{N_{x} \times N_{y}}$ ;	{estimate at previous iterate}
$\tilde{m} \in R^{N_{x} \times N_{y}}$ ;	{array for storing the direct model}
$r \in R^{N_{x} \times N_{y}}$ ;	{array for storing the residues (difference between the model and the data)}
$u \in R^{N_{x} \times N_{y}}$ ;	{intermediate array}
$c \in R$ ;	{intensity hologram scaling factor}
$s \in R$ ;	{scalar factor for the acceleration step}
$s_{p r e v} \in R$ ;	{previous scalar factor for the acceleration step}
begin
	{Step 0: initializations}
1 $o_{p r e v} \leftarrow zeros (N_{x}, N_{y})$ ;	{first guess: for instance, $zeros (N_{x}, N_{y})$ returns a $N_{x} \times N_{y}$ array of zeros.}
2 $u \leftarrow o_{p r e v}$ ;
3 $s_{p r e v} \leftarrow 1$ ;
repeat
4 $\tilde{m} \leftarrow 1 + [g_{z} * u]$ ;	{Step 1: calculate the direct model}
5 $c \leftarrow \sum_{q} w_{q} {\tilde{m}}_{q} d_{q} / \sum_{q} w_{q} {\tilde{m}}_{q}^{2}$ ;	{Step 2: get optimal current scaling factor}
for $q \leftarrow 1 : N_{x} \cdot N_{y}$ do
6 $r_{q} \leftarrow w_{q} (c {\tilde{m}}_{q} - d_{q})$ ;	{Step 3: get the weighted residual pixel values}
end
7 $r \leftarrow g_{z} * r$ ;	{Step 4: backpropagation of the residues}
	{N.B.: the backpropagation kernel is equal to the propagation kernel (cf. Section 4.1)}
for $q \leftarrow 1 : N_{x} \cdot N_{y}$ do
8 $o_{q} \leftarrow u_{q} - 2 t c r_{q}$ ;	{Step 5: steepest gradient descent step}
if $f l a g_{p o s} = t r u e$ then	{Step 6: soft-thresholding}
9 $o_{q} \leftarrow max (0, o_{q} - μ t)$ ;
else
10 $o_{q} \leftarrow sign (o_{q}) max (0, \| o_{q} \| - μ t)$ ;
end
end
11 $s \leftarrow \frac{1}{2} (1 + \sqrt{1 + 4 {(s_{p r e v})}^{2}})$ ;	{Step 7: get new scalar factor $s$ for the acceleration step}
12 $u \leftarrow o + \frac{s_{p r e v} - 1}{s} (o - o_{p r e v})$ ;	{Step 8: new intermediate array $u$ according to previous iterate}
	{N.B.: this step is applied pixelwise.}
13 $s_{p r e v} \leftarrow s$ ;
14 $o_{p r e v} \leftarrow o$ ;	{Step 9: update memory of previous values}
until convergence;
end

		Simulation	Experiment
Illumination wavelength (in nm)	$λ$	532
Propagation distance (in µm)	$z$	12.5	7.3
Refractive index of the medium	$n_{0}$	1.52
Refractive index of the beads	$n_{bead}$	1.59	1.59 (commercial data)
Diameter of the beads (in µm)	$d_{bead}$	1.0	1.0 (commercial data)
Magnification factor (microscope objective)	$m a g$	56.7
Size of the image (in pixels)	$N = M$	$512 \times 512$	$1920 \times 1080$
Pixel size (in µm)	$Δ_{x} = Δ_{y}$	4.4	2.2
Noise type	$η$	Gaussian i.i.d.	—
Signal-to-noise ratio	$⟨ d ⟩ / σ_{η}$	200	—

Input:
$d \in R^{N_{x} \times N_{y}}$ ;	{intensity measurements}
${\underline{h}}_{z} \in C^{N_{x} \times N_{y}}$ ;	{propagation kernel ${\underline{h}}_{z}$ }
${\underline{h}}_{- z} \in C^{N_{x} \times N_{y}}$ ;	{backpropagation kernel ${\underline{h}}_{- z}$ }
$Λ r m i n \in R^{N_{x} \times N_{y}}$ ;	{minimum bound constraint for each pixel of $R e (o)$ (feasible domain $O$ )}
$Λ r m a x \in R^{N_{x} \times N_{y}}$ ;	{maximum bound constraint for each pixel of $R e (o)$ (feasible domain $O$ )}
$Λ i m i n \in R^{N_{x} \times N_{y}}$ ;	{minimum bound constraint for each pixel of $J m (o)$ (feasible domain $O$ )}
$Λ i m a x \in R^{N_{x} \times N_{y}}$ ;	{maximum bound constraint for each pixel of $J m (o)$ (feasible domain $O$ )}
maxiter;	{maximum number of iterations}
Output:
$\underline{o} \in C^{N_{x} \times N_{y}}$ ;	{unknown deviation from unit transmittance $\underline{t}$ }
Allocations:
$\underline{a} \in C^{N_{x} \times N_{y}}$ ;	{array for storing the simulated diffracted wave $a_{z}$ }
begin
	{Step 0: initializations}
1 $d \leftarrow normalize (d)$ ;	{normalize the data hologram so that the background equals 1}
	{N.B.: for instance, divide $d$ by its mean or its median (almost valid in the case of a low-density distribution of objects).}
2 $\underline{o} \leftarrow {\underline{h}}_{- z} * (\sqrt{d} - 1)$ ;	{first guess: for instance, direct backpropagation of the data (or initialization with random values).}
	{N.B.: the square root $\sqrt{}$ is applied pixelwise.}
for $i \leftarrow 1 : m a x i t e r$ do
3 $\underline{a} \leftarrow 1 + {\underline{h}}_{z} * \underline{o}$ ;	{Step 1: propagation to the sensor plane}
for $q \leftarrow 1 : N_{x} \cdot N_{y}$ do	{Step 2: enforce the measured amplitude at sensor plane}
if ${\underline{a}}_{q} \neq 0$ then
4 ${\underline{a}}_{q} \leftarrow \sqrt{d_{q}} \cdot ({\underline{a}}_{q} / \| {\underline{a}}_{q} \|)$ ;
else
5 ${\underline{a}}_{q} \leftarrow 0$ ;
end
end
6 $\underline{o} \leftarrow {\underline{h}}_{- z} * (\underline{a} - 1);$	{Step 3: backpropagation to the sample plane}
for $q \leftarrow 1 : N_{x} \cdot N_{y}$ do	{Step 4: projection on the domain $O$ }
7 $R e {(\underline{o})}_{q} \leftarrow m i n (R e {(\underline{o})}_{q}, Λ {r m a x}_{q});$
8 $R e {(\underline{o})}_{q} \leftarrow m a x (R e {(\underline{o})}_{q}, Λ {r m i n}_{q});$
9 $J m {(\underline{o})}_{q} \leftarrow m i n (J m {(\underline{o})}_{q}, Λ {r m a x}_{q});$
10 $J m {(\underline{o})}_{q} \leftarrow m a x (J m {(\underline{o})}_{q}, Λ {r m i n}_{q});$
	{N.B.: the operators $R e ()$ and $J m ()$ , giving, respectively, the real and imaginary parts (arrays) of the array $\underline{o}$ , are applied pixelwise.}
end
end
end

From Fienup’s phase retrieval techniques to regularized inversion for in-line holography: tutorial

Abstract

1. INTRODUCTION

2. THE INVERSE PROBLEM OF IMAGE RECONSTRUCTION IN IN-LINE HOLOGRAPHY

3. FIENUP’S ALTERNATING PROJECTIONS STRATEGIES

4. INVERSE PROBLEMS METHODOLOGY FOR THE RECONSTRUCTION OF IN-LINE HOLOGRAMS: A TUTORIAL

A. Linearized Image Formation Model

B. Building a Regularized Inverse Approach

1. Data Fidelity

2. Constraints and Regularizations

C. Case Study: Solving a Standard Sparsity-Based Problem

1. Problem

2. Optimization Strategy

3. Alternating Projections Point of View

5. RESULTS

A. Data

B. Reconstructions

6. DISCUSSION

7. CONCLUSION

APPENDIX A: DETAILED IMPLEMENTATIONS OF ALGORITHM 1 AND ALGORITHM 2

Funding

Acknowledgment

Disclosures

REFERENCES

Cited By

Figures (9)

Tables (6)

Equations (29)

Journal of the Optical Society of America A

		Reconstruction Parameters
		Gradient Descent Step		Gradient Smoothness	$l_{1}$ -Sparsity	Edge-Preserving Smoothness		Max. nb. of
Figure	Method	$t$	Positivity	$μ_{l_{2} \nabla}$	$μ_{l_{1}}$	$μ_{TV}$	$ϵ_{TV}$	Iterations
Fig. 3(b); Fig. 6(b)	Ground Truth
Fig. 6(d)	Fienup	1.0	Yes	—	—	—	—	1000
Fig. 6(e)	Fienup	1.0	Yes	—	0.01	—	—	1000
Fig. 6(f)	Fienup	1.0	Yes	—	0.01	—	—	10
Fig. 3(c)	RI \| VMLMB	—	Yes	1.0	—	—	—	1000
Fig. 3(d)	RI \| VMLMB	—	Yes	—	—	0.1	0.01	1000
Fig. 3(e); Fig. 6(g)	RI \| FISTA	0.1	Yes	—	0.1	—	—	1000
Fig. 3(f); Fig. 6(h)	RI \| FISTA	0.05	Yes	—	0.1	0.1	0.01	1000
Fig. 6(i)	RI \| FISTA	0.05	Yes	—	0.1	0.1	0.01	10
Fig. 7(c)	Fienup	1.0	Yes	—	—	—	—	100
Fig. 7(d)	Fienup	1.0	Yes	—	0.1	—	—	100
Fig. 7(e)	Fienup	1.0	Yes	—	0.1	—	—	10
Fig. 7(f)	RI \| FISTA	0.1	Yes	—	0.5	—	—	100
Fig. 7(g)	RI \| FISTA	0.1	Yes	—	0.5	0.01	0.01	100
Fig. 7(h)	RI \| FISTA	0.1	Yes	—	0.5	0.01	0.01	10
Fig. 8(c)	RI \| FISTA	0.01	Yes	—	0.1	0.001	0.01	100
Fig. 8(d)	RI \| FISTA	0.01	Yes	—	0.1	0.1	0.01	100