Expand this Topic clickable element to expand a topic
Skip to content
Optica Publishing Group

Referenceless characterization of complex media using physics-informed neural networks

Open Access Open Access

Abstract

In this work, we present a method to characterize the transmission matrices of complex scattering media using a physics-informed, multi-plane neural network (MPNN) without the requirement of a known optical reference field. We use this method to accurately measure the transmission matrix of a commercial multi-mode fiber without the problems of output-phase ambiguity and dark spots, leading to up to 58% improvement in focusing efficiency compared with phase-stepping holography. We demonstrate how our method is significantly more noise-robust than phase-stepping holography and show how it can be generalized to characterize a cascade of transmission matrices, allowing one to control the propagation of light between independent scattering media. This work presents an essential tool for accurate light control through complex media, with applications ranging from classical optical networks, biomedical imaging, to quantum information processing.

Published by Optica Publishing Group under the terms of the Creative Commons Attribution 4.0 License. Further distribution of this work must maintain attribution to the author(s) and the published article's title, journal citation, and DOI.

1. Introduction

A scattering matrix provides complete knowledge of the linear optical response of a material. This not only gives us a better understanding of light-matter interactions, but also allows us to harness their use in a diverse range of photonic technologies [14]. For example, knowledge of the scattering matrix can be utilized in the control of light propagation through an opaque scattering medium [5,6], serve as an alternative way to construct optical devices [4], and has many applications spanning from imaging through scattering tissue [7,8], mode-multiplexing through a multimode fiber [9], optical neural networks [1013] to the transport and manipulation of quantum states of light [1418]. However, as the complexity of an optical system of interest grows larger, the efficient and accurate measurement of its scattering matrix can be challenging.

Over the last few decades, the development of techniques for measuring a scattering matrix— both its transmission (TM) and reflection (RM) components—have been advanced and tailored to particular scenarios with different conditions and prior assumptions [6,1931]. Conventionally, the measurement of a TM is performed under the assumption that a medium preserves the coherence properties of the input light, i.e., the scattering process (channel) is pure. In this case, the measurement is usually performed by sending different pure input states in a given basis sequentially to probe a medium/system of interest and detecting the corresponding output fields by means of off-axis holography using an external reference [3235]. The technique requires an interferometer that is insensitive to environmental disturbances, particularly for a long optical path length.

To mitigate the problem of stability, alternative methods such as phase-stepping holography with a co-propagating internal reference field have been developed [6]. Nevertheless, the use of a common-path reference field for an accurate TM measurement poses additional challenges since the internal reference needs to interfere with every optical mode within the complex medium with sufficient interference visibility for an optimal TM measurement. However, the internal reference field usually turns into a speckle pattern due to scattering, with a large variance in amplitude and consequently interference visibility, leading to a drawback known as the “dark-spot problem” [34,36,37]. An alternative way to characterize complex media without using a reference field has been achieved through optimization algorithms, which search for complex vectors using intensity-only measurements under the assumption of a pure process matrix. Various phase retrieval techniques have been developed, such as ones using Bayesian approaches [38], generalized approximate message passing [39], alternating projection methods such as the Gerchberg-Saxton algorithm [40,41], and convex relaxations [4244].

A caveat of all these techniques that do not involve an external reference is that they do not completely characterize the transmission matrix. This is due to the fact that the intensity-only measurements used in these techniques cause phase ambiguity at the output, resulting in the relative phases between output modes being undefined [45]. While a TM obtained in this manner is sufficient for the majority of imaging applications through complex media [36,46], complete coherent information about the TM is necessary for many applications such as programmable photonic circuits [47] and in quantum information processing, where complex media are used to transport [14,15] and manipulate quantum states of light [1618]. An extension to phase-stepping holography allows us to characterize this output phase ambiguity by interfering with different optical modes after the scattering process [48]. Alternatively, phase diversity techniques can be applied to characterize this phase ambiguity by effectively measuring the output field intensity at different propagation distances in free space [29,49,50]. However, these methods are still subject to the dark-spot problem mentioned above or require full reconstruction of the optical field.

In recent years, artificial neural networks such as perceptrons and convolutional neural networks have been applied for tasks such as image reconstruction, classification, and transfer through a scattering medium [5155], demonstrating their potential for learning the transfer function of a scattering medium. While light scattering through a complex medium is a linear process, its measurement in intensity is non-linear, which makes it a suitable system to model within the framework of artificial neural networks. Incorporating the physical model that describes this scattering process into a neural network architecture is thus a clear contender for solving optimization problems [56]. Unlike general machine-learning models, physics-informed neural networks are models based on physical systems, and thus do not require treating the algorithm like a black box, but indeed a simplification tool. Recent advances in optics-based machine learning have not only led us towards enhanced control through linear complex media [46,57,58], but also useful applications in non-linear microscopy [59] and programming quantum gates [60].

In this work, we demonstrate the complete characterization of a coherent transmission matrix without the use of a reference field and the accompanying problem of dark spots by employing the use of physics-informed neural networks, referred to as a multi-plane neural network (MPNN). We do so by performing a set of random measurements before and after the complex medium, while probing only intensity at the output and subsequently feeding the data into a neural network designed to mimic the experimental apparatus. We demonstrate the improved accuracy of our technique by comparing the focusing efficiency achieved through the medium using the transmission matrices obtained with an MPNN to ones recovered with conventional phase-stepping (PS) holography. Furthermore, we investigate the number of measurements required to characterize the TM and show that while phase-stepping requires fewer measurements, the TM recovered with MPNN is much more accurate. We also show that our technique is significantly more noise-robust as compared to phase-stepping holography, allowing the recovery of a high fidelity TM in the presence of large amounts of noise. Finally, using a numerical simulation, we demonstrate the general usage of MPNNs for the TM measurement of a cascade of random scattering media.

2. Multi-plane neural network

We start by describing the model of a multi-layer optical network, where each complex medium is placed between reconfigurable phase-shifter planes, as illustrated in Fig. 1. The $k$-th layer of the optical system is composed of a reconfigurable phase-shifter plane represented by a vector $x_k$ and a complex medium with complex transmission matrix $T_k$. The intensity $y$ observed at the output detectors of an $n$-layered network for a uniform excitation at an input of the network is given by

$$\begin{aligned} y & = | \mathbf{F}\mathbf{P_{n+1}}\mathbf{T_n}\mathbf{P_{n}} \dots \mathbf{T_k}\mathbf{P_{k}} \dots \mathbf{T_2}\mathbf{P_2}\mathbf{T_1}x_1|^2,\\ & = | \mathbf{F}(x_{n+1}\odot(\mathbf{T_n}(x_{n}\odot \cdots\mathbf{T_2}(x_2 \odot \mathbf{T_1} x_1) \cdots)))|^2,\\ \end{aligned}$$
where $\odot$ represents an element-wise dot product, $\mathbf {T_k}$ is the transmission matrix of the $k^{\textrm {th}}$ complex medium in the network, $\mathbf {F}$ is a known complex matrix defined by the detection optics, and $\mathbf {P_{k}}=\text {diag}[x_k]$. Equation (1) also describes the neural network that models this optical process, where $\{x_i\}_i$ are $n$ input vectors, $\{\mathbf {T_i}\}_i$ are fully-connected “hidden” complex layers with a linear activation, and $\mathbf {F}$ is a fully-connected known complex layer with a non-linear activation of $|.|^2$ to simulate intensity measurements. Since the input layers are located at different planes in the network, we refer to this model as the multi-plane neural network (MPNN).

 figure: Fig. 1.

Fig. 1. Schematic of an $n$-layered optical network: a cascade of $n$ complex media denoted by their optical transmission matrices $T_k$ are separated by reconfigurable phase-shifter planes $x_k$, followed by detection optics $F$. This optical network can be represented as a multi-plane neural network (MPNN) with $n+1$ input layers of $x_k$, $n$ “hidden-layers” for $T_k$, and a known layer for $F$ with a $|.|^2$ activation.

Download Full Size | PDF

We train the described model on a measured randomized dataset using Tensorflow and Keras on Python. Tensorflow provides an open-source framework that makes models like these convenient to implement and extend [61]. Building models in this framework allowed us to use previous complex-value neural network layers developed in [46] and makes our work open to extensions using different optimization techniques. Our model is optimized using adaptive moment estimation, also referred to as the Adam optimizer [62], the mean-squared error loss of which is given by

$$MSE= \sum_i | \bar{y}_i -y_i |^2,$$
where $i$ represents the index of a point in the dataset, $\bar {y}$ is the predicted output from the model, and $y$ is the measured output. Once the loss function is decided, gradients to each weight in the layer $\mathbf {T_k}$ are calculated with respect to the MSE using a chain-rule governed by the back-propagation algorithm [63]. To ensure that the learning is efficient, we appropriately set the learning rate before beginning the optimization while also reducing it during the optimization if the loss plateaus. Post training, retrieving the weights of the layer $\mathbf {T_k}$ gives us the required transmission matrix of the $k^{\textrm {th}}$ complex medium.

3. Experimental method

In this work, we use the formalism of MPNN to measure the coherent transmission matrix of a multi-mode fiber using two programmable phase planes. The phase planes are implemented on spatial light modulators (SLMs) placed before and after the MMF to probe the optical fields propagating through the fiber using intensity-only measurements. A schematic of the setup is illustrated in Fig. 2(a), where light from a superluminescent diode is filtered by a spectral filter centered at 810 nm (FWHM 3.1 nm) and coupled into a 2m-long graded-index multi-mode fiber (MMF, Thorlabs-M116L02) sandwiched between two SLMs (Hamamatsu LCOS-X10468). The requirement of two separate SLMs is not strict for these measurements and the setup can be designed such that a single SLM is employed with a double reflection before and after the MMF. The MMF has a core size of 50 $\mu$m and supports approximately $200$ modes for each polarization at 810 nm wavelength. The telescopes placed between each component in the setup are designed such that the incident beam covers a large area on each SLM. In this particular experiment, we only control a single polarization channel of the MMF. However, the techniques discussed here can be equivalently applied to characterize both polarization channels simultaneously.

 figure: Fig. 2.

Fig. 2. (a) Experiment: Light from a superluminescent diode is filtered by a 3.1 nm spectral filter centered at 810 nm, modulated by a random phase pattern displayed on a spatial light modulator (SLM$_1$) and coupled into a multi-mode fiber (MMF) which is the complex medium of interest. The output speckle field from the MMF is projected onto SLM$_2$, which displays another random phase pattern, and is then detected on a CMOS camera which is placed in the Fourier plane of SLM$_2$. (b) The phase patterns on the SLM are constructed in a specific macro-pixel basis with varying pixel size based on the incident field distribution. Intensities of the output modes are recorded at a given set of points at the camera, which enclose an area corresponding to the MMF core. (c) The physics-informed neural network consists of two input layers encoding phase patterns on SLM$_1$ and SLM$_2$, and a single output layer encoding the intensity pattern of the output speckle in the given basis. The hidden layer $T$ denotes the complex transformation between SLM$_1$ and SLM$_2$ in the macro-pixel bases, while $F$ is a known layer corresponding to the $2f$-lens system between SLM$_2$ and the camera.

Download Full Size | PDF

We choose to work in the so-called macro-pixel basis, which consists of groups of SLM pixels chosen such that the intensity per macro-pixel is approximately equal (Fig. 2(b)). $\textrm {SLM}_{1}$ is used to prepare a set of input modes in the macro-pixel basis and $\textrm {SLM}_{2}$ in combination with a CMOS camera (XIMEA-xiC USB3.1) allows us to perform measurements on the output modes of the fiber. $\mathbf {T}$ denotes the optical transmission matrix between $\textrm {SLM}_{1}$ and $\textrm {SLM}_{2}$ in the macro-pixel basis. The field after $\textrm {SLM}_{2}$ is given by the element-wise product $x_2 \odot \mathbf {T} x_1$, where $x_{1(2)}$ is a vector corresponding to the hologram implemented on $\textrm {SLM}_{1(2)}$. Finally, this field is incident on the camera placed in the Fourier plane of $\textrm {SLM}_{2}$ using the transfer matrix $\mathbf {F}$. The resultant measurement intensity $y$ on the camera is given by

$$y= | \mathbf{F} (x_2 \odot \mathbf{T} x_1) |^2,$$
which describes a single-layered MPNN model. A set of random measurements are performed on the setup to generate a dataset for this model. A number of holograms are generated with phases randomly varied following a uniform distribution. These holograms are displayed on each SLM and the corresponding intensities measured by the CMOS camera are recorded. We sample the intensity of the field at the camera at positions separated by a distance corresponding to the size of a diffraction-limited spot (as shown in Fig. 2(b)) calculated using the effective focal length of the lens system (L$_{9-11}$) between $\textrm {SLM}_{2}$ and the camera.

While we expect the TM to have a dimension of approximately $200 \times 200$ based on fiber specifications, we oversample our measurements at the SLM planes to capture any optical misalignment and aberrations in the experiment [58] . Thus, we use 800 macro-pixels at the input phase plane ($\textrm {SLM}_{1}$) and 832 macro-pixels at the output phase plane ($\textrm {SLM}_{2}$) to perform the measurements. In principle, any modal basis can be used to perform these measurements, however, we choose the macro-pixel basis because it provides accurate and low-loss phase-only modulation on the SLM. Since the intensity sampling at the camera plane is limited by the resolution of the optical system, we only use $367$ sampling points here. We train our model containing an $800\times 832$ dimensional TM. The optimization is carried out in multiple batches with a batch size of 500 samples, and is accelerated using a GPU (NVIDIA GeForce RTX 3060). We observe good convergence in about 20 epochs of training, as plotted in the training and test metrics shown in Fig. 3(a-b). Retrieving the weights from the hidden layer of the model recovers the measured TM in the macro-pixel basis, as visualized in Fig. 3(c), and can be shown using Dataset 1 (Ref. [64]). Transforming this TM to the Laguerre-Gauss modal basis with 20 mode groups reveals a block-diagonal structure with crosstalk as shown in Fig. 3(d), which is a typical structure expected for a graded-index multi-mode fiber [48,65].

 figure: Fig. 3.

Fig. 3. Learning the coherent transmission matrix of a multi-mode fiber: Test and train (a) loss and (b) mean-absolute percentage error (MAPE) as learning proceeds in epochs. A visual plot of the learnt transmission matrix in the (c) macro-pixel and (d) Laguerre-Gauss (LG) basis, where modes are segregated into respective mode groups. In order to change the basis of the TM from (c) to (d), a transformation matrix $B_{\textrm {Pix} \rightarrow \textrm {LG}}$ is constructed that maps the set of macro-pixel modes to a set of LG modes. The TM in the LG basis is calculated as $T_{\textrm {LG}} = B_{\textrm {Pix} \rightarrow \textrm {LG}}^\dagger T_{\textrm {Pix}} B_{\textrm {Pix} \rightarrow \textrm {LG}}$.

Download Full Size | PDF

4. Results and discussions

4.1 Accuracy of the measured transmission matrix

To verify the accuracy of the measured transmission matrix, we perform an optical phase conjugation (OPC) experiment to focus light from a given input mode into a particular output mode after scattering through the fibre, by displaying a phase-only solution on the first plane (SLM$_1$), the second plane (SLM$_2$), and both planes simultaneously. The focusing efficiency obtained by controlling light using only the first plane or the second plane allows us to assess the quality of individual rows and columns of the transmission matrix, respectively. On the other hand, manipulating light using both planes allows us to assess the overall quality of the measured transmission matrix. The focusing efficiency is defined as the ratio of intensity measured in a diffraction-limited spot at the camera to that measured in an output region that is 1.75 times the area corresponding to the output facet of the fiber. This is to capture any light that is diffracted outside the output facet due to phase modulation at SLM$_2$.

We measure the TM of a multi-mode fiber using the MPNN technique and compare it with one measured using the phase-stepping (PS) technique. It is important to note that the PS technique can be used to measure the full coherent TM without output-phase ambiguity via a two-step process as follows. In the first step, we measure the joint transfer matrix of the fiber and $2f$-lens system ambiguous to the reference field, i.e. $D~\mathbf {F}~\mathbf {T}$, where $D$ is an arbitrary diagonal matrix owing to the unknown reference field. We do so by displaying the superposition of each input mode with the chosen reference mode at $\textrm {SLM}_{1 }$ and varying their relative phase in multiple steps, while measuring the intensity at each output spot. The output fields at a particular input mode are then reconstructed by using the Fourier-Transform reconstruction algorithm [45,66]. In order to maximize the intensity per mode, we choose the input basis to be a discrete Fourier transform of the macro-pixel basis with the $50^{\text {th}}$ mode chosen as the reference. The second step involves the reconstruction of the input reference field $D$. To do so, we send the input reference mode using $\textrm {SLM}_{1}$ and perform the phase-stepping technique using $\textrm {SLM}_{2}$. We prepare the superpositions of each output mode with the chosen output reference mode on $\textrm {SLM}_{2}$ and vary their relative phase in multiple steps. The measured intensity at the camera thus allows us to reconstruct the matrix $D$ corresponding to the input reference field in a similar manner as the first step above. It should be noted that the knowledge of $D$ is unnecessary for many applications such as imaging through complex media as well as the aforementioned experiment on performing OPC using only the first plane. All measured data with PS is processed using the Fourier transform reconstruction algorithm [67,68].

In the case of performing OPC with only the first plane ($\textrm {SLM}_{1}$), the focusing efficiency achieved at different points across the output facet of the fiber using a TM obtained via the PS and MPNN techniques is shown in Fig. 4(a). Using the PS technique, we observe a reduction of the focusing efficiency at several output points due to the dark-spot problem [34]. This is due to the nature of speckle that results in a high probability of obtaining very low output intensities of the internal reference mode after scattering through the fiber. This leads to low interference visibility in the PS technique for specific output modes, which consequently results in inaccuracy in the reconstructed transmission matrix at these outputs. This problem is solved by measuring the transmission matrix with the MPNN technique, as this does not involve a static reference mode but instead uses many random inputs to probe the scattering process.

 figure: Fig. 4.

Fig. 4. A comparison of optical phase conjugation (OPC) performed with only the first phase plane: (a) Focusing efficiencies achieved at different points across the output facet of the multi-mode fiber using the phase-stepping (PS) and multi-plane neural network (MPNN) techniques. Output points with low focusing efficiency due to the inherent dark-spot problem can be seen in the PS method (see text for more details). In contrast, focusing efficiencies obtained with the MPNN method show more uniformity across the fiber facet. Log-scale images of light focused using the first SLM with a TM obtained by the (b) PS and (c) MPNN techniques. (d) A histogram comparing the PS and MPNN methods shows that the PS technique leads to some output points with very low focusing efficiencies, while those obtained with MPNN method are more uniform and significantly higher. Note: Only the points within first 80% diameter of the core from the center of the core are taken for the purpose of this histogram.

Download Full Size | PDF

The more uniform focusing efficiency achieved across the output facet of the fiber with the MPNN technique is clearly illustrated in Fig. 4(a). A histogram of focusing efficiencies achieved with these two methods (Fig. 4(b)) shows that low focusing efficiencies are only observed with the PS technique, while the MPNN technique achieves a significantly higher maximum focusing efficiency.

While improved control with the first phase plane using various optimization techniques has been well studied in many previous works, one of the chief merits of our approach lies in the simultaneous measurement of relative phases between the rows of the transmission matrix, i.e., the coherence between output modes. We assess the accuracy of the reconstructed relative phase between output modes by performing an OPC experiment to focus light by using only the second phase plane ($\textrm {SLM}_{2}$). Log-scale images of a focused spot at the centre of the output facet of the fiber using the two techniques are compared in Fig. 5(a),(b). We observe the suppression of unwanted speckle background when using a TM obtained with the MPNN technique as compared to the one acquired using the PS technique. By measuring the focusing efficiencies for different input modes, the overall enhancement obtained with the MPNN technique as compared to the PS technique is evident in Fig. 5(c). The average focusing efficiency using the second plane increases from $26.5 \pm 2.3 {\%}$ using PS to $40.8 \pm 1.7 {\%}$ using MPNN. The underlying reason for this improvement is that learning the TM with the MPNN technique does not require a static internal reference mode, whereas the use of the fixed internal reference mode in the PS technique results in errors at particular outputs due to the dark-spot problem.

 figure: Fig. 5.

Fig. 5. (a)-(c) A comparison of optical phase conjugation (OPC) performed with only the second phase plane. Log-scale images of light focused using only $\textrm {SLM}_{2}$ with a TM obtained by the (a) phase-stepping and (b) multi-plane neural network techniques. (c) A histogram comparing second-plane focusing efficiencies of 50 random input modes achieved with the PS and MPNN techniques shows a marked improvement with the latter. (d)-(f) A comparison of OPC performed with both phase planes. Log-scale images of light focused using both SLMs with a TM obtained by the (d) PS and (e) MPNN techniques. (f) A histogram comparing focusing efficiencies for all output modes achieved with both planes shows a significant improvement with MPNN over the PS technique. The focusing efficiencies are corrected for SLM$_2$ basis-dependent loss for both methods.

Download Full Size | PDF

As discussed above, a complete characterization of the transmission matrix including relative phases between its rows and columns is essential for coherent control of optical fields propagating through a complex medium. To examine this coherent control, we perform an OPC experiment by focusing light at different output spots by simultaneously utilising both the phase planes at hand. The solution of phase patterns for focusing is determined using an iterative wavefront-matching technique [69,70]. As seen in the log-scale images shown in Fig. 5(d),(e), light focused using both phase planes with a TM acquired using the MPNN method has significantly less speckle background as compared to one acquired with the phase-stepping technique. Quantitatively, we are able to achieve an average focusing efficiency of $65.5 \pm 2.5 {\%}$ (up to a maximum of $73.8 {\%}$) with both planes using the MPNN technique (Fig. 5(f)). This is a substantial increase with respect to that achieved with the PS technique, where we observe an average focusing efficiency of $42.4 \pm 3.1 {\%}$ (up to a maximum of $46.7{\%}$). This result also demonstrates the increase in focusing efficiency achievable with two-plane control as compared to individual phase planes, which makes it particularly suitable for usage in applications such as programmable optical circuits [17,18].

The maximum achievable focusing efficiencies (ideal simulations) shown in Figs. 4 and 5 are numerically calculated by taking into account the presence of polarization coupling in the fiber. The fiber TM is represented by a truncated random unitary matrix considering that only one polarization channel is measured and controlled. This results in the lowest achievable maximum focusing efficiency when only the first phase plane (SLM$_1$) is used for control. When only the second phase plane (SLM$_2$) is used, the maximum focusing efficiency is increased as compared to the first case. The third case entails both phase planes being used together to focus light through the MMF. As there is much more phase control achievable using the two phase planes, one can now focus light with much higher efficiency as compared to the previous two cases.

4.2 Efficiency of learning with the MPNN technique

In this section, we study the number of measurements required to obtain an optimal TM using the MPNN technique as compared to the PS technique. First, we experimentally evaluate this by performing an OPC experiment with the fiber TM reconstructed with different dataset sizes. For the MPNN method, the size of the dataset ($\alpha$) is quantified by the total number of intensity-only measurements performed on the camera divided by the number of input modes that characterize the TM. For the PS method, this quantity is a bit more nuanced since we only control the number of phase steps ($n_{\phi }$) within the interval $[0, 2\pi ]$ that are used per mode. As described in section 3, the first part of the PS method requires $(n_{\text {in}}-1)n_{\phi } + 1$ measurements, where $n_{\text {in}}$ is the number of input modes and the additional measurement corresponds to that of the reference itself. The second part requires $(n_{\text {out}}-1)n_{\phi }$ measurements where $n_{\text {out}}$ is the number of output modes measured on the camera. The total number of measurements for each input mode is then given by $\alpha \approx [(n_{\text {in}}+n_{\text {out}}) / n_{\text {in}}]n_{\phi }$. In our experiment, $n_{\text {in}}=800$ and $n_{\text {out}}=367$, which gives $\alpha \approx 1.46 n_{\phi }$ for the PS technique. In this manner, the parameter $\alpha$ corresponds to the total number of measurements performed per mode in both the MPNN and the PS technique.

The focusing efficiency achieved with a TM obtained via the MPNN and PS techniques as a function of the number of measurements per mode ($\alpha$) is plotted in Fig. 6 for all three cases—focusing with the first, the second, and both planes. For the PS technique, the focusing efficiency is seen to converge to its maximum value at $\alpha \sim 6-8$, which is close to the minimum required number of phase steps ($n_\phi \sim 3-4$) [71]. However, it reaches a plateau after this and cannot be improved further owing to the presence of noise in the experiment [7274]. In contrast, the focusing efficiency obtained via the MPNN technique is seen to converge at a higher number of measurements per mode ($\alpha =20$). However, in all three OPC cases, the maximum efficiency achieved with MPNN is higher than that achieved with the PS technique— $46.5{\%}$, $43.6{\%}$, and $73.8{\%}$ versus $45.0{\%}$, $30.2{\%}$, $46.7{\%}$ with the first, second, and both planes respectively. In particular, the maximum efficiency is significantly higher when focusing with only the second or both phase planes—cases where complete coherent information of the TM plays a critical role. Thus, while the MPNN technique takes longer to learn a given TM, the reconstructed TM is more accurate, as quantified by the focusing efficiency achieved through it. One should note that the number of measurements can also be reduced by incorporating the underlying physical model of a multimode optical fiber [57,58].

 figure: Fig. 6.

Fig. 6. Number of measurements required for optimal transmission matrix measurement: Experimental focusing efficiency achieved with (a) the first phase plane, (b) the second phase plane, and (c) both planes simultaneously, using a TM reconstructed with the phase-stepping technique and the multi-plane neural network (MPNN) plotted as a function of the number of measurements per input mode ($\alpha$). While phase-stepping converges to a maximum faster, the MPNN technique shows a much higher focusing efficiency.

Download Full Size | PDF

4.3 Noise-robustness of the MPNN technique

From the previous sections, it is clear that one of the advantages of the MPNN technique is improved performance over the PS technique in the presence of noise. While our experiment studies one specific case, here we quantify this improvement by simulating the effects of different noise levels on both techniques. An $800\times 800$-dimensional random unitary matrix is chosen as our ground truth TM and intensity measurements using the PS and MPNN techniques are simulated while varying the number of measurements per mode ($\alpha$). Noise in the measurement is modelled as additive white Gaussian noise on the readout intensity

$$\tilde{y}_{\sigma}= | \mathbf{F} (x_2 \odot \mathbf{T} x_1) |^2 + \mathcal{N}({\mu},\sigma),$$
where $\mathcal {N}(\mu,\sigma )$ is additive white Gaussian noise with mean $\mu =0$ and standard deviation $\sigma$. Such a noise model is standard and includes the effects of multiple random processes such as thermal noise [75,76]. The signal-to-noise ratio (SNR) is the ratio of the average norm of the signal intensity to the noise standard deviation, i.e. SNR $= \overline {|y|}/\sigma$ [77]. It should be noted that each simulated data point is normalized to unity before adding noise, which simply implies an SNR of $1/\sigma$ in our model. The fidelity between the recovered TM ($\widetilde {\mathbf {T}}$) and the ground-truth TM ($\mathbf {T}$) is calculated as $\mathcal {F}(\mathbf {T})=|{\textrm{Tr}}(\widetilde {\mathbf {T}}^{\dagger }{\mathbf {T}})|^2/({\textrm{Tr}}(\widetilde {\mathbf {T}}^{\dagger }\widetilde {\mathbf {T}})\textrm{Tr}(\mathbf {T}^{\dagger }{\mathbf {T}}))$. The fidelity is sensitive to the relative phases between rows and columns of the TM, and thus only reaches its maximum value of 1 when complete coherent information about the TM is present.

The TM fidelity as a function of the number of measurements ($\alpha$) is plotted in Fig. 7 for different levels of noise. In these simulations, we choose $n_{\text {in}}= n_{\text {out}} = 800$, which implies that $\alpha \approx 2 n_\phi$ for the PS technique. In the noiseless case (SNR = inf), the PS technique converges to perfect fidelity at $\alpha \approx 6$, while the MPNN technique requires $\alpha \approx 10$ to do the same. Note that in this case, the PS technique is able to reach perfect fidelity regardless of the dark spot problem, as even the smallest interference signal provides complete information with no noise present. As the SNR decreases, the maximum fidelity achievable via the PS technique drops rapidly and is unable to reach perfect fidelity regardless of the number of measurements used. For example, even with a small amount of noise (SNR=20), the PS technique can only recover a TM with fidelity less than 60%. In contrast, the MPNN technique is able to achieve very high fidelity in the presence of noise. As can be seen in Fig. 7(b), we are able to recover a high-quality TM ($\mathcal {F}=80.87{\%}$) even when the SNR is as low as 0.8, with fidelity gradually increasing with the number of measurements per mode ($\alpha$). This highlights the significantly improved noise-robustness of the MPNN technique in comparison to the PS technique.

 figure: Fig. 7.

Fig. 7. Noise-robustness of MPNN: A simulated $800 \times 800$- dimensional TM is recovered using the (a) phase-stepping (PS) and (b) multi-plane neural network (MPNN) techniques for different additive gaussian noise levels. The fidelity of the recovered TM is plotted as a function of the number of measurements per input mode ($\alpha$). While the PS technique quickly degrades in the presence of noise, the MPNN technique is able to reach high fidelities with small increases in the number of measurements.

Download Full Size | PDF

Despite using the most noise-resilient algorithm to reconstruct the TM using PS data [66], the MPNN method significantly outperforms the PS technique. This is because the MPNN method uses a large part of the dataset together to minimize the defined loss function in Eq. (2). The addition of noise to the data merely adds local minima to the loss function across the optimization space, leaving the global minimum largely untouched. In contrast, PS relies on processing each data point individually over the $n_{\phi }$ phase-steps as a sinusoid, where the addition of noise can severely impact the visibility and phase reconstruction.

These results may give the impression that PS is an unreliable technique, however, this is untrue because here we have pushed the technique to its very limits. The SNR range where we perform our simulations is far below previously tested ranges with PS [74,78], demonstrating a much superior noise resilience of MPNN. Moreover, fidelity is a very unforgiving metric as it requires both the relative phases and amplitudes of both rows and columns of the TM to be well reconstructed for it to be high. A poor quality TM recovered using PS with an SNR=5 and fidelity of 17%, can still control light with first and second planes with about 17% and 13% focusing efficiencies respectively. Nonetheless, such a metric is critical in high-precision applications such as programmable quantum gates [18], where slight errors in knowledge of the TM can drastically affect the performance of the entire experiment.

In practice, our experimental results are still far from ideal as estimated by the numerical simulations (black dotted lines in Figs. 4 and 5). This deviation from the ideal might originate from a variety of imperfections in the experiment that are not explained by the simple model of noise studied here. These include imperfections such as the choice of basis, instability of light source, phase instability of SLM, temperature-dependent movement of optomechanics and the optical medium, or linearity of the camera. Many of these issues can be addressed and improved in the experiment to achieve perfect focusing efficiency [7981], and could be combined with the MPNN technique presented here.

4.4 Learning multiple transmission matrices with an MPNN

In this section, we demonstrate how the MPNN technique can be used to reconstruct a complex optical network characterized by a series of transmission matrices, as described in Eq. (1). Furthermore, we show how knowledge of each individual TM allows us to focus light at any intermediate plane within this network. As shown in Fig. 8(a), we simulate a cascade of three $16\times 16$-dimensional TMs ($\textrm {T}_i$) separated by programmable phase planes. These are followed by a known mode-mixer (F) that performs a 2D discrete Fourier transform. By randomly modulating the phase planes and performing intensity measurements after F, the MPNN technique is able to fully characterize this optical network. The recovered TM of each medium as well as the training loss are shown in Fig. 8(b),(c).

 figure: Fig. 8.

Fig. 8. Learning multiple transmission matrices with an MPNN: (a) A cascade of multiple complex media interspersed by programmable phase planes can be characterized through the use of the MPNN technique. The recovered TMs corresponding to these media allows us to control the propagation of light at intermediate phase planes. (b) Numerical simulation showing three $16 \times 16$-dimensional TMs reconstructed with the MPNN technique, which are then used to focus light at each intermediate plane (insets). (c) Optimization loss using the training and testing datasets for the three-plane MPNN.

Download Full Size | PDF

A multi-plane network such as this can allow us to not only control light through the whole system, but also control light at intermediate planes as conceptualized in Fig. 8(a). As an example, we simulate optical phase conjugation using the three recovered TMs to focus light at each intermediate plane in the system. Insets in Fig. 8(b) show the focused image obtained at each plane in the trained network using the preceding phase planes. The size of the dataset required to characterize this network with dimension $3\times 16 \times 16$ is $\alpha \approx 10^5$ as also shown in Dataset 1 (Ref. [64]) . However, this is not the minimal size required to train this model and strongly depends on the training parameters. Further tuning and investigations can lead to a better understanding of how the minimum dataset size required to train this model varies with the number of planes and dimensions of the TM. Nonethless, it should be noted that the model for such a neural network is very complex and to our knowledge, MPNN is the only known method to date that can perform such a task.

A caveat of our technique is that the measured TM of two consecutive complex media in the series can be ambiguous up to a diagonal matrix on either side. In a cascade of complex media $T_i$ separated by planes $P_i$, taking a single layer at the $k^{\text {th}}$ plane,

$$T_k P_k T_{k-1} = T_k D P_k D^{{-}1} T_{k-1} = \widetilde{T}_k P_k \widetilde{T}_{k-1}$$
where $D$ is a diagonal matrix. Due to the commutation relation between $D, P_k$ and $D^{-1}$, the MPNN technique measures the TM of an individual complex medium up to such an ambiguity in the TMs of consecutive media, for example, the presence of equal but opposite diagonal phases. We anticipate that this ambiguity affects the training since few elements in $D$ can acquire high amplitudes leading to over-fitting, however, this problem can be tackled using suitable regularizers. Importantly, this ambiguity between $T_k$ and $\widetilde {T}_k$ does not affect the description of the overall cascaded optical network.

Systems consisting of cascaded programmable phase planes separated by optical media are fast gaining popularity, with recent work demonstrating their use in a variety of applications such as spatial mode-sorting [70,82,83], projective measurements [84], unambiguous state discrimination [85], programmable quantum circuits [18,86,87], and optical neural networks [11,88]. In all these implementations, accurate knowledge of the optical system between phase planes is critical to their performance. While free-space propagation is relatively straightforward to model, aberrations arising from the devices and elements used can introduce significant design challenges. By enabling the characterization of a cascade of independent TMs, the MPNN technique provides a way to more accurately design such multi-plane devices, with uses ranging from classical to quantum optical networks.

5. Conclusion

In this work, we have conceptualized and experimentally demonstrated a method to characterize a complex medium, or a network of complex media, using physics-informed neural networks, referred to as multi-plane neural networks (MPNN). We apply the proposed MPNN technique to measure the full coherent transmission matrix of a multi-mode fiber without the need for an external reference field. The key idea behind the measurement is to randomly modulate phases of optical fields both before and after the fiber and measure the intensity-only outcomes to form a dataset. The trained model produces a transmission matrix capable of controlling light by manipulating fields not only before the multi-mode fiber, but also after it, which relies on the complete coherent information of the obtained transmission matrix without the problem of dark spots and output-phase ambiguity. We demonstrate accurate control of optical modes through a multi-mode fiber using the MPNN method, with a significant improvement over the phase-stepping technique in terms of focusing efficiency and noise-robustness. Finally, we show the capability of this technique to learn more complex systems such as a cascade of transmission matrices interspersed with multiple phase planes and discuss possible applications. Our technique will allow for accurate control of coherent light not only through complex media but also through complex optical networks, with applications ranging from optical communication systems to biomedical imaging.

Funding

European Research Council (950402); Engineering and Physical Sciences Research Council (EP/P024114/1); Austrian Science Fund (I3773-N36); Royal Academy of Engineering (CiET-2223-112).

Acknowledgments

We would like to thank Dr. Will McCutcheon and Mayuna Gupta for fruitful discussions. The scientific color maps batlow and romaO [89] are used in this study to prevent visual distortion of the data and exclusion of readers with color vision deficiencies [90].

Disclosures

The authors declare no conflicts of interest.

Data availability

Data underlying the results presented in this paper are available in Dataset 1 (Ref. [64]).

References

1. M. Kim, W. Choi, Y. Choi, C. Yoon, and W. Choi, “Transmission matrix of a scattering medium and its applications in biophotonics,” Opt. Express 23(10), 12648–12668 (2015). [CrossRef]  

2. S. Rotter and S. Gigan, “Light fields in complex media: mesoscopic scattering meets wave control,” Rev. Mod. Phys. 89(1), 015005 (2017). [CrossRef]  

3. G. Wetzstein, A. Ozcan, S. Gigan, S. Fan, D. Englund, M. Soljačić, C. Denz, D. A. B. Miller, and D. Psaltis, “Inference in artificial intelligence with deep optics and photonics,” Nature 588(7836), 39–47 (2020). [CrossRef]  

4. H. Cao and Y. Eliezer, “Harnessing disorder for photonic device applications,” Appl. Phys. Rev. 9(1), 011309 (2022). [CrossRef]  

5. I. M. Vellekoop and A. P. Mosk, “Focusing coherent light through opaque strongly scattering media,” Opt. Lett. 32(16), 2309 (2007). [CrossRef]  

6. S. M. Popoff, G. Lerosey, R. Carminati, M. Fink, A. C. Boccara, and S. Gigan, “Measuring the Transmission Matrix in Optics : An Approach to the Study and Control of Light Propagation in Disordered Media,” Phys. Rev. Lett. 104(10), 100601 (2010). [CrossRef]  

7. Y. Choi, C. Yoon, M. Kim, T. D. Yang, C. Fang-Yen, R. R. Dasari, K. J. Lee, and W. Choi, “Scanner-Free and Wide-Field Endoscopic Imaging by Using a Single Multimode Optical Fiber,” Phys. Rev. Lett. 109(20), 203901 (2012). [CrossRef]  

8. J. Bertolotti and O. Katz, “Imaging in complex media,” Nat. Phys. 18(9), 1008–1017 (2022). [CrossRef]  

9. J. Carpenter, B. C. Thomsen, and T. D. Wilkinson, “Degenerate Mode-Group Division Multiplexing,” J. Lightwave Technol. 30(24), 3946–3952 (2012). [CrossRef]  

10. N. H. Farhat, D. Psaltis, A. Prata, and E. Paek, “Optical implementation of the Hopfield model,” Appl. Opt. 24(10), 1469 (1985). [CrossRef]  

11. X. Lin, Y. Rivenson, N. T. Yardimci, M. Veli, Y. Luo, M. Jarrahi, and A. Ozcan, “All-optical machine learning using diffractive deep neural networks,” Science 361(6406), 1004–1008 (2018). [CrossRef]  

12. U. Teğin, M. Yıldırım, İ. Oğuz, C. Moser, and D. Psaltis, “Scalable optical learning operator,” Nat. Comput. Sci. 1(8), 542–549 (2021). [CrossRef]  

13. L. G. Wright, T. Onodera, M. M. Stein, T. Wang, D. T. Schachter, Z. Hu, and P. L. McMahon, “Deep physical neural networks trained with backpropagation,” Nature 601(7894), 549–555 (2022). [CrossRef]  

14. W. Löffler, T. G. Euser, E. R. Eliel, M. Scharrer, P. S. J. Russell, and J. P. Woerdman, “Fiber Transport of Spatially Entangled Photons,” Phys. Rev. Lett. 106(24), 240505 (2011). [CrossRef]  

15. N. H. Valencia, S. Goel, W. McCutcheon, H. Defienne, and M. Malik, “Unscrambling entanglement through a complex medium,” Nat. Phys. 16(11), 1112–1116 (2020). [CrossRef]  

16. H. Defienne, M. Barbieri, I. A. Walmsley, B. J. Smith, and S. Gigan, “Two-photon quantum walk in a multimode fiber,” Sci. Adv. 2(1), e1501054 (2016). [CrossRef]  

17. S. Leedumrongwatthanakun, L. Innocenti, H. Defienne, T. Juffmann, A. Ferraro, M. Paternostro, and S. Gigan, “Programmable linear quantum networks with a multimode fibre,” Nat. Photonics 14(3), 139–142 (2020). [CrossRef]  

18. S. Goel, S. Leedumrongwatthanakun, N. H. Valencia, W. McCutcheon, C. Conti, P. W. H. Pinkse, and M. Malik, “Inverse-design of high-dimensional quantum optical circuits in a complex medium,” arXiv, arXiv:2204.00578 (2022). [CrossRef]  

19. A. Peres, “Construction of unitary matrices from observable transition probabilities,” Nuclear Phys. B - Proc. Suppl. 6, 243–245 (1989). [CrossRef]  

20. R. Di Leonardo and S. Bianchi, “Hologram transmission through multi-mode optical fibers,” Opt. Express 19(1), 247 (2011). [CrossRef]  

21. S. Rahimi-Keshari, M. A. Broome, R. Fickler, A. Fedrizzi, T. C. Ralph, and A. G. White, “Direct characterization of linear-optical networks,” Opt. Express 21(11), 13450 (2013). [CrossRef]  

22. A. Laing and J. L. O’Brien, “Super-stable tomography of any linear optical device,” arXiv, arXiv:1208.2868 (2012). [CrossRef]  

23. Y. Choi, T. R. Hillman, W. Choi, N. Lue, R. R. Dasari, P. T. C. So, W. Choi, and Z. Yaqoob, “Measurement of the Time-Resolved Reflection Matrix for Enhancing Light Energy Delivery into a Scattering Medium,” Phys. Rev. Lett. 111(24), 243901 (2013). [CrossRef]  

24. B. Gérardin, J. Laurent, A. Derode, C. Prada, and A. Aubry, “Full Transmission and Reflection of Waves Propagating through a Maze of Disorder,” Phys. Rev. Lett. 113(17), 173901 (2014). [CrossRef]  

25. H. Yu, J.-H. Park, and Y. Park, “Measuring large optical reflection matrices of turbid media,” Opt. Commun. 352, 33–38 (2015). [CrossRef]  

26. M. Tillmann, C. Schmidt, and P. Walther, “On unitary reconstruction of linear optical networks,” J. Opt. 18(11), 114002 (2016). [CrossRef]  

27. I. Dhand, A. Khalid, H. Lu, and B. C. Sanders, “Accurate and precise characterization of linear optical interferometers,” J. Opt. 18(3), 035204 (2016). [CrossRef]  

28. N. Spagnolo, E. Maiorino, C. Vitelli, M. Bentivegna, A. Crespi, R. Ramponi, P. Mataloni, R. Osellame, and F. Sciarrino, “Learning an unknown transformation via a genetic approach,” Sci. Rep. 7(1), 14316 (2017). [CrossRef]  

29. G. S. D. Gordon, R. Mouthaan, T. D. Wilkinson, and S. E. Bohndiek, “Coherent Imaging Through Multicore Fibres With Applications in Endoscopy,” J. Lightwave Technol. 37(22), 5733–5745 (2019). [CrossRef]  

30. G. S. D. Gordon, M. Gataric, A. G. C. P. Ramos, R. Mouthaan, C. Williams, J. Yoon, T. D. Wilkinson, and S. E. Bohndiek, “Characterizing Optical Fiber Transmission Matrices Using Metasurface Reflector Stacks for Lensless Imaging without Distal Access,” Phys. Rev. X 9(4), 041050 (2019). [CrossRef]  

31. L. Valzania and S. Gigan, “Online learning of the transmission matrix of dynamic scattering media,” Optica 10(6), 708–716 (2023). [CrossRef]  

32. I. Yamaguchi and T. Zhang, “Phase-shifting digital holography,” Opt. Lett. 22(16), 1268 (1997). [CrossRef]  

33. E. Cuche, P. Marquet, and C. Depeursinge, “Spatial filtering for zero-order and twin-image elimination in digital off-axis holography,” Appl. Opt. 39(23), 4070 (2000). [CrossRef]  

34. T. Cižmár and K. Dholakia, “Shaping the light transmission through a multimode optical fibre: complex transformation analysis and applications in biophotonics,” Opt. Express 19(20), 18871 (2011). [CrossRef]  

35. M. Plöschner, T. Tyc, and T. Cižmár, “Seeing through chaos in multimode fibres,” Nat. Photonics 9(8), 529–535 (2015). [CrossRef]  

36. S. Popoff, G. Lerosey, M. Fink, A. C. Boccara, and S. Gigan, “Image transmission through an opaque material,” Nat. Commun. 1(1), 81 (2010). [CrossRef]  

37. M. Hofer and S. Brasselet, “Manipulating the transmission matrix of scattering media for nonlinear imaging beyond the memory effect,” Opt. Lett. 44(9), 2137–2140 (2019). [CrossRef]  

38. A. Drémeau, A. Liutkus, D. Martina, O. Katz, C. Schülke, F. Krzakala, S. Gigan, and L. Daudet, “Reference-less measurement of the transmission matrix of a highly scattering material using a DMD and phase retrieval techniques,” Opt. Express 23(9), 11898 (2015). [CrossRef]  

39. P. Schniter and S. Rangan, “Compressive Phase Retrieval via Generalized Approximate Message Passing,” IEEE Trans. Signal Process. 63(4), 1043–1055 (2015). [CrossRef]  

40. G. Huang, D. Wu, J. Luo, L. Lu, F. Li, Y. Shen, and Z. Li, “Generalizing the Gerchberg–Saxton algorithm for retrieving complex optical transmission matrices,” Photonics Res. 9(1), 34 (2021). [CrossRef]  

41. D. Ancora, L. Dominici, A. Gianfrate, P. Cazzato, M. De Giorgi, D. Ballarini, D. Sanvitto, and L. Leuzzi, “Speckle spatial correlations aiding optical transmission matrix retrieval: the smoothed gerchberg–saxton single-iteration algorithm,” Photonics Res. 10(10), 2349 (2022). [CrossRef]  

42. M. N’Gom, M.-B. Lien, N. M. Estakhri, T. B. Norris, E. Michielssen, and R. R. Nadakuditi, “Controlling Light Transmission Through Highly Scattering Media Using Semi-Definite Programming as a Phase Retrieval Computation Method,” Sci. Rep. 7(1), 2518 (2017). [CrossRef]  

43. M. N’Gom, T. B. Norris, E. Michielssen, and R. R. Nadakuditi, “Mode Control in a Multimode Fiber Through Acquiring its Transmission Matrix from a Reference-less Optical System,” Opt. Lett. 43(3), 419 (2018). [CrossRef]  

44. D. Suess, N. Maraviglia, R. Kueng, A. Maïnos, C. Sparrow, T. Hashimoto, N. Matsuda, D. Gross, and A. Laing, “Rapid characterisation of linear-optical networks via PhaseLift,” arXiv, arXiv:2010.00517 (2020). [CrossRef]  

45. S. M. Popoff, G. Lerosey, M. Fink, A. C. Boccara, and S. Gigan, “Controlling Light Through Optical Disordered Media : Transmission Matrix Approach,” New J. Phys. 13(12), 123021 (2011). [CrossRef]  

46. P. Caramazza, O. Moran, R. Murray-Smith, and D. Faccio, “Transmission of natural scene images through a multimode fibre,” Nat. Commun. 10(1), 2029 (2019). [CrossRef]  

47. M. W. Matthes, P. del Hougne, J. de Rosny, G. Lerosey, and S. M. Popoff, “Optical complex media as universal reconfigurable linear operators,” Optica 6(4), 465 (2019). [CrossRef]  

48. J. Carpenter, B. J. Eggleton, and J. Schröder, “110x110 optical mode transfer matrix inversion,” Opt. Express 22(1), 96–101 (2014). [CrossRef]  

49. G. S. D. Gordon, J. Joseph, M. P. Alcolea, T. Sawyer, A. J. Macfaden, C. Williams, C. R. M. Fitzpatrick, P. H. Jones, M. di Pietro, R. C. Fitzgerald, T. D. Wilkinson, and S. E. Bohndiek, “Quantitative phase and polarisation endoscopy applied to detection of early oesophageal tumourigenesis,” J. Biomed. Opt. 24(12), 1–13 (2019). [CrossRef]  

50. G. S. D. Gordon, G. S. D. Gordon, J. Joseph, J. Joseph, T. Sawyer, A. J. Macfaden, C. Williams, C. Williams, T. D. Wilkinson, T. D. Wilkinson, S. E. Bohndiek, S. E. Bohndiek, and S. E. Bohndiek, “Full-field quantitative phase and polarisation-resolved imaging through an optical fibre bundle,” Opt. Express 27(17), 23929–23947 (2019). [CrossRef]  

51. A. Turpin, I. Vishniakou, and J. d. Seelig, “Light scattering control in transmission and reflection with neural networks,” Opt. Express 26(23), 30911 (2018). [CrossRef]  

52. N. Borhani, E. Kakkava, C. Moser, and D. Psaltis, “Learning to see through multimode fibers,” Optica 5(8), 960 (2018). [CrossRef]  

53. B. Rahmani, D. Loterie, G. Konstantinou, D. Psaltis, and C. Moser, “Multimode optical fiber transmission with a deep learning network,” Light: Sci. Appl. 7(1), 69 (2018). [CrossRef]  

54. Y. Li, Y. Xue, and L. Tian, “Deep speckle correlation: a deep learning approach toward scalable imaging through scattering media,” Optica 5(10), 1181 (2018). [CrossRef]  

55. P. Tang, K. Zheng, W. Yuan, T. Pan, Y. Xu, S. Fu, Y. Wang, and Y. Qin, “Learning to transmit images through optical speckle of a multimode fiber with high fidelity,” Appl. Phys. Lett. 121(8), 081107 (2022). [CrossRef]  

56. G. E. Karniadakis, I. G. Kevrekidis, L. Lu, P. Perdikaris, S. Wang, and L. Yang, “Physics-informed machine learning,” Nat. Rev. Phys. 3(6), 422–440 (2021). [CrossRef]  

57. S. Li, C. Saunders, D. J. Lum, J. Murray-Bruce, V. K. Goyal, T. Cižmár, and D. B. Phillips, “Compressively sampling the optical transmission matrix of a multimode fibre,” Light: Sci. Appl. 10(1), 88 (2021). [CrossRef]  

58. M. W. Matthes, Y. Bromberg, J. de Rosny, and S. M. Popoff, “Learning and avoiding disorder in multimode fibers,” Phys. Rev. X 11(2), 021060 (2021). [CrossRef]  

59. A. d’Arco, F. Xia, A. Boniface, J. Dong, and S. Gigan, “Physics-based neural network for non-invasive control of coherent light in scattering media,” Opt. Express 30(17), 30845–30856 (2022). [CrossRef]  

60. G. Marcucci, G. Marcucci, D. Pierangeli, D. Pierangeli, P. W. H. Pinkse, M. Malik, C. Conti, and C. Conti, “Programming multi-level quantum gates in disordered computing reservoirs via machine learning,” Opt. Express 28(9), 14018–14027 (2020). [CrossRef]  

61. M. Abadi, P. Barham, J. Chen, et al., “TensorFlow: A system for large-scale machine learning,” arXiv, arXiv:1605.08695 (2016). [CrossRef]  

62. D. P. Kingma and J. Ba, “Adam: A method for stochastic optimization,” arXiv, arXiv:1412.6980 (2014). [CrossRef]  0

63. D. E. Rumelhart, G. E. Hinton, and R. J. Williams, “Learning representations by back-propagating errors,” Nature 323(6088), 533–536 (1986). [CrossRef]  

64. S. Goel, C. Conti, S. Leedumrongwatthanakun, and M. Malik, “Dataset and supplemental codes for: Referenceless characterisation of complex media using physics-informed neural networks,” Zenodo (2023), https://doi.org/10.5281/zenodo.7774974.

65. D. E. Boonzajer Flaes, J. Stopka, S. Turtaev, J. F. de Boer, T. Tyc, and T. Cižmár, “Robustness of Light-Transport Processes to Bending Deformations in Graded-Index Multimode Waveguides,” Phys. Rev. Lett. 120(23), 233901 (2018). [CrossRef]  

66. Y. Chen and Q. Kemao, “Phase-shifting algorithms with known and unknown phase shifts: comparison and hybrid,” Opt. Express 30(5), 8275–8302 (2022). [CrossRef]  

67. J. H. Bruning, D. R. Herriott, J. E. Gallagher, D. P. Rosenfeld, A. D. White, and D. J. Brangaccio, “Digital wavefront measuring interferometer for testing optical surfaces and lenses,” Appl. Opt. 13(11), 2693–2703 (1974). [CrossRef]  

68. Y. Surrel, “Design of algorithms for phase measurements by the use of phase stepping,” Appl. Opt. 35(1), 51–60 (1996). [CrossRef]  

69. Y. Sakamaki, T. Saida, T. Hashimoto, and H. Takahashi, “New Optical Waveguide Design Based on Wavefront Matching Method,” J. Lightwave Technol. 25(11), 3511–3518 (2007). [CrossRef]  

70. N. K. Fontaine, R. Ryf, H. Chen, D. Neilson, and J. Carpenter, “Design of High Order Mode-Multiplexers using Multiplane Light Conversion,” European Conference on Optical Communication pp. 1–3 (2017).

71. T. Heinosaari, L. Mazzarella, and M. M. Wolf, “Quantum tomography under prior information,” Commun. Math. Phys. 318(2), 355–374 (2013). [CrossRef]  

72. I. Vellekoop and A. Mosk, “Phase control algorithms for focusing light through turbid media,” Opt. Commun. 281(11), 3071–3080 (2008). [CrossRef]  

73. H. Yilmaz, W. L. Vos, and A. P. Mosk, “Optimal control of light propagation through multiple-scattering media in the presence of noise,” Biomed. Opt. Express 4(9), 1759 (2013). [CrossRef]  

74. M. Jang, C. Yang, and I. M. Vellekoop, “Optical Phase Conjugation with Less Than a Photon per Degree of Freedom,” Phys. Rev. Lett. 118(9), 093902 (2017). [CrossRef]  

75. C. Boncelet, “Image noise models,” in The Essential Guide to Image Processing (Elsevier, 2009), pp. 143–167.

76. Y. Surrel, “Additive noise effect in digital phase detection,” Appl. Opt. 36(1), 271–276 (1997). [CrossRef]  

77. J. R. Janesick, Photon Transfer (SPIE, 2007).

78. Y. Chen, M. K. Sharma, A. Sabharwal, A. Veeraraghavan, and A. C. Sankaranarayanan, “3PointTM: Faster Measurement of High-Dimensional Transmission Matrices,” Lecture Notes in Computer Science pp. 310–326 (2020).

79. A. D. Gomes, S. Turtaev, Y. Du, and T. Cižmár, “Near perfect focusing through multimode fibres,” Opt. Express 30(7), 10645–10663 (2022). [CrossRef]  

80. B. Mastiani, G. Osnabrugge, and I. M. Vellekoop, “Wavefront shaping for forward scattering,” Opt. Express 30(21), 37436 (2022). [CrossRef]  

81. S. Turtaev, I. T. Leite, K. J. Mitchell, M. J. Padgett, D. B. Phillips, and T. Cižmár, “Comparison of nematic liquid-crystal and dmd based spatial light modulation in complex photonics,” Opt. Express 25(24), 29874 (2017). [CrossRef]  

82. N. K. Fontaine, R. Ryf, H. Chen, D. T. Neilson, K. Kim, and J. Carpenter, “Laguerre-Gaussian mode sorter,” Nat. Commun. 10(1), 1865 (2019). [CrossRef]  

83. N. K. Fontaine, H. Chen, M. Mazur, L. Dallachiesa, K. Kim, R. Ryf, D. Neilson, and J. Carpenter, “Hermite-Gaussian mode multiplexer supporting 1035 modes,” Optical Fiber Communication Conference paper M3D–4 (2021).

84. M. Hiekkamäki, S. Prabhakar, and R. Fickler, “Near-perfect measuring of full-field transverse-spatial modes of light,” Opt. Express 27(22), 31456–31464 (2019). [CrossRef]  

85. S. Goel, M. Tyler, F. Zhu, S. Leedumrongwatthanakun, M. Malik, and J. Leach, “Simultaneously sorting overlapping quantum states of light,” Phys. Rev. Lett. 130(14), 143602 (2023). [CrossRef]  

86. S. Goel, M. Reynolds, M. Girling, W. McCutcheon, S. Leedumrongwatthanakun, V. Srivastav, D. Jennings, M. Malik, and J. K. Pachos, “Unveiling the non-abelian statistics of d(s3) anyons via photonic simulation,” arXiv, arXiv:2304.05286 (2023). [CrossRef]  

87. M. Hiekkamäki and R. Fickler, “High-dimensional two-photon interference effects in spatial modes,” Phys. Rev. Lett. 126(12), 123601 (2021). [CrossRef]  

88. P. Wang, W. Xiong, Z. Huang, Y. He, J. Liu, H. Ye, J. Xiao, Y. Li, D. Fan, and S. Chen, “Diffractive deep neural network for optical orbital angular momentum multiplexing and demultiplexing,” IEEE J. Sel. Top. Quantum Electron. 28(4), 1–11 (2022). [CrossRef]  

89. F. Crameri, “Scientific colour maps,” Zenodo (2021), https://doi.org/10.5281/zenodo.5501399.

90. F. Crameri, G. E. Shephard, and P. J. Heron, “The misuse of colour in science communication,” Nat. Commun. 11(1), 5444 (2020). [CrossRef]  

Supplementary Material (1)

NameDescription
Dataset 1       Dataset and supplemental codes

Data availability

Data underlying the results presented in this paper are available in Dataset 1 (Ref. [64]).

64. S. Goel, C. Conti, S. Leedumrongwatthanakun, and M. Malik, “Dataset and supplemental codes for: Referenceless characterisation of complex media using physics-informed neural networks,” Zenodo (2023), https://doi.org/10.5281/zenodo.7774974.

Cited By

Optica participates in Crossref's Cited-By Linking service. Citing articles from Optica Publishing Group journals and other participating publishers are listed here.

Alert me when this article is cited.


Figures (8)

Fig. 1.
Fig. 1. Schematic of an $n$-layered optical network: a cascade of $n$ complex media denoted by their optical transmission matrices $T_k$ are separated by reconfigurable phase-shifter planes $x_k$, followed by detection optics $F$. This optical network can be represented as a multi-plane neural network (MPNN) with $n+1$ input layers of $x_k$, $n$ “hidden-layers” for $T_k$, and a known layer for $F$ with a $|.|^2$ activation.
Fig. 2.
Fig. 2. (a) Experiment: Light from a superluminescent diode is filtered by a 3.1 nm spectral filter centered at 810 nm, modulated by a random phase pattern displayed on a spatial light modulator (SLM$_1$) and coupled into a multi-mode fiber (MMF) which is the complex medium of interest. The output speckle field from the MMF is projected onto SLM$_2$, which displays another random phase pattern, and is then detected on a CMOS camera which is placed in the Fourier plane of SLM$_2$. (b) The phase patterns on the SLM are constructed in a specific macro-pixel basis with varying pixel size based on the incident field distribution. Intensities of the output modes are recorded at a given set of points at the camera, which enclose an area corresponding to the MMF core. (c) The physics-informed neural network consists of two input layers encoding phase patterns on SLM$_1$ and SLM$_2$, and a single output layer encoding the intensity pattern of the output speckle in the given basis. The hidden layer $T$ denotes the complex transformation between SLM$_1$ and SLM$_2$ in the macro-pixel bases, while $F$ is a known layer corresponding to the $2f$-lens system between SLM$_2$ and the camera.
Fig. 3.
Fig. 3. Learning the coherent transmission matrix of a multi-mode fiber: Test and train (a) loss and (b) mean-absolute percentage error (MAPE) as learning proceeds in epochs. A visual plot of the learnt transmission matrix in the (c) macro-pixel and (d) Laguerre-Gauss (LG) basis, where modes are segregated into respective mode groups. In order to change the basis of the TM from (c) to (d), a transformation matrix $B_{\textrm {Pix} \rightarrow \textrm {LG}}$ is constructed that maps the set of macro-pixel modes to a set of LG modes. The TM in the LG basis is calculated as $T_{\textrm {LG}} = B_{\textrm {Pix} \rightarrow \textrm {LG}}^\dagger T_{\textrm {Pix}} B_{\textrm {Pix} \rightarrow \textrm {LG}}$.
Fig. 4.
Fig. 4. A comparison of optical phase conjugation (OPC) performed with only the first phase plane: (a) Focusing efficiencies achieved at different points across the output facet of the multi-mode fiber using the phase-stepping (PS) and multi-plane neural network (MPNN) techniques. Output points with low focusing efficiency due to the inherent dark-spot problem can be seen in the PS method (see text for more details). In contrast, focusing efficiencies obtained with the MPNN method show more uniformity across the fiber facet. Log-scale images of light focused using the first SLM with a TM obtained by the (b) PS and (c) MPNN techniques. (d) A histogram comparing the PS and MPNN methods shows that the PS technique leads to some output points with very low focusing efficiencies, while those obtained with MPNN method are more uniform and significantly higher. Note: Only the points within first 80% diameter of the core from the center of the core are taken for the purpose of this histogram.
Fig. 5.
Fig. 5. (a)-(c) A comparison of optical phase conjugation (OPC) performed with only the second phase plane. Log-scale images of light focused using only $\textrm {SLM}_{2}$ with a TM obtained by the (a) phase-stepping and (b) multi-plane neural network techniques. (c) A histogram comparing second-plane focusing efficiencies of 50 random input modes achieved with the PS and MPNN techniques shows a marked improvement with the latter. (d)-(f) A comparison of OPC performed with both phase planes. Log-scale images of light focused using both SLMs with a TM obtained by the (d) PS and (e) MPNN techniques. (f) A histogram comparing focusing efficiencies for all output modes achieved with both planes shows a significant improvement with MPNN over the PS technique. The focusing efficiencies are corrected for SLM$_2$ basis-dependent loss for both methods.
Fig. 6.
Fig. 6. Number of measurements required for optimal transmission matrix measurement: Experimental focusing efficiency achieved with (a) the first phase plane, (b) the second phase plane, and (c) both planes simultaneously, using a TM reconstructed with the phase-stepping technique and the multi-plane neural network (MPNN) plotted as a function of the number of measurements per input mode ($\alpha$). While phase-stepping converges to a maximum faster, the MPNN technique shows a much higher focusing efficiency.
Fig. 7.
Fig. 7. Noise-robustness of MPNN: A simulated $800 \times 800$- dimensional TM is recovered using the (a) phase-stepping (PS) and (b) multi-plane neural network (MPNN) techniques for different additive gaussian noise levels. The fidelity of the recovered TM is plotted as a function of the number of measurements per input mode ($\alpha$). While the PS technique quickly degrades in the presence of noise, the MPNN technique is able to reach high fidelities with small increases in the number of measurements.
Fig. 8.
Fig. 8. Learning multiple transmission matrices with an MPNN: (a) A cascade of multiple complex media interspersed by programmable phase planes can be characterized through the use of the MPNN technique. The recovered TMs corresponding to these media allows us to control the propagation of light at intermediate phase planes. (b) Numerical simulation showing three $16 \times 16$-dimensional TMs reconstructed with the MPNN technique, which are then used to focus light at each intermediate plane (insets). (c) Optimization loss using the training and testing datasets for the three-plane MPNN.

Equations (5)

Equations on this page are rendered with MathJax. Learn more.

y = | F P n + 1 T n P n T k P k T 2 P 2 T 1 x 1 | 2 , = | F ( x n + 1 ( T n ( x n T 2 ( x 2 T 1 x 1 ) ) ) ) | 2 ,
M S E = i | y ¯ i y i | 2 ,
y = | F ( x 2 T x 1 ) | 2 ,
y ~ σ = | F ( x 2 T x 1 ) | 2 + N ( μ , σ ) ,
T k P k T k 1 = T k D P k D 1 T k 1 = T ~ k P k T ~ k 1
Select as filters


Select Topics Cancel
© Copyright 2024 | Optica Publishing Group. All rights reserved, including rights for text and data mining and training of artificial technologies or similar technologies.