Expand this Topic clickable element to expand a topic
Skip to content
Optica Publishing Group

Inverse design of optical needles with central zero-intensity points by artificial neural networks

Open Access Open Access

Abstract

Optical needles with central zero-intensity points have attracted much attention in the field of 3D super-resolution microscopy, optical lithography, optical storage and Raman spectroscopy. Nevertheless, most of the studies create few types of optical needles with central zero-intensity points based on the theory and intuition with time-consuming parameter sweeping and complex pre-select of parameters. Here, we report on the inverse design of optical needles with central zero-intensity points by dipole-based artificial neural networks (DANNs), permitting the creation of needles which are close to specific length and amplitude. The resolution of these optical needles with central zero-intensity points is close to axial diffraction limit (∼1λ). Additionally, the DANNs can realize the inverse design of several types on-axis distributions, such as optical needles and multifocal distributions.

© 2020 Optical Society of America under the terms of the OSA Open Access Publishing Agreement

1. Introduction

With the characteristic of axial central minimized points and the ability to realize z-axis super-resolution, the optical needles with central zero-intensity points have attracted broad attention for their wide applications in spectroscopy [1,2] and 3D STED [3,4]. Besides, the optical needles with central zero-intensity points have the potential demand in all-optical magnetic recording [5], optical lithography [6] and fluorescent imaging [7].

So far, optical needles with central zero-intensity points have been demonstrated theoretically [810] and experimentally [13,11,12]. Optical needles with central zero-intensity points can be generated by the 4pi system [9,13] or phase plate [1,8,11]. Although there exist some theories and algorithms [1,4,8,9,14,15], these algorithms require physical perception and sweeping of many parameters, and the amplitude and resolution of the phase plate are invariable after optimizing. Besides, the initial parameters of these algorithms are set intuitively or empirically [16].

The general method of inverse design can provide the best possible output automatically after the desired targets input, which has been used to demonstrate an integrated nanophotonics polarization beamsplitters with controllable extinction ratios [17]. After combining with ANNs, the method can provide both the generality present in numerical optimization schemes and the speed of an analytical resolution [18], which have been introduced to design circuit elements [1921], vectorial holography [22], nanophononics particles [18] and metasurfaces [23].

In this work, we report on the inverse design of optical needles with central zero-intensity points by DANNs. Optical needles with central zero-intensity points can be inversely designed with different amplitude and resolution with DANNs for the first time. We generate several optical needles with central zero-intensity points along the longitudinal direction in this work and the resolution of these optical needles with central zero-intensity points are close to lateral diffraction limit (∼1λ). Surprisingly, the network can also realize inverse design of multi-focal distributions and optical needles. We generate some optical needles with DANNs. The depth of focus (DOF) (>81% for both cases) of optical needles ranges from 2.5λ to 8λ and most of them are sub-diffractive. This study provides new insights into the length-selectable optical tweezers [2427] and super-resolution microscopy [3,11,12].

2. Network architecture for the inverse design of optical needles with central zero-intensity points

The DANN employed here is a multilayer perception neural network [18,28], which can directly predict the optical needles with central zero-intensity points by on-axis intensity distribution due to dipole arrays’ axially and radially symmetrical properties. The advantage of direct prediction is superior performance [28], low-dimension sample data and less occupation of computer memory [18], which enables the network to be accomplished on a laptop. Figure 1 displays the framework of DANNs. The input to the DANN is a target on-axis intensity distribution I, while the output of the DANN is amplitude function E0 in the pupil plane.

 figure: Fig. 1.

Fig. 1. (a) Training process of DANNs. The parameters, pupil plane and distributions are unnormalized. (b) Flow-process diagram of the inverse design of optical needles with central zero-intensity points by DANNs. FFT denotes the fast Fourier transform. The predicted electric field in the pupil plane is radially polarized.

Download Full Size | PDF

The determination process of DANN is as follows. The on-axis intensity distribution is fed into the input layer I which is connected to the hidden layer H by the first weight matrix W1. Then, the input to hidden layer H is transmitted through a Sigmoid function. Additionally, the output of hidden layer H is calculated via the second weight matrix W2. Finally, the output layer E0 adopts the weighted output from second weight matrix W2, gets through linear output neurons and returns the amplitude function. Here, the network comprises 203 pixel on-axis intensity distribution and a single hidden layer consists of 256 neurons.

Here we assume many on-axis dipoles are symmetrically distributed along z axis and these ‘virtual’ dipoles can be derived to pupil plane function easily according to propagating function, which is further introduced in Eqs. (A4), (A5) and (A6). When the dipoles are distributed precisely, the optical needle with central zero-intensity points is created. We choose this method because the optical needles with central zero-intensity points can be generated by changing parameters on the pupil plane easily. Besides, this method can directly show the characteristics of an assumed optical needle with central zero-intensity points. Moreover, this method can be realized experimentally with a discrete complex pupil filter [16]. Figure 1(a) illustrates the training process of DANNs. The target pupil plane is derived from assumed dipoles which can be expressed by several parameters. The on-axis distributions are fed into the DANNs after Fast Fourier Transform (FFT). After several loops of forward and backward propagation, the DANN can predict parameters. The detailed definition of amplitude An, dipoles’ spacing distance dn and initial phase βn in Fig. 1(a) is discussed in Appendix A. Figure 1(b) illustrates pairs of dipoles (red horizontal arrows) design symmetrically along the z axis [16] and the derivation from dipoles to the pupil plane is shown in red declining arrows. The detailed derivation process will be introduced in Appendix A. In our work, we attempt to assume the situation of N2 [16] (2 pairs of optical needles), N3, N4 and N5.

3. Training of DANNs

The training is carried out by gradient descent with random fixed step size based on Mean Squared Error (MSE) [29]

$$MSE = \frac{1}{n}\mathop \sum \limits_{i = 1}^n {({{y_i} - {y_i}^{\prime}} )^2}.$$

In the training process, MSE is labelled as MSEtra. The yi denotes the target parameters in Fig. 1(a), yi denotes the predicted parameters in Fig. 1(a) and (n) denotes the number of samples. During the evaluation of result, MSE is labelled as MSEeva. The yi denotes the targe on-axis intensity distribution, yi denotes predicted on-axis intensity distribution and n denotes the number of samples. The value of MSE denotes the residual function which measures the quality of prediction during the training process.

To construct the network, we use MATLAB neural network toolbox, as it is a well-updated and easily operated network toolbox [30,31]. The DANNs are composed of a three-layer feed-forward and back-propagation network [28] trained with Levenberg-Marquardt backpropagation algorithm [32], because the three layer structure is widely utilized in chemistry [33], metallurgy [34], meteorology [35] and water conservancy project [36] as its capacities of non-linear prediction and recognition of high dimensional correlation. The Levenberg-Marquardt backpropagation algorithm is a mature algorithm in geography [37], electronics [38], economics [39] and water conservancy project [40]. The starting weights and bias will be randomized due to inner setting of the toolbox, while specific weights and networks can also set the start point to get a further and better performance. Besides, overfitting is one of the problems which should be avoided when training [28], so the network toolbox sets a validation check to check whether the error on the validation dataset starts to rise [30,31,41]. After the validation error reaches a number of iterations, the training process stops and the weights and biases are output to avoid overfitting. Each DANN spends 0.5–2 minutes in training process with MATLAB neural network toolbox, which spends less time than some algorithms [4244].

4. Network training and analyzing

The pair number of dipoles and corresponding parameters ranges, which are fed and trained in the DANNs, are given in Table 1. An is the arbitrary intensity, dn is the spatial distance of symmetrical dipoles and βn is the initial phase. The λ is wavelength of incident beam.

Tables Icon

Table 1. Parameters ranges of different dipole arrays for the training of DANNs

The error between target parameter and predicted parameter under N2 and N5 condition is displayed in Fig. 2(a). With traditional algorithms, the times and complexity grow exponentially as the number of parameters increases [18], so the prediction of high-N needles will be hindered and N3 is close to the calculation limitation of computers. Here, although the error increases with the increasing pair number, 5 pairs of dipoles can also be predicted easily and rapidly. When analyzing the predicted parameters of optical needles in Appendix B, some parameters are out of the limited scale, which shows that the networks are trained-well and worked.

 figure: Fig. 2.

Fig. 2. (a) Error between target parameters and predicted parameters under N2 and N5 condition. Each color represents one pair of parameters and contains stimulation (solid) and target (hollow) parameter. Geometric figures are also introduced to distinguish every pair of parameters clearly. (b) and (c) show gradient descent of N2 and N5 MSEtra during training process.

Download Full Size | PDF

According to Fig. 2(a), the networks are hard to converge when the parameter scale is enlarged and the number of dipole arrays is increasing. The corresponding gradient descent of MSEtra during training in two different situations is given in Figs. 2(b) and (c).

5. Inverse design of optical needles and optical needles with zero-intensity points

5.1 Inverse design of optical needles

This network has the ability to recognize different on-axis distribution after the similar models fed in, such as the optical needles, optical needles with central zero-intensity points and multi-focal distributions. After analyzing the optical needles existed [16,45], we visually find that optical needles have a flat top and both edges of on-axis intensity attenuate sharply from top to nearly zero. With curve fitting and feature analysis, what we found is that the first-order Gaussian function fits different edges well, so we simplify the model intuitively and create the Gaussian-edge model.

$${I_1} = {A_1}{\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {z_{left}} \le z \le {z_{right}},$$
$${I_1} = {A_1} \cdot \textrm{exp} \left[ { - {{\left( {\frac{{z - {B_1}}}{{{C_1}}}} \right)}^2}} \right]{\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} z\textrm{ < }{z_{left}}\textrm{ or }z\textrm{ > }{z_{right}},$$
where I1 denotes the unnormalized on-axis intensity, A1 denotes the unnormalized top of the optical needles, B1 changes the position of Gauss function and C1 influence the slope of edge. With Gaussian-edge model, it is available to generate optical needles in Fig. 3. Besides the presented two optical needles in Fig. 3, the other 36 optical needles are shown in Appendix B.

 figure: Fig. 3.

Fig. 3. The first column is input (blue) and predicted (yellow) normalized on-axis intensity. The second column displays the predicted input pupil electric field. The third column shows the corresponding optical needles and evaluating parameters (white letters). The predicted input electric field includes both amplitude and phase. FWHM denotes full width at half maximum of central radial direction. Rm denotes the maximum radius of the entrance pupil. The predicted electric field in the pupil plane is radially polarized.

Download Full Size | PDF

To evaluate the optical needles, we introduce the purity η [16]

$$\eta = \frac{{{\mathrm{\Phi }_z}}}{{{\mathrm{\Phi }_z} + {\mathrm{\Phi }_r}}},$$
which η denotes the percentage of longitudinal intensity to the total field. φz and φr are longitudinal and radial components of one plane and they can be provided as
$${\mathrm{\Phi }_{z(r)}} = \mathop \smallint \nolimits_0^{{r_0}} {|{{E_{z(r)}}(r,z)} |^2}rdr,$$
where r0 denotes the first zero point of the radial component intensity. Here we choose the focal plane (z=0) to evaluate the purity.

5.2 Inverse design of optical needles with central zero-intensity points

The DANNs can also predict central minimized points. We found that the networks were sensitive to hollows in the middle of the on-axis distribution. After analyzing the feature of optical needles with central zero-intensity points and the convergence distance of the Gaussian function, we create the multi-trapezoidal-shape model. The one-trapezoidal-shape model can be shown as

$${I_1} = {A_1}{\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {z_{left}} \le z \le {z_{right}},$$
$${I_1} = k \cdot {A_1} \cdot z + {B_1}{\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} z\textrm{ < }{z_{left}}\textrm{ or }z\textrm{ > }{z_{right}},$$
which I1 denotes the unnormalized on-axis intensity, A1 denotes the unnormalized top of the optical needles with central zero-intensity points, k denotes the slope of edge and B1 denotes the intercept of edge. With this model, we successfully predict many optical needles with central zero-intensity points, which is shown in Fig. 4. The contrast (Imax/Imin) are above 105−106 and the resolutions are close to 1λ, so these are qualified central minimized points.

 figure: Fig. 4.

Fig. 4. Generation of optical needles with central zero-intensity points by N2, N3, N4 and N5 network (the first four rows). The first column is input (blue) and predicted (yellow) normalized on-axis intensity. The second column displays the predicted input pupil electric field. The third column shows the corresponding optical needles with central zero-intensity points and evaluating parameters (white letters). Resolution is the total width from the central zero-intensity point to the closest half maximum point of both sides. Contrast represents the sharpness of the central zero-intensity points. The predicted input electric field includes both amplitude and phase. Rm denotes the maximum radius of the entrance pupil. The predicted electric field in the pupil plane is radially polarized.

Download Full Size | PDF

The ratios of maximum on-axis intensity of focal field (Imaxfocal) to maximum on-axis input field (Imaxinput) are more than 105 in all the samples provided below, which means that the beam is tightly focused into the focal region.

5.3 Inverse design of optical needles with zero-intensity points and multifocal regions

Additionally, we also create needles with zero-intensity points (the first row of Fig. 5) with the N3 network, which has a flat top instead of sloping fast, and one multifocal region (the second row of Fig. 5). Nevertheless, as the flat top becomes wider and the shape of on-axis distribution become complex, it’s hard to predict needles with three-layer DANNs because of the predicting limit of these networks. More complex networks such as CNNs [46] or multi-layers DANNs can have a better performance.

 figure: Fig. 5.

Fig. 5. The first row is an optical needle with a zero-intensity point and the second row is a multifocal region. The predicted input electric field includes both amplitude and phase. Rm denotes the maximum radius of the entrance pupil. The predicted electric field in the pupil plane is radially polarized.

Download Full Size | PDF

6. Conclusion

Generation of optical needles with central zero-intensity points with specific resolution and amplitude requires inverse design. In this work, we proposed the inverse design of optical needles with central zero-intensity points by DANNs and permit the inverse design of optical needles with central zero-intensity points which are close to specific resolution and amplitude for the first time. The resolution is close to axial diffraction limit (∼1λ) and it shows the potential of designing sub-diffractive optical needles with central zero-intensity points by DANNs. Meanwhile, the multifunctional DANNs can also realize the inverse design of sub-diffractive optical needles range from 2.5λ to 8λ and multifocal regions. This study provided an interesting opportunity for the length-selectable optical tweezers [2427] and super-resolution microscopy [3,11,12] and could be further applied in fluorescent imaging [7], optical lithography [6] and all-optical magnetic recording [5].

Appendix A: Acceleration of training data generation on a high numerical aperture lens

The optical system of DANNs is modelled under high NA lens. Here, we choose the lens with a NA of 0.95. A radially polarized beam at a wavelength of 488 nm is selected as the incident beam [47].

The electric radiation in the pupil plane of the dipole array can be derived from [7,16,48]

$${\vec{E}_0}(\theta ) = C\sin \theta A{F_N}{\vec{a}_\theta },$$
$$A{F_N} = \mathop \sum \limits_{n = 1}^N {A_n}[{{e^{j({k{d_n}\cos \theta + {\beta_n}} )/2}} + {e^{ - j({k{d_n}\cos \theta + {\beta_n}} )/2}}} ],$$
which ${\vec{E}_0}(\theta )$ is the electric radiation in the pupil plane, C is the intrinsic impedance of air [16] and normalized to 1 in our assumption, θ is the incident angle of the beam and ${\vec{a}_\theta }$ is a unit vector pointing from spherical surface Ω in Fig. 1 (dotted arc) to focal region. AFN is the array factor [16] which is relevant to amplitude An, incidence angle θ, dipoles’ spacing distance dn and initial phase βn. The N here is the number of dipoles and is an even number. Furthermore, we can simplify Eq. (A2) to
$$A{F_N} = \mathop \sum \limits_{n = 1}^N 2{A_n}\cos (\frac{{k{d_n}\cos \theta \textrm{ + }{\beta _\textrm{n}}}}{2}).$$

The incident beam can be modulated by a discrete complex pupil filter [16]. In our work, we assume the situation of N = 4 (N2) or 6 (N3) or 8 (N4) or 10 (N5).

With the help of fast Fourier transform [49], it took 0.6 seconds to produce a pair of samples on a laptop with RTX2060 and Intel i7 9th CPU. When generating an axial cross section, vector Debye integral is always introduced due to its specific optimization and numerical precision [50], but the double integral is time-consuming. Whilst, with the help of fast Fourier transform [49], the speed can be accelerated greatly.

Fast Fourier transform of three Debye electric field components in the Sine condition objective lens and radially polarized beam distribution across the pupil plane are expressed by the Eq. (A4), (A5) and (A6), we set the precision of FFT as 20 nm in radial position and 20 nm in axial position.

$${E_x}(z) = FFT\{{C\sin \theta A{F_N}\cos \theta \cos \varphi \textrm{exp} [ - ikr\sin \theta \cos (\varphi - \psi ) - ik\textrm{z}\cos \theta ]\sin \theta } \},$$
$${E_y}(z) = FFT\{{C\sin \theta A{F_N}\cos \theta \sin \varphi \textrm{exp} [ - ikr\sin \theta \cos (\varphi - \psi ) - ik\textrm{z}\cos \theta ]\sin \theta } \},$$
$${E_z}(z) = FFT\{{C\sin \theta A{F_N}\sin \theta \textrm{exp} [ - ikr\sin \theta \cos (\varphi - \psi ) - ik\textrm{z}\cos \theta ]\sin \theta } \}.$$

Moreover, we choose on-axis light intensity distribution as our samples, because the diploes’ distribution is symmetrical axially and radially [16], so there are no requirements to lock the distribution with two planes [46]. With the design above, the input matrix dimension is reduced greatly and the network occupies less computer memory, so the generation and training process can both implement on the laptop. Additionally, we would like to emphasize that the datasets are unnormalized and there will be an ergodic and evaluating process of finding an optical needle after training.

After the fast Fourier transform, the intensity distribution comprises three Debye electric field components. The on-axis intensity is available when $x$ = 0 and $y$ = 0, which is provided in Eq. (A7).

$$I(z) = {|{{E_x}(z)} |^2} + {|{{E_y}(z)} |^2} + {|{{E_z}(z)} |^2}.$$

Appendix B: Several optical needles and their parameters

The optical needles created by N2, N3, N4 and N5 DANN are listed in Tables 234 and 5. The sampling precision of the fast Fourier transform here is 10 nm in radial position and 10 nm in axial position.

Tables Icon

Table 2. List of created optical needles by DANN of N2

Tables Icon

Table 3. List of created optical needles by DANN of N3

Tables Icon

Table 4. List of created optical needles by DANN of N4

Tables Icon

Table 5. List of created optical needles by DANN of N5

Minimizing the scale of parameters is one of the most time-consuming processes, so the tables shown below will pave the way for further optimization. Besides, with the analysis of these parameters, some thought-provoking ideas or physical models can also be implemented in further progress.

Funding

National Natural Science Foundation of China (61975123); Zhangjiang National Innovation Demonstration Zone (ZJ2019-ZD-005).

Acknowledgments

We would like to thank the support from all the members in the Centre for Artificial-Intelligence Nanophotonics. Additionally, Wei Xin wants to thank Dr Yangyundou Wang, Mr. Yiming Li, Mr. Hao Dong and associate professor Zhiwei Bi for discussing of neural networks, MATLAB, Blender and academic English. Wei Xin would also specifically like to give his sincere thanks to his parents for their support over the years.

Disclosures

The authors declare no conflicts of interest.

References

1. Y. Ma and T. Ha, “Fight against background noise in stimulated emission depletion nanoscopy,” Phys. Biol. 16(5), 051002 (2019). [CrossRef]  

2. A. Barbotin, S. Galiani, I. Urbančič, C. Eggeling, and M. J. Booth, “Adaptive optics allows STED-FCS measurements in the cytoplasm of living cells,” Opt. Express 27(16), 23378–23395 (2019). [CrossRef]  

3. R. Schmidt, C. A. Wurm, S. Jakobs, J. Engelhardt, A. Egner, and S. W. Hell, “Spherical nanosized focal spot unravels the interior of cells,” Nat. Methods 5(6), 539–544 (2008). [CrossRef]  

4. M. G. Velasco, M. Zhang, J. Antonello, P. Yuan, E. Allgeyer, D. May, O. M’Saad, P. Kidd, A. Barentine, V. Greco, J. Grutzendler, M. Booth, and J. Bewersdorf, “3D super-resolution deep-tissue imaging in living mice,” (2019).

5. Y. Jiang, X. Li, and M. Gu, “Generation of sub-diffraction-limited pure longitudinal magnetization by the inverse Faraday effect by tightly focusing an azimuthally polarized vortex beam,” Opt. Lett. 38(16), 2957–2960 (2013). [CrossRef]  

6. Z. Gan, Y. Cao, R. A. Evans, and M. Gu, “Three-dimensional deep sub-diffraction optical beam lithography with 9 nm feature size,” Nat. Commun. 4(1), 2061 (2013). [CrossRef]  

7. H. Li, Y. Wang, and P. Chen, “Ultra-long optical needles with controllable homogeneously 3D spin-orientation produced with an annular spherical mirror,” in Proc.SPIE (2019), Eleventh International Conference on Information Optics and Photonics (CIOP 2019), 1120908 (20 December 2019).

8. Y. Li, L. Lai, C. Rui, and L. Wang, “Optimization of depletion focal spot in STED nanoscopy using amplitude manipulation,” Opt. Commun. 372, 132–136 (2016). [CrossRef]  

9. M. Luo, D. Sun, Y. Yang, S. Liu, J. Wu, Z. Ma, S. Sun, and X. Sun, “Three-dimensional isotropic STED microscopy generated by 4π focusing of a radially polarized vortex Laguerre–Gaussian beam,” Opt. Commun. 463, 125434 (2020). [CrossRef]  

10. Y. Li, H. Zhou, X. Liu, Y. Li, and L. Wang, “Effects of aberrations on effective point spread function in STED microscopy,” Appl. Opt. 57(15), 4164–4170 (2018). [CrossRef]  

11. J. Heine, C. A. Wurm, J. Keller-Findeisen, A. Schönle, B. Harke, M. Reuss, F. R. Winter, and G. Donnert, “Three dimensional live-cell STED microscopy at increased depth using a water immersion objective,” Rev. Sci. Instrum. 89(5), 053701 (2018). [CrossRef]  

12. T. A. Klar, S. Jakobs, M. Dyba, A. Egner, and S. W. Hell, “Fluorescence microscopy with diffraction resolution barrier broken by stimulated emission,” Proc. Natl. Acad. Sci. U. S. A. 97(15), 8206–8210 (2000). [CrossRef]  

13. X. Hao, E. S. Allgeyer, M. J. Booth, and J. Bewersdorf, “Point-spread function optimization in isoSTED nanoscopy,” Opt. Lett. 40(15), 3627–3630 (2015). [CrossRef]  

14. X. Yang, H. Xie, E. Alonas, Y. Liu, X. Chen, P. J. Santangelo, Q. Ren, P. Xi, and D. Jin, “Mirror-enhanced super-resolution microscopy,” Light Sci. Appl. 5(6), e16134 (2016). [CrossRef]  

15. A. Barbotin, I. Urbančič, S. Galiani, C. Eggeling, M. Booth, and E. Sezgin, “z-STED Imaging and Spectroscopy to Investigate Nanoscale Membrane Structure and Dynamics,” Biophys. J. 118(10), 2448–2457 (2020). [CrossRef]  

16. J. Wang, W. Chen, and Q. Zhan, “Engineering of high purity ultra-long optical needle field through reversing the electric dipole array radiation,” Opt. Express 18(21), 21965–21972 (2010). [CrossRef]  

17. B. Shen, P. Wang, R. Polson, and R. Menon, “An integrated-nanophotonics polarization beamsplitter with 2.4×2.4µm 2 footprint,” Nat. Photonics 9(6), 378–382 (2015). [CrossRef]  

18. J. Peurifoy, Y. Shen, L. Jing, Y. Yang, F. Cano-Renteria, B. G. DeLacy, J. D. Joannopoulos, M. Tegmark, and M. Soljačić, “Nanophotonic particle simulation and inverse design using artificial neural networks,” Sci. Adv. 4(6), eaar4206 (2018). [CrossRef]  

19. E. Goi, Q. Zhang, X. Chen, H. Luan, and M. Gu, “Perspective on photonic memristive neuromorphic computing,” PhotoniX 1(1), 1–26 (2020). [CrossRef]  

20. Q. Zhang, H. Yu, M. Barbiero, B. Wang, and M. Gu, “Artificial neural networks enabled by nanophotonics,” Light: Sci. Appl. 8(1), 1–14 (2019). [CrossRef]  

21. W. Ma, Z. Liu, Z. A. Kudyshev, A. Boltasseva, W. Cai, and Y. Liu, “Deep learning for the design of photonic structures,” Nat. Photonics1–14 (2020). [CrossRef]  

22. H. Ren, W. Shao, Y. Li, F. Salim, and M. Gu, “Three-dimensional vectorial holography based on machine learning inverse design,” Sci. Adv. 4(1), 2061 (2020). [CrossRef]  

23. Z. Liu, D. Zhu, S. P. Rodrigues, K. T. Lee, and W. Cai, “Generative Model for the Inverse Design of Metasurfaces,” Nano Lett. 18(10), 6570–6576 (2018). [CrossRef]  

24. B. Spektor, A. Normatov, and J. Shamir, “Singular beam microscopy,” Appl. Opt. 47(4), A78–A87 (2008). [CrossRef]  

25. T. Kinoshita, “Stress singularity near the crack-tip in silicon carbide: Investigation by atomic force microscopy,” Acta Mater. 46(11), 3963–3974 (1998). [CrossRef]  

26. T. A. Nieminen, N. R. Heckenberg, and H. Rubinsztein-Dunlop, “Computational modeling of optical tweezers,” Opt. Trapp. Opt. Micromanipulation 5514, 514–523 (2004). [CrossRef]  

27. Y. Shen, X. Wang, Z. Xie, C. Min, X. Fu, Q. Liu, M. Gong, and X. Yuan, “Optical vortices 30 years on: OAM manipulation from topological charge to multiple singularities,” Light: Sci. Appl. 8(1), 90 (2019). [CrossRef]  

28. B. P. Cumming and M. Gu, “Direct determination of aberration functions in microscopy by an artificial neural network,” Opt. Express 28(10), 14511–14521 (2020). [CrossRef]  

29. J. Prakash Vijay, N. Kumar, S. A. Professor, and M. T. Scholar, “Performance Analysis Of RLS Over LMS Algorithm For MSE In Adaptive Filters,” Int. J. Technol. Enhanc. Emerg. Eng. Res. 2, 40–44 (2014).

30. I. Kirbas and A. Kerem, “Short-Term Wind Speed Prediction Based on Artificial Neural Network Models,” Meas. Control 49(6), 183–190 (2016). [CrossRef]  

31. D. P. B. T. B. Strik, A. M. Domnanovich, L. Zani, R. Braun, and P. Holubar, “Prediction of trace compounds in biogas from anaerobic digestion using the MATLAB Neural Network Toolbox,” Environ. Model. Softw. 20(6), 803–810 (2005). [CrossRef]  

32. J. Moré, “The Levenberg-Marquardt algorithm: Implementation and theory,” Numer. Anal. 630, G. A. Watson, ed. (Springer-Verlag, Berlin, 1977).

33. T. B. Blank and S. D. Brown, “Nonlinear Multivariate Mapping of Chemical Data Using Feed-Forward Neural Networks,” Anal. Chem. 65(21), 3081–3089 (1993). [CrossRef]  

34. Y. C. Lin, J. Zhang, and J. Zhong, “Application of neural networks to predict the elevated temperature flow behavior of a low alloy steel,” Comput. Mater. Sci. 43(4), 752–758 (2008). [CrossRef]  

35. B. ZareNezhad and A. Aminian, “A multi-layer feed forward neural network model for accurate prediction of flue gas sulfuric acid dew points in process industries,” Appl. Therm. Eng. 30(6-7), 692–696 (2010). [CrossRef]  

36. A. Sarkar and P. Pandey, “River Water Quality Modelling Using Artificial Neural Network Technique,” Aquat. Procedia 4, 1070–1077 (2015). [CrossRef]  

37. Ö Çelik, A. Teke, and H. B. Yildirim, “The optimized artificial neural network model with Levenberg-Marquardt algorithm for global solar radiation estimation in Eastern Mediterranean Region of Turkey,” J. Cleaner Prod. 116, 1–12 (2016). [CrossRef]  

38. B. G. Kermani, S. S. Schiffman, and H. T. Nagle, “Performance of the Levenberg-Marquardt neural network training method in electronic nose applications,” Sens. Actuators, B 110(1), 13–22 (2005). [CrossRef]  

39. S. Mammadli, “Financial time series prediction using artificial neural network based on Levenberg-Marquardt algorithm,” Procedia Comput. Sci. 120, 602–607 (2017). [CrossRef]  

40. A. J. Adeloye and A. De Munari, “Artificial neural network based generalized storage-yield-reliability models using the Levenberg-Marquardt algorithm,” J. Hydrol. 326(1-4), 215–230 (2006). [CrossRef]  

41. M. H. Beale, M. T. Hagan, and B. Demuth, Neural Network Toolbox User’s Guide (The MathWorks. Inc, 2013).

42. J. Lin, H. Zhao, Y. Ma, J. Tan, and P. Jin, “New hybrid genetic particle swarm optimization algorithm to design multi-zone binary filter,” Opt. Express 24(10), 10748–10758 (2016). [CrossRef]  

43. Y. Zha, J. Wei, H. Wang, and F. Gan, “Creation of an ultra-long depth of focus super-resolution longitudinally polarized beam with a ternary optical element,” J. Opt. (United Kingdom) 15 (2013).

44. S. W. K. Roper, S. Ryu, B. Seong, C. Joo, and I. Y. Kim, “A topology optimization implementation for depth-of-focus extension of binary phase filters,” Struct. Multidiscip. Optim. 62(5), 2731–2748 (2020). [CrossRef]  

45. H. Wang, L. Shi, B. Lukyanchuk, C. Sheppard, and C. T. Chong, “Creation of a needle of longitudinally polarized light in vacuum using binary optics,” Nat. Photonics 2(8), 501–505 (2008). [CrossRef]  

46. H. Ma, H. Liu, Y. Qiao, X. Li, and W. Zhang, “Numerical study of adaptive optics compensation based on Convolutional Neural Networks,” Opt. Commun. 433, 283–289 (2019). [CrossRef]  

47. R. Dorn, S. Quabis, and G. Leuchs, “Sharper focus for a radially polarized light beam,” Phys. Rev. Lett. 91(23), 233901 (2003). [CrossRef]  

48. J. Luo, H. Zhang, S. Wang, L. Shi, Z. Zhu, B. Gu, X. Wang, and X. Li, “Three-dimensional magnetization needle arrays with controllable orientation,” Opt. Lett. 44(4), 727–730 (2019). [CrossRef]  

49. M. Leutenegger, R. Rao, R. A. Leitgeb, and T. Lasser, “Fast focus field calculations,” Opt. Express 14(23), 11277–11291 (2006). [CrossRef]  

50. M. Gu, Advanced Optical Imaging Theory (Springer, 2000).

Cited By

Optica participates in Crossref's Cited-By Linking service. Citing articles from Optica Publishing Group journals and other participating publishers are listed here.

Alert me when this article is cited.


Figures (5)

Fig. 1.
Fig. 1. (a) Training process of DANNs. The parameters, pupil plane and distributions are unnormalized. (b) Flow-process diagram of the inverse design of optical needles with central zero-intensity points by DANNs. FFT denotes the fast Fourier transform. The predicted electric field in the pupil plane is radially polarized.
Fig. 2.
Fig. 2. (a) Error between target parameters and predicted parameters under N2 and N5 condition. Each color represents one pair of parameters and contains stimulation (solid) and target (hollow) parameter. Geometric figures are also introduced to distinguish every pair of parameters clearly. (b) and (c) show gradient descent of N2 and N5 MSEtra during training process.
Fig. 3.
Fig. 3. The first column is input (blue) and predicted (yellow) normalized on-axis intensity. The second column displays the predicted input pupil electric field. The third column shows the corresponding optical needles and evaluating parameters (white letters). The predicted input electric field includes both amplitude and phase. FWHM denotes full width at half maximum of central radial direction. Rm denotes the maximum radius of the entrance pupil. The predicted electric field in the pupil plane is radially polarized.
Fig. 4.
Fig. 4. Generation of optical needles with central zero-intensity points by N2, N3, N4 and N5 network (the first four rows). The first column is input (blue) and predicted (yellow) normalized on-axis intensity. The second column displays the predicted input pupil electric field. The third column shows the corresponding optical needles with central zero-intensity points and evaluating parameters (white letters). Resolution is the total width from the central zero-intensity point to the closest half maximum point of both sides. Contrast represents the sharpness of the central zero-intensity points. The predicted input electric field includes both amplitude and phase. Rm denotes the maximum radius of the entrance pupil. The predicted electric field in the pupil plane is radially polarized.
Fig. 5.
Fig. 5. The first row is an optical needle with a zero-intensity point and the second row is a multifocal region. The predicted input electric field includes both amplitude and phase. Rm denotes the maximum radius of the entrance pupil. The predicted electric field in the pupil plane is radially polarized.

Tables (5)

Tables Icon

Table 1. Parameters ranges of different dipole arrays for the training of DANNs

Tables Icon

Table 2. List of created optical needles by DANN of N2

Tables Icon

Table 3. List of created optical needles by DANN of N3

Tables Icon

Table 4. List of created optical needles by DANN of N4

Tables Icon

Table 5. List of created optical needles by DANN of N5

Equations (14)

Equations on this page are rendered with MathJax. Learn more.

M S E = 1 n i = 1 n ( y i y i ) 2 .
I 1 = A 1 z l e f t z z r i g h t ,
I 1 = A 1 exp [ ( z B 1 C 1 ) 2 ] z  <  z l e f t  or  z  >  z r i g h t ,
η = Φ z Φ z + Φ r ,
Φ z ( r ) = 0 r 0 | E z ( r ) ( r , z ) | 2 r d r ,
I 1 = A 1 z l e f t z z r i g h t ,
I 1 = k A 1 z + B 1 z  <  z l e f t  or  z  >  z r i g h t ,
E 0 ( θ ) = C sin θ A F N a θ ,
A F N = n = 1 N A n [ e j ( k d n cos θ + β n ) / 2 + e j ( k d n cos θ + β n ) / 2 ] ,
A F N = n = 1 N 2 A n cos ( k d n cos θ  +  β n 2 ) .
E x ( z ) = F F T { C sin θ A F N cos θ cos φ exp [ i k r sin θ cos ( φ ψ ) i k z cos θ ] sin θ } ,
E y ( z ) = F F T { C sin θ A F N cos θ sin φ exp [ i k r sin θ cos ( φ ψ ) i k z cos θ ] sin θ } ,
E z ( z ) = F F T { C sin θ A F N sin θ exp [ i k r sin θ cos ( φ ψ ) i k z cos θ ] sin θ } .
I ( z ) = | E x ( z ) | 2 + | E y ( z ) | 2 + | E z ( z ) | 2 .
Select as filters


Select Topics Cancel
© Copyright 2024 | Optica Publishing Group. All rights reserved, including rights for text and data mining and training of artificial technologies or similar technologies.