Expand this Topic clickable element to expand a topic
Skip to content
Optica Publishing Group

Wavefront sensorless adaptive optics: a general model-based approach

Open Access Open Access

Abstract

Wavefront sensorless adaptive optics (AO) systems have been widely studied in recent years. To reach optimum results, such systems require an efficient correction method. In this paper, a general model-based correction method for a wavefront sensorless AO system is presented. The general model-based approach is set up based on a relationship wherein the second moments (SM) of the wavefront gradients are approximately proportionate to the FWHM of the far-field intensity distribution. The general model-based method is capable of taking various common sets of functions as predetermined bias functions and correcting the aberrations by using fewer photodetector measurements. Numerical simulations of AO corrections of various random aberrations are performed. The results show that the Strehl ratio is improved from 0.07 to about 0.90, with only N + 1 photodetector measurement for the AO correction system using N aberration modes as the predetermined bias functions.

©2010 Optical Society of America

1. Introduction

Using wavefront sensorless adaptive optics (AO) is a better option than using distinct, separate wavefront sensor AOs for some applications that benefit from AO correction [1], such as inertial confinement fusion (ICF), optical tracking, free-space laser propagation, and microscopy. The most significant potential advantage of wavefront sensorless AOs is related to the fact that far-field intensity distribution is permitted to be used as the feedback signal, which in turn allows a wavefront sensorless AO to be used in poor illumination situations. Wavefront sensorless AO systems operate by sequentially modulating the AO corrector and maximizing a feedback signal according to particular optimization algorithms. These particular optimization algorithms include model-free methods and model-based methods. Model-free methods contain stochastic, local, or global search methods. Among model-free methods it is noticeable that works on stochastic gradient methods proposed by M. Vorontsov are verified to be the fastest search methods [2], whereas many measurements are still required for stochastic gradient methods. Taking Ref. [3] as an example, more than 100 measurements are still needed for a 25-element actuator deformable mirror (DM) in order to correct the aberrations [3], and the more measurements that are required, the more difficult it is to realize real-time AO systems.

The model-based approach is a much better choice to reduce the measurements. M. J. Booth has proposed that the model-based approach is capable of correcting aberrations with a minimum of N + 1 photodetector measurements for N aberration modes [4,5]. However, the model-based method requires taking different sets of functions as the predetermined bias functions for aberrations of various magnitudes. The approach requires taking Zernike functions as the predetermined bias functions for small aberrations and Lukosz–Zernike (L–Z) functions for large aberrations [4,5]. Furthermore, the accuracy of this method relies upon factors such as bias values (the coefficients of the predetermined bias functions).

In this paper we propose a general model-based approach. Unlike the model-based method proposed by M. J. Booth, the general approach is insensitive to the selection of sets of functions as well as the bias values. Besides the L-Z functions, the general model-based approach can also take other kind of modes, such as the Zernike functions, as the predetermined bias functions and correct aberrations effectively, not only for large aberrations but also for small aberrations.

2. The general model-based method

2.1 The wavefront sensorless AO correction system

It is well known that a wavefront sensorless AO can be depicted as shown in Fig. 1 , where the input wavefront is incident from the left. After aberration correction by the adaptive element (the DM), the input wavefront is focused onto the photodetector by a positive lens. A mark or a small pinhole is placed on the photodetector to generate a feedback signal, such as the encircled energy signal [4]. The encircled energy feedback signal is produced by using a pinhole, which filters energy outside the pinhole and allows energy within the pinhole to be obtained by the photodetector. The feedback signal is used to drive the AO element. The aberration of the input wavefront is described by function Φ(x,y), where x and y are the rectangular coordinates in the pupil plane of the lens. The correcting aberration of the DM is depicted by function Ψ(x,y). The residual aberration of the input wavefront is represented by function R(x,y), and R(x,y)=Φ(x,y)Ψ(x,y). The corresponding far-field intensity distribution I(x',y') is given by [6]

I(x',y')=I0|A(x,y)exp[jR(x,y)]ejk2z[(xx')2+(yy')2]dxdy|2,
where x’ and y’ are the rectangular coordinates in the input plane of the photodetector; I0 is proportional to the incident light power; A is the amplitude of the input wavefront and is set to be uniform in this paper; k=2π/λ, λ is the wavelength of the input wavefront; z is the focal length of the positive lens; and j is the imaginary unit.

 figure: Fig. 1

Fig. 1 Schematic diagram of the adaptive system.

Download Full Size | PDF

We assume that Φ and Ψ can be represented by a series of M orthonormal functions and N orthonormal functions, respectively. In most cases, the number M is greater than N. The orthonormal functions are called modes in the following text, each mode denoted by Fi(x,y):

Φ(x,y)=i=1MνiFi(x,y),Ψ(x,y)=i=1NμiFi(x,y).

In other words, aberration Φ of the input wavefront can be denoted by vector V, whose elements are the coefficients ofνi. Similarly, the correcting aberration and the residual aberration R can be represented by vectors U and Z, respectively. The elements of U and Z are μi and zi, and moreover, zi=νiμi. Minimizing vector Z is the job of this wavefront sensorless AO system.

In order to minimize vector Z efficiently, the relationship between the information about the far-field intensity distribution and the aberration need to be set up first.

2.2 Relationship between the second moment (SM) of the aberration gradients and the FWHM of far-field intensity distribution

As we know, the centroid of the far-field intensity distribution in geometric optics is related to the aberration of the input wavefront. When only the Seidel aberration of tilt is present, this relationship can be described by the following expression [6,7]:

[xΦ(x,y)]2+[yΦ(x,y)]2(x'2+y'2).

When other aberrations are present, those aberrations can be considered as the compositions of many small pieces of Seidel aberrations of tilt. The relationship between tilt aberrations in each small piece and the centroid of the corresponding spot observes the rule in Eq. (3). Hence from this fact we may deduce that the sum of all small pieces have the following relationship:

i=0TL{[xΦi(x,y)]2+[yΦi(x,y)]2}i=0TLFIi(x',y')(x'2+y'2),
where Φi(x,y) stands for the ith piece of tilt. TL is the total number of the small pieces. TLF is the total number of points in the far-field intensity distribution. Ii(x',y') is the far-field intensity at point (x',y'). Since the far-field intensity distributions of two or more small pieces of input wavefronts may locate at the same point, a variable related to the number of pieces is needed. On the other hand, it is well known that the magnitude of a certain far-field intensity distribution is positive proportional to the number of input small-piece wavefronts in geometric optics, on the condition that those input small-piece wavefronts have the same tilt aberrations. Therefore, Ii(x',y') is present in Eq. (4) to represent the total number of input small-piece wavefronts that focus on point (x', y').

When TL→∞, TLF→∞, Eq. (4) is

xy{[xΦ(x,y)]2+[yΦ(x,y)]2}dxdyx'y'I(x',y')(x'2+y'2)dx'dy',
where the left-hand side of Eq. (5) is the SM of the aberration gradients, and the right-hand side of Eq. (5) is the sum of the far-field intensity distribution multiplied by a mask [corresponding position(x'2+y'2)]. Since the total sum of far-field intensity distribution I(x',y') is a constant, the right-hand side of Eq. (5) is changed to be

x'y'I(x',y')(x'2+y'2)dx'dy'=R2x'y'I(x',y')(x'2+y'2)R2dx'dy'=R2{x'y'I(x',y')dx'dy'x'y'I(x',y')[1r2R2]dx'dy'}.

Hence the mask becomes 1r2/R2 for r ≤ R and zero otherwise. r=x'2+y'2, R is a suitable chosen detector radius, and R is weighted by the system’s diffraction limitation (DL). The significant advantage of the new mask is that the new mask is insensitive to the actual detected size. In practice, the new mask could be implemented by the weighting of pixels within the software when the photodetector is replaced by a CCD camera.

Moreover, the whole expression of the right-hand side of Eq. (5) is also normalized by dividing by the sum of I(x',y'). For convenience, Eq. (7) is called the masked detector signal (MDS) in subsequent text.

MDS=x'y'I(x',y')[1r2R2]dx'dy'x'y'I(x',y')dx'dy'.

Therefore, Eq. (5) can be expressed as

SMc0(1MDS) ,
where SM is the second moment of the aberration gradients, and c0 is the slop of the trend line, which is determined by the detector radius R.

To verify the relationship between the SM of the aberration gradients and the MDS, 500 random atmospheric aberrations with various secondary moments are produced by the method proposed by Noll in [8]. By calculating the SM of the 500 random atmospheric aberration gradients as well as the MDS from the corresponding far fields, the signal responses are depicted in Fig. 2 . The values of detector radius R are set to be 5 DL, 12 DL and 24 DL, respectively. An approximate linearity between the SM of the aberration gradient and the MDS is obtained in Fig. 2 when the detector radius R is selected to be 12 DL or 24 DL. The approximate linearity has some errors when detector radius R = 5 DL. The errors of approximate linearity result from the cases where the distributions of the far field go beyond the calculated area. Thus, when the detector radius R is suitably chosen, the MDS is considered to be related to the SM of the aberration gradients by using Eq. (8).

 figure: Fig. 2

Fig. 2 Signal response between SM of aberration gradient and MDS. 500 random aberrations are taken for study.

Download Full Size | PDF

For comparison, the relationship used in [4] is numerically calculated by using random aberrations and are drawn in Fig. 3 . The relationship between the MDS and aberration magnitude |V| is calculated from 5000 random aberrations, and each circle in Fig. 3 stands for the mean MDS of 100 random aberrations. Note that the MDS is essentially the same as the detected signal used in [4], except that the MDS is normalized by the sum of the far-field intensity. Obviously, our method results in a linear relationship between the MDS and the SM.

 figure: Fig. 3

Fig. 3 Signal response between aberration magnitude |V| and MDS. Each point in the figure stands for the mean MDS values of 100 random aberrations.

Download Full Size | PDF

2.3 Modeling and analysis

So far, the relationship between the MDS and the SM of the aberration gradient has been set up. We will build the general model-based method for a wavefront sensorless AO system according to the relationship.

As we know that the aberration of an input wavefront can be expressed by a series of M orthonormal modesFi(x,y),

Φ(x,y)=i=1MνiFi(x,y).

The input aberration Φ’s gradients in the x and y axes are [9]

xΦ(x,y)=i=1MνixFi(x,y),yΦ(x,y)=i=1MνiyFi(x,y),
where Φ(x,y)/x and Φ(x,y)/y are the input aberration gradients in the x and y axes, respectively, and Fi(x,y)/x and Fi(x,y)/y are the orthonormal mode gradients in the x and y axes, respectively.

In order to find out the value of coefficients vi, the N orthonormal mode Fi(x,y) is taken as the predetermined bias function and is added by the DM sequentially with coefficient α to the input aberrations. Then the detector measurements are recorded.

The difference wi,0 between the SM of the gradients of aberration Φ and that after adding a predetermined bias function Fi(x,y) with the coefficient α is declared as

wi,0=SMiSM0=s{[xΦ(x,y)+αxFi(x,y)]2+[yΦ(x,y)+αyFi(x,y)]2}{[xΦ(x,y)]2+[yΦ(x,y)]2}dxdys,
whereSM0andSMiare the SMs of the gradients of Φ(x,y) and Φ(x,y)+αFi(x,y), respectively. By rearranging, we can get

wi,0=s[αxFi(x,y)2xΦ(x,y)]dxdys+s[αxFi(x,y)]2dxdys+s[αyFi(x,y)2yΦ(x,y)]dxdys+s[αyFi(x,y)]2dxdys=s2α[xFi(x,y)xΦ(x,y)+yFi(x,y)yΦ(x,y)]dxdys+sα2{[xFi(x,y)]2+[yFi(x,y)]2}dxdys.

After N predetermined bias functions Fi(x, y) are added sequentially by the DM, we will obtain N equations

wi,0=s2α[xFi(x,y)xΦ(x,y)+yFi(x,y)yΦ(x,y)]dxdys+sα2{[xFi(x,y)]2+[yFi(x,y)]2}dxdys,i=1~N.

Replacing the gradients of aberration Ф by Eq. (9), it is convenient to use the vectors that represent the sampled values of the original functions. Equation (12) becomes

W=2α*S*V+α2*Sm,
where S=(s1,1s1,2...s1,Ns2,1s2,2.........sN1,NsN,1...sN,N1sN,N............s1,M...sN1,MsN,M), Sm=(s1,1s2,2...sN,N), W=(w1,0w2,0...sN,0), V=(ν1ν2...νN...νM), and

sn,m=s{[xFn(x,y)*xFm(x,y)]+[yFn(x,y)*yFm(x,y)]}dxdys.

Generally, S is invertible, so the coefficient V of the modes can be calculated by

V=S1(Wα2*Sm)2*α.

According to Eq. (8) and Eq. (10), we have

wi,0=SMiSM0c0(1MDSi)-c0(1MDS0)-c0(MDSi-MDS0),
where MDS0 and MDSi are the corresponding MDS of input wavefronts Ф(x, y) and Ф(x, y) + αFi(x, y), respectively. That is, Wc0M, thus the vector V is estimated by
VS1(c0*Mα2*Sm)2*α,
where M=(MDS1MDS0MDS2MDS0...MDSNMDS0).

Note that vector V can be resolved exactly by Eq. (15), no matter what sets of modes are adopted as the predetermined bias functions; hence, it can be concluded that the method is insensitive to the selected sets of modes as well as the choice of the bias value α, and the correction error comes only from Eq. (8). Furthermore, it is known that Eq. (8) is set up based on geometric optics and is satisfied with all kinds of aberrations as long as a suitable detector radius R is selected.

3. Numerical simulations and result analysis

To verify the performance of the general method, 50 random aberrations with a normalized root-meaning-square value of 0.5 λ are produced as the input wavefront aberrations in the paper. The aberrations consist of 18 Zernike modes represented by random coefficients ui. The 18 Zernike modes and 18 L–Z modes are taken sequentially to be the biases of the model-based method and added by the DM to the input wavefront.

The 18 Zernike modes and 18 L–Z modes are drawn in Fig. 4(a) and Fig. 4(b), respectively. The corresponding inverses S−1 are shown in Fig. 5(a) and Fig. 5(b) respectively.

 figure: Fig. 4

Fig. 4 Set of aberration modes: (a) 18 Zernike modes (3–20); (b) 18 L–Z modes (3–20).

Download Full Size | PDF

 figure: Fig. 5

Fig. 5 Corresponding inverse S−1 of sets of aberration modes in Fig. 4: The inverse S−1 of (a) 18 Zernike modes (3-20), (b) 18 L–Z modes (3-20).

Download Full Size | PDF

The biasesFi(x,y)with coefficient α=0.05λ are added sequentially to the aberrations, given the total aberrations U+V,UT=[u1,u2,...,u18],VT=[0,0,.α..,0], and the corresponding MDSi are calculated. The detector radius of far-field intensity R equals 16DL, and the slope of the trend line c0 is −194.1. After the total N biases are added, the correction vector V is calculated by Eq. (16): i = 1,2…,18.

The Strehl Ratio (SR) is adopted to evaluate the correction results, and the SR is defined as

SR=P[I(x,y)]P[I0(x,y)],
whereP[]is an operation, which calculates the peak intensity. I is the actual intensity distribution, and I0 is the intensity distribution when no aberrations are present.

Since the vector S−1, Sm, and MDS0 are known in advance and do not need recalculation or measurement for each correction, the total detector measurements for one aberration correction would be 19. One aberration correction is considered as one iteration in the following results.

Figure 6(a) and Fig. 6(b) show the corresponding results after correction by the method proposed by M. J. Booth and the method in this paper, respectively. The icons “method 1” and “method 2” in the figure correspond to the method proposed by M. J. Booth and our method, respectively. The method proposed by M. J. Booth is referred to as previous works in the following text.

 figure: Fig. 6

Fig. 6 Results of AO correction using (a) L–Z modes and (b) Zernike modes as the predetermined bias functions; the surfaces of the modes are drawn in Fig. 4. The icons “method 1” and “method 2” in the figure refer to the method proposed by M. J. Booth and the method proposed in this paper.

Download Full Size | PDF

According to the correction results in Fig. 6(a) and Table 1 , it is not difficult to find that our method is more efficient than previous works when the L–Z polynomials are taken as the predetermined bias functions. When using previous works to correct the aberrations, the values of SR rise slowly when the aberrations become smaller. However, our method works well both for small and large aberrations.

Tables Icon

Table 1. SR Results of AO Correction Using L–Z Modes by Our Method and Previous Works

From Fig. 6(b), we see that the correction results of using the method of previous works decline when Zernike polynomials are taken as the predetermined bias functions. However, the correction results of our method keep consistent with those using L–Z polynomials as the biases. The correction results indicate that our method is insensitive to the selected sets of functions.

The effect of bias values (the coefficient α) on correction results is under consideration. We take the L–Z modes and Zernike functions as the predetermined bias functions and change the value of coefficient α. The curves of such iterations are show in Fig. 7 . As we have expected in the conclusions of Section 2, our method is insensitive to the choice of bias values.

 figure: Fig. 7

Fig. 7 Results of AO correction using (a) L–Z and (b) Zernike modes when coefficientαvary from 0.02 to 0.4 λ. The icons “method 1” and “method 2” in the figure refer to the method proposed by M. J. Booth and the method proposed in this paper.

Download Full Size | PDF

4. Conclusions

A general model-based approach for wavefront sensorless adaptive optics is presented in the paper. The general model-based approach is set up based on the approximate linearity between the MDS and the SM of the wavefront gradient. Because the general method is insensitive to the selected modes, it permits correction of the aberrations by using all kinds of orthogonal aberrations as the predetermined bias functions. Numerical simulations of AO correction to the random aberrations have shown that the general method is a fast and stable wavefront sensorless AO correction method for all kind of modes.

Acknowledgments

The authors thank Wenham Jiang for providing good suggestions. The authors acknowledge helpful suggestions from the reviewers and help from the editors.

References and links

1. B. Wang and M. J. Booth, “Optimum deformable mirror modes for sensorless adaptive optics,” Opt. Commun. 282(23), 4467–4474 (2009). [CrossRef]  

2. M. A. Vorontsov, G. W. Carhart, M. Cohen, and G. Cauwenberghs, “Adaptive optics based on analog parallel stochastic optimization: analysis and experimental demonstration,” J. Opt. Soc. Am. A 17(8), 1440–1453 (2000). [CrossRef]  

3. P. Piatrou and M. Roggemann, “Beaconless stochastic parallel gradient descent laser beam control: numerical experiments,” Appl. Opt. 46(27), 6831–6842 (2007). [CrossRef]   [PubMed]  

4. M. J. Booth, “Wavefront sensorless adaptive optics for large aberrations,” Opt. Lett. 32(1), 5–7 (2007). [CrossRef]  

5. M. J. Booth, “Wave front sensor-less adaptive optics: a model-based approach using sphere packings,” Opt. Express 14(4), 1339–1352 (2006). [CrossRef]   [PubMed]  

6. M. Born and E. Wolf, Principles of Optics, 6th ed. (Pergamon, 1983).

7. J. Braat, “Polynomial expansion of severely aberrated wave fronts,” J. Opt. Soc. Am. A 4(4), 643–650 (1987). [CrossRef]  

8. R. Noll, “Zernike polynomials and atmospheric turbulence,” J. Opt. Soc. Am. 66(3), 207–211 (1976). [CrossRef]  

9. W. J. Hardy, Adaptive Optics for Astronomical Telescopes, (Oxford Univ. Press, 1998).

Cited By

Optica participates in Crossref's Cited-By Linking service. Citing articles from Optica Publishing Group journals and other participating publishers are listed here.

Alert me when this article is cited.


Figures (7)

Fig. 1
Fig. 1 Schematic diagram of the adaptive system.
Fig. 2
Fig. 2 Signal response between SM of aberration gradient and MDS. 500 random aberrations are taken for study.
Fig. 3
Fig. 3 Signal response between aberration magnitude |V| and MDS. Each point in the figure stands for the mean MDS values of 100 random aberrations.
Fig. 4
Fig. 4 Set of aberration modes: (a) 18 Zernike modes (3–20); (b) 18 L–Z modes (3–20).
Fig. 5
Fig. 5 Corresponding inverse S−1 of sets of aberration modes in Fig. 4: The inverse S−1 of (a) 18 Zernike modes (3-20), (b) 18 L–Z modes (3-20).
Fig. 6
Fig. 6 Results of AO correction using (a) L–Z modes and (b) Zernike modes as the predetermined bias functions; the surfaces of the modes are drawn in Fig. 4. The icons “method 1” and “method 2” in the figure refer to the method proposed by M. J. Booth and the method proposed in this paper.
Fig. 7
Fig. 7 Results of AO correction using (a) L–Z and (b) Zernike modes when coefficientαvary from 0.02 to 0.4 λ. The icons “method 1” and “method 2” in the figure refer to the method proposed by M. J. Booth and the method proposed in this paper.

Tables (1)

Tables Icon

Table 1 SR Results of AO Correction Using L–Z Modes by Our Method and Previous Works

Equations (19)

Equations on this page are rendered with MathJax. Learn more.

I ( x ' , y ' ) = I 0 | A ( x , y ) exp [ j R ( x , y ) ] e j k 2 z [ ( x x ' ) 2 + ( y y ' ) 2 ] d x d y | 2 ,
Φ ( x , y ) = i = 1 M ν i F i ( x , y ) , Ψ ( x , y ) = i = 1 N μ i F i ( x , y ) .
[ x Φ ( x , y ) ] 2 + [ y Φ ( x , y ) ] 2 ( x ' 2 + y ' 2 ) .
i = 0 T L { [ x Φ i ( x , y ) ] 2 + [ y Φ i ( x , y ) ] 2 } i = 0 T L F I i ( x ' , y ' ) ( x ' 2 + y ' 2 ) ,
x y { [ x Φ ( x , y ) ] 2 + [ y Φ ( x , y ) ] 2 } d x d y x ' y ' I ( x ' , y ' ) ( x ' 2 + y ' 2 ) d x ' d y ' ,
x ' y ' I ( x ' , y ' ) ( x ' 2 + y ' 2 ) d x ' d y ' = R 2 x ' y ' I ( x ' , y ' ) ( x ' 2 + y ' 2 ) R 2 d x ' d y ' = R 2 { x ' y ' I ( x ' , y ' ) d x ' d y ' x ' y ' I ( x ' , y ' ) [ 1 r 2 R 2 ] d x ' d y ' } .
MDS = x ' y ' I ( x ' , y ' ) [ 1 r 2 R 2 ] d x ' d y ' x ' y ' I ( x ' , y ' ) d x ' d y ' .
SM c 0 ( 1 M D S ) ,
Φ ( x , y ) = i = 1 M ν i F i ( x , y ) .
x Φ ( x , y ) = i = 1 M ν i x F i ( x , y ) , y Φ ( x , y ) = i = 1 M ν i y F i ( x , y ) ,
w i , 0 = S M i S M 0 = s { [ x Φ ( x , y ) + α x F i ( x , y ) ] 2 + [ y Φ ( x , y ) + α y F i ( x , y ) ] 2 } { [ x Φ ( x , y ) ] 2 + [ y Φ ( x , y ) ] 2 } d x d y s ,
w i , 0 = s [ α x F i ( x , y ) 2 x Φ ( x , y ) ] d x d y s + s [ α x F i ( x , y ) ] 2 d x d y s + s [ α y F i ( x , y ) 2 y Φ ( x , y ) ] d x d y s + s [ α y F i ( x , y ) ] 2 d x d y s = s 2 α [ x F i ( x , y ) x Φ ( x , y ) + y F i ( x , y ) y Φ ( x , y ) ] d x d y s + s α 2 { [ x F i ( x , y ) ] 2 + [ y F i ( x , y ) ] 2 } d x d y s .
w i , 0 = s 2 α [ x F i ( x , y ) x Φ ( x , y ) + y F i ( x , y ) y Φ ( x , y ) ] d x d y s + s α 2 { [ x F i ( x , y ) ] 2 + [ y F i ( x , y ) ] 2 } d x d y s , i = 1 ~ N .
W = 2 α * S * V + α 2 * S m ,
s n , m = s { [ x F n ( x , y ) * x F m ( x , y ) ] + [ y F n ( x , y ) * y F m ( x , y ) ] } d x d y s .
V = S 1 ( W α 2 * S m ) 2 * α .
w i , 0 = S M i S M 0 c 0 ( 1 M D S i )- c 0 ( 1 M D S 0 ) - c 0 ( M D S i - M D S 0 ) ,
V S 1 ( c 0 * M α 2 * S m ) 2 * α ,
S R = P [ I ( x , y ) ] P [ I 0 ( x , y ) ] ,
Select as filters


Select Topics Cancel
© Copyright 2024 | Optica Publishing Group. All rights reserved, including rights for text and data mining and training of artificial technologies or similar technologies.