Overlay control is of vital importance to good device performances in semiconductor manufacturing. In this work, the differential Mueller matrix calculus is introduced to investigate the Mueller matrices of double-patterned gratings with overlay displacements, which helps to reveal six elementary optical properties hidden in the Mueller matrices. We find and demonstrate that, among these six elementary optical properties, the linear birefringence and dichroism, LB′ and LD′, along the ± 45° axes show a linear response to the overlay displacement and are zero when the overlay displacement is absent at any conical mounting. Although the elements from the two 2 × 2 off-diagonal blocks of the Mueller matrix have a similar property to LB′ and LD′, as reported in the literature, we demonstrate that it is only valid at a special conical mounting with the plane of incidence parallel to grating lines. The better property of LB′ and LD′ than the Mueller matrix elements of the off-diagonal blocks in the presence of overlay displacement verifies them to be a more robust indicator for the diffraction-based overlay metrology.
© 2017 Optical Society of America
The unceasing requirement of small semiconductor device features drives the development of many new optical lithography techniques, of which double patterning or multi-patterning lithography has emerged as a promising enhancement technique to reduce the critical dimension (CD) in the pattern on a wafer [1–3]. For the double patterning technique, the 2nd grating feature must be accurately aligned and printed at the half pitch of the 1st grating. A misalignment that results in a so-called overlay error will lead to a non-yielding device [4, 5]. According to the ITRS (International Technology Roadmap for Semiconductors, www.itrs2.net), the requirement of the maximum allowable overlay error is 4 nm (3σ) at today’s feature sizes, but will approach to as small as 2 nm in 2018. The demand for the tighter overlay control poses serious challenges to the traditional image based overlay metrology. Over the past years, the diffraction based overlay (DBO) metrology techniques were developed to address the challenges [6–12].
The implementation of the DBO metrology techniques mainly involves two key issues, one is a specially designed DBO target and another is the specific measurement and analysis method. Moreover, the design of the DBO target that typically consists of one-dimensional gratings in the X and Y directions is usually closely related with the specific measurement and analysis method. A general requirement for the DBO target is that its size should be as small as possible because of the limited space reserved for the target in the pattern on a wafer. Most of current overlay measurement techniques, such as the normal incidence spectroscopic reflectometry (NISR) , the angle-resolved scatterometry [7, 8], the phase-structured illumination technique , and the Mueller matrix ellipsometry (MME) [10–12], arise from the optical scatterometry, also termed as the optical critical dimension (OCD) metrology, which has been developed for the measurement of CD in semiconductor manufacturing . For this reason, the data analysis methods that have been developed for OCD metrology could be naturally applied for overlay metrology. Moreover, in this case, a DBO target that contains only a single cell per direction (X and Y) is enough to obtain the overlay error, which can therefore reduce the total target size as well. However, this data analysis approach involves complicated computation-intensive inverse problem solving [14–16], which is typically ill-posed, especially when multi-parameters including the overlay error are floated together to achieve a best fit. Thus, the data analysis approach directly inherited OCD metrology on a single cell is usually difficult to extract overlay accurately. To this end, an empirical analysis approach that does not need to solve the inverse problem was adopted to measure the overlay error by using DBO targets that usually consist of multiple cells per direction [6, 8, 11, 12].
The empirical approach relies on a linear relation between the optical signature of the sample under test and the overlay displacement within a small range. Here, the term signature represents a property of light, which can be in the form of amplitude, phase, polarization, reflectance, or ellipsometric parameters, etc. For the NISR measurement technique , which collects the reflectance spectra of the zeroth-order diffracted light, it requires a DBO target that contains at least three cells per direction with suitably designed shifts to measure the overlay error. Moreover, to improve the sensitivity, the designed shifts between two resist lines in the DBO target of NISR are typically set as 25~35% of pitch, which makes the target less similar to device structures and is especially troublesome to the double patterning processes. Other measurement techniques that use less number of cells per target were also developed. The angle-resolved scatterometry that collects the ± 1st diffraction orders uses only two cells per target . Its measurement is based on the fact that the overlay error will induce asymmetric intensity distribution, which is particularly pronounced in the higher diffraction orders. However, the ± 1st order scatterometry poses a very tight requirement on the pupil uniformity and the quality of the calibration method used to reduce the effect of pupil non-uniformity. The MME also uses a target that consists of only two cells per direction but collects the zeroth-order diffracted light [11, 12]. Since the zeroth-order diffraction usually has a large intensity than higher orders, the measurement provides a good signal-to-noise ratio. In addition, 16 elements of a 4 × 4 Mueller matrix provide much more information than a single intensity in angle-resolved scatterometry, and moreover, the overlay displacement introduces a composite structural asymmetry [17–21], which makes MME a good candidate for the DBO metrology.
Current DBO metrology using MME is typically based on a property that the elements from the two 2 × 2 off-diagonal blocks of the Mueller matrix are zero when the overlay displacement is absent, otherwise will deviate from zero and respond to the overlay displacement linearly [10–12]. However, we will show in this work that the above property of the off-diagonal block elements of the Mueller matrix is essentially only valid at the azimuthal angle of ϕ = 90°, i.e., with the plane of incidence parallel to grating lines. Furthermore, it is also necessary to accurately align the sample at ϕ = 90° when performing measurement, since even a minor offset of 0.1° from ϕ = 90° will lead to a large measurement error. This makes the DBO metrology using MME lack of robustness. In this work, we applied the differential Mueller matrix calculus  to investigate the Mueller matrices of double-patterned grating structures with overlay displacements. The differential Mueller matrix calculus that can be obtained by recently developed Mueller matrix differential decomposition [23, 24] summarizes the elementary optical properties of the medium which influence the Stokes vector, including the linear birefringence along the p-s and ± 45° axes, the linear dischroism along the p-s and ± 45° axes, and the circular birefringence and dichroism. We found and demonstrated that, among the above elementary optical properties, the linear birefringence and dichroism along the ± 45° axes exhibited a linear response to the overlay displacement and were zero when the overlay displacement was absent at any conical mounting (with the plane of incidence no longer perpendicular to grating lines). The simulation results demonstrated that the linear birefringence or dichroism along the ± 45° axes was a more robust overlay indicator than the off-diagonal block elements of the Mueller matrix for the DBO metrology.
The polarization state of an electromagnetic wave can be described by Stokes vectors that consist of four elements
In the differential Mueller matrix calculus, the differential Mueller matrix m relates the Mueller matrix M to its spatial derivative along the light propagation direction z as Eq. (3) can be achieved by taking the logarithm of M if m does not depend on z, that is [23, 24],Eq. (4), L is the accumulated differential Mueller matrix and , with l being the optical path length in the medium. and are the G-antisymmetric (defined by ) and G-symmetric (defined by ) terms of L, respectively. Here, G is the Minkowski metric and . The superscript “T” denotes the matrix transpose. The term contains the mean values of the elementary optical properties, while contains the anisotropic absorption as well as the uncertainties of the corresponding elementary optical properties resulting from depolarization. It is shown that, if M is a nondepolarizing Mueller matrix, we will have within the experimental error, provided that the isotropic absorption has been subtracted beforehand from the diagonal elements of L. The term is given by 
The accumulated differential Mueller matrix L can be readily calculated by Eq. (4) for a given experimental Mueller matrix , however, it should be pointed out that the associated L is not always physical . To avoid an unphysical L, it is recommended to perform sum decomposition for first. According to the sum decomposition, any Mueller matrix M (nondepolarizing or depolarizing) can be represented as a weighted sum of four nondepolarizing Mueller matrices (k = 1, 2, 3, 4) 28]. We can then calculate the logarithm L of using Eq. (4).
It is also worth pointing out that the differential Mueller matrix m (L) has a direct physical interpretation only in transmission measurements. Although we can also calculate the logarithm L of a sample Mueller matrix obtained in reflection using Eq. (4), here L will not be a direct read of the elementary optical properties of the sample as it happens in transmission [21, 29]. The calculation of L by Eq. (4) in a reflection mode, as we will see in the remainder of this paper, should be understood as a pure mathematical operation used to reveal the minimum number of independent parameters that are necessary to describe the change of polarization upon reflection. Moreover, we will show in Section 3 that the limitation of lack of direct physical meaning of the elementary optical properties obtained in reflection does not affect them to be indicators for overlay monitoring.
Figure 1 depicts a polarized light incidence upon a double-patterned grating structure with a pitch of Λ and an overlay displacement of δ. We assume that corresponds to the position of Λ/2 and for the overlay displacement shown in Fig. 1 (left overlay) and otherwise (right overlay). Without loss of generality, we denote as the Mueller matrix of the double-patterned structure with an overlay displacement of δ and obtained at the azimuthal angle of ϕ, and as the Mueller matrix obtained after a rotation of the sample by π around z-axis. According to the principle of electromagnetic reciprocity , when a light beam passes through the media (or is reflected or scattered) in a reverse direction, the reciprocal Mueller matrix can be represented byEq. (9) is thus also equivalent toEq. (10), we can further derive thatEqs. (5) and (11), we have
Considering the electromagnetic reciprocity as well as the reflection symmetry of the double-patterned grating in Fig. 1 relative to the plane that is perpendicular to grating lines, we can restrict the azimuthal angle ϕ to the range of 0 to 90°. In addition, we assume that the grating is composed of only reciprocal materials and the investigated Mueller matrices are exclusively associated with the zeroth-order diffracted light. Based on Eqs. (9)-(12), we can derive the follow properties that will be useful in overlay measurement.
- – Property 1: Among the six elementary optical properties (LB, LD, LB′, LD′, CB, and CD), only LB′ and LD′ are sensitive to the direction of overlay displacement. When , we will have (i.e., LB′ = LD′ = 0) for any azimuthal configuration. However, when , we will have for any azimuthal configuration except ϕ = 0°.
- – Property 2: When ϕ = 0°, we have (i.e., LB′ = LD′ = CB = CD = 0) for any overlay displacement; and when ϕ = 90°, we have for any overlay displacement.
The property of when in Property 1 can be readily derived from Eq. (12). When , the overlay displacement breaks the C2z and σy symmetry which thereby leads to at . Here, C2z and σy are standard symbols in group theory used to denote symmetry operations and symmetry types .The Property 1 indicates that we can judge whether or not the grating sample has an overlay displacement from the absolute value of and meanwhile we can distinguish the direction of overlay displacement from the sign of . The proof of Property 2 is given in Appendix A. According to Eq. (10) as well as Properties 1 and 2, we can obtain the following relation for any element of the two 2 × 2 off-diagonal blocks of the Mueller matrix at ϕ = 90°Eq. (13) was also the starting point for most of the reported literature dealing with overlay measurement based on the Mueller matrix formalism [10–12]. However, we should note that Eq. (13) is only valid at ϕ = 90°.
According to Property 1, we can adopt (or ) as the indicator for overlay monitoring. In actual measurements, the sample Mueller matrices are typically collected in a spectral range. Therefore, the extracted from the collected Mueller matrices will also vary in a spectral range. For overlay measurement, it is necessary to define a scalar indicator that has a linear relation with respect to the overlay displacement within a small range. In this work, we define the scalar overlay indicator asEq. (14) corresponds to the mean weighting approach that was commonly adopted for the Mueller matrix elements of the 2 × 2 off-diagonal blocks in the reported literature [10, 11]. In this work, we propose another approach which we term as the principal component (PC) weighting approach based on the principal component analysis (PCA) . In the PC weighting approach, that satisfy are the elements of the eigenvector corresponding to the largest eigenvalue of the covariance matrix of . The covariance matrix of can be estimated by calculating at different overlay displacements. In this case, Eq. (14) is essentially the 1st PC of , which is expected to show a linear relation with respect to the overlay displacement within a small range according to PCA. We will also show in Section 3 that the PC-based scalar indicator is more robust than the mean-based indicator in most cases. With the scalar overlay indicator, we can design the DBO target to realize overlay measurement. Figure 2 illustrates the layout the DBO target, where d (or −d) is a known value that represents the designed shift and ε is the actual overlay error induced in the manufacturing processes. According to Fig. 2, we know that the total overlay displacement will be δ = d + ε (or −d + ε). Based on the linear assumption between the scalar indicator and the overlay displacement δ, we can obtain the actual overlay error ε by
3. Results and discussion
3.1 Sample description
Figure 3 depicts the schematic of the investigated double-patterned grating in the simulation, which has a pitch of Λ = 660 nm. The profile of Resist1 is characterized by top CD w1 = 120 nm, line height h1 = 50 nm, and sidewall angle α1 = 87°, and the profile of Resist2 is characterized by top CD w2 = 100 nm, line height h2 = 140 nm, and sidewall angle α2 = 87°. The thickness of the BARC (bottom anti-reflective coating) layer is t = 30 nm. The total overlay displacement is denoted as δ, which should be the sum of the designed shift d and the actual overlay error ε, as illustrated in Fig. 2. In the simulation, the Mueller matrices are calculated using rigorous coupled-wave analysis [34, 35] in the spectral range of 250 to 800 nm with an increment of 5 nm and by fixing the incidence angle at θ = 65°. The Mueller matrices at different azimuthal angles varied from 0 to 90° will be examined.
3.2 Sensitivity analysis
We first performed simulation for the double-patterned grating given in Fig. 3 to give an intuitive understanding of Properties 1 and 2. Figure 4 presents the spectra of the six elementary optical properties extracted from the simulated Mueller matrices at azimuthal angles varied from 0 to 90° with an increment of 30°. We can observe from Fig. 4 that when ϕ = 0°, LB′ = LD′ = CB = CD = 0, and when ϕ ≠ 0°, only LB′ and LD′ exhibit sensitivity to the direction of overlay displacement while other optical properties are at most sensitive to the absolute value of the overlay displacement. In addition, when ϕ = 90°, CB = CD = 0 even if the overlay displacement δ ≠ 0. However, when ϕ ≠ 90° and ϕ ≠ 0°, CB and CD will not be equal to zero any more. The nonzero property of CB and CD when 0 < ϕ < 90° at oblique incidence can be used to assist the alignment of azimuthal angle at ϕ = 90°. The necessity of accurate alignment of azimuthal angle at ϕ = 90° will be discussed in Section 3.4. The specific alignment process is to adjust the sample stage until the associated CB and CD are within the experimental error.
We also checked the spectral response of the Mueller matrix elements of the 2 × 2 off-diagonal blocks when the grating sample has an overlay displacement. We took the elements m13 and m14 as an example. Considering that the off-diagonal block elements will not be equal to zero when ϕ ≠ 0° and ϕ ≠ 90°, we therefore calculated the difference spectra Δm13 and Δm14 defined by Δmij = mij(δ ≠ 0) – mij(δ = 0) to examine whether their difference spectra have a similar property to LB′ and LD′. Figure 5 presents the difference spectra of m13 and m14 calculated at different azimuthal angles. As shown in Fig. 5, when ϕ = 0°, Δm13 = Δm14 = 0 since in this case the off-diagonal block elements are always equal to zero regardless of the overlay displacement. We also observe from Fig. 5 that the relation Δmij(–δ) = –Δmij(δ) for the off-diagonal block elements are only valid at ϕ = 90°, which can be readily interpreted according to Eq. (13). From this aspect, we know that LB′ (or LD′) seems to be a much better choice as the overlay indicator since the relation L′(–δ) = –L′(δ) is always valid at any azimuthal configuration.
Since LB′ is always equal to zero when δ = 0, we can thus compare the norm of the LB′ spectra () for a given overlay displacement at different azimuthal angles to choose a more sensitive azimuthal configuration. Figure 6 presents the variation of of the investigated grating sample at azimuthal angles varied from 0 to 90° with an increment of 15°. According to Fig. 6, we can observe that shows a larger value at ϕ = 60° and ϕ = 90°, which indicates that ϕ = 60° and ϕ = 90° will be the better azimuthal configurations. It was found that most of current overlay measurements based on Mueller matrix formalism in the reported literature were usually performed at ϕ = 90° [10–12]. However, as can observed from Fig. 6, the at ϕ = 60° is even larger than that at ϕ = 90°, which suggests that ϕ = 90° might not always be the best choice of azimuthal configurations in overlay measurement. Considering that there is no large difference between the values of at ϕ = 60° and ϕ = 90°, these two azimuthal configurations are both selected in the following analysis in order to make a comparison.
3.3 Linearity verification
Since the overlay measurement by Eq. (15) is based on an assumption that the scalar indicator has a linear relation with respect to the overlay displacement δ, it is therefore necessary to check the linearity assumption prior to the measurement. If the linearity assumption is verified, we also need to check the range of δ where the linearity assumption is valid. To this end, we varied δ in the range from –50 to 50 nm. We first calculated using the PC and mean weighting approaches at ϕ = 60° and ϕ = 90°, respectively. To make a comparison, we also adopted the Mueller matrix element of the 2 × 2 off-diagonal blocks (we took m13 as an example) as the overlay indicator and calculated the corresponding PC-based and mean-based indictor values using a similar equation to Eq. (14), i.e., , where is the weight associated with at the ith spectral point. Note that here the mean of the off-diagonal block element of the Mueller matrix in a spectral range was also the commonly adopted scalar overlay indicator in the reported literature [10, 11]. We performed linear fitting to the indicator values at different overlay displacements. To achieve a good fitting result, we ignored some ranges of δ where there showed a poor linear performance (See Figs. 7(d) and 8(d)). The fitting with a large coefficient of determination R2 and small fitting errors over a large range of δ indicates a good linear relation between the overlay indicator and the overlay displacement.
Figure 7 presents the results for . According from Fig. 7, we can observe that both the PC- and mean-based scalar indicators show a good linear relation with respect to δ varied in the range from –50 to 50 nm ϕ = 60°. In comparison, at ϕ = 90°, the PC- and mean-based scalar indicators only show a good relation with respect to δ within the range from –40 to 40 nm, especially the mean-based indicator. Figure 8 presents the results for . As can be observed from Fig. 8, the mean-based scalar indicator shows a poor linear relation with respect to δ, especially at ϕ = 60°. In comparison, the PC-based indicator shows a better linear performance. Specifically, the PC-based indicator shows a linear relation with respect to δ within the range from –20 to 20 nm at ϕ = 60° and from –50 to 50 nm at ϕ = 90°. However, we should note from Fig. 8(a) that the fitted linear curve at ϕ = 60° has a relatively large intercept of about –0.183, while the fitted linear curve ϕ = 90° only have an intercept of about 1.473e–4. The large intercept in the fitted linear curve is induced by the nonzero value of m13 at ϕ = 60° even when δ = 0. The impact of a large intercept in the fitted linear curve on the overlay measurement will be discussed in Section 3.4. The comparison between the results given in Figs. 7 and 8 reveals that LB′ exhibits a better performance than the off-diagonal block element of the Mueller matrix to be the overlay indicator, and the PC-based overlay indicator exhibits a more robust performance than the mean-based indicator in most cases.
In addition, in the PC-based scalar overlay indicator, we need to obtain the weights ωi in advance so as to further calculate the indicator value. The calculation of the weights ωi in the PC-based indicator involves the estimation of the covariance matrix of (or m13 for ). To do this, we need to calculate at different overlay displacements. For the results given in Fig. 7, we can estimate the covariance matrix of by using the (or m13 for Fig. 8) calculated at all the overlay displacements varied from –50 to 50 nm. However, when we try to measurement the overlay error ε based on the DBO target shown in Fig. 2, we will only have two overlay displacements (i.e., δ = d + ε and −d + ε). Therefore, it is necessary to check whether or not we can still obtain the covariance matrix of in this case. It is also necessary to compare the difference between the weights ωi estimated by using the calculated at all the overlay displacements from –50 to 50 nm and those estimated by using the calculated at two overlay displacements.
We took the estimated weights ωi associated with in the PC-based scalar overlay indicator at ϕ = 60° (corresponding to the results given in Fig. 7(a)) as an example. In the analysis, we randomly chose two groups of overlay displacement duplet from the range of –50 to 50 nm, namely δ = (15, –40) nm and (37, –12) nm. Then, we estimated the corresponding weights using the above two groups of overlay displacement duplet, and compared them with the weights estimated by using all the overlay displacements varied from –50 to 50 nm. Figure 9 presents the comparison results. According to Fig. 9, we can observe that there is a minor difference between the weights estimated by using only two overlay displacements and those estimated by using all the overlay displacements. It thus suggests that two overlay displacements are enough to estimate the weights ωi used in the PC-based overlay indicator.
3.4 Overlay measurement
According to the above analysis, we try to measure the overlay error based on the DBO target given in Fig. 2 in this section. In the DBO target, d is a designed shift. Considering that the value of d might have an influence on the final measurement results, we thereby first investigated the measurement errors at different shift values using the proposed overlay indicators. In the analysis, we fixed the overlay error at ε = 10 nm while varied the designed shift d from 2 to 50 nm with an increment of 2 nm. We then estimated the overlay error at each shift value by Eq. (15). The measurement error is defined as the absolute difference between the estimated overlay errors and the given overlay error (ε = 10 nm). When estimating the overlay errors, we used both and to make a comparison, and both and were calculated using the PC and mean weighting approaches, respectively. In addition, we estimated the overlay errors using at ϕ = 60° and ϕ = 90°, respectively. Considering the bad linear performance for at ϕ = 60°, as indicated in Fig. 8(c), we thereby only estimated the overlay errors using at ϕ = 90°.
Figure 10 presents the measurement errors at different shift values estimated by and at ϕ = 60° and ϕ = 90°, respectively. As show in Fig. 10, different shift values indeed lead to different measurement errors, and moreover the PC- and mean-based overlay indicators also exhibit different measurement accuracy at different shift values and azimuthal configurations. When using to estimate overlay errors, the azimuthal configuration of ϕ = 60° exhibits higher measurement accuracy than that of ϕ = 90° for both PC- and mean-based overlay indicators, especially for large shift values. It might be attributed to the relatively small linear range for the overlay displacements at ϕ = 90°, as predicted in Fig. 7. We can also observe from Fig. 10(a) that, when using to estimate overlay errors at ϕ = 60°, the PC and mean weighting approaches show comparative accuracy at small shift values (d < 20 nm), while for large shift values (d > 20 nm) the mean weighting approach exhibits higher accuracy than the PC weighting approach, which is really beyond our expect. In comparison, when using to estimate overlay errors at ϕ = 90° (see Fig. 10(b)), the mean weighting approach exhibits rather worse accuracy for large shift values (d > 30 nm) than the PC weighting approach. It might be because of the relatively poor linear performance of the mean weighting approach when the overlay displacement δ is large than 40 nm, as predicted in Figs. 7(b) and 7(d). As shown in Fig. 10(c), when using to estimate overlay errors at ϕ = 90°, the PC weighting approach shows better accuracy than the mean weighting approach in the whole range of designed shifts. The poor performance of the mean weighting approach even suggests that it might not be used for the overlay measurement.
We then tried to measure the overlay error based on the results shown in Fig. 10. In the simulation, we fixed the designed shift d at 20 nm. We varied the overlay errors from 2 to 30 nm with an increment of 2 nm and compared the measured overlay errors by Eq. (15) with the input (given) overlay errors. Random noise was also added to the simulated Mueller matrices, which was generated using a signal dependent noise model given in our previous work [36, 37]. The generated noise was dependent on both the sample under test and the measurement configurations (combination of the wavelength and the incidence and azimuthal angles). We then performed the sum decomposition to the simulated Mueller matrices by Eqs. (6)-(8) and calculated the logarithm of the estimated physically realizable Mueller matrices by Eq. (4). We also used and to measure overlay errors at ϕ = 60° and ϕ = 90° to make a comparison, and both and were calculated using the PC and mean weighting approaches, respectively.
Figure 11 presents the measurement results by and at ϕ = 60° and ϕ = 90°, respectively. As can be observed from Fig. 11, except the case of using the mean weighting approach at ϕ = 90°, which has been predicted in Fig. 10(c), the overlay indicators can achieve good measurement results in most cases. As shown in Figs. 11(a) and 11(d), when using to measure overlay errors at ϕ = 60°, both the PC and mean weighting approaches can lead to good measurement results as predicted in Fig. 10(a). The PC and mean weighting approaches can only achieve good measurement results for the overlay errors less than about 24 nm, when using to measure overlay errors at ϕ = 90°, as illustrated in Figs. 11(b) and 11(e). The result is consistent with that shown in Fig. 10(b), which is attributed to the relatively small linear range of the overlay displacements, as predicted in Figs. 7(c) and 7(d). If we want to measure overlay errors using at ϕ = 90°, Figs. 11(c) and 11(f) suggest that we can only use the PC weighting approach. In addition, the linearly fitted equation inserted in each subfigure in Fig. 11 also revealed the corresponding measurement error, since in an ideal situation the linearly fitted equation should have a slope of 1 and an intercept of 0. The measurement error shown in Fig. 11 might be induced by the random error as well as the not strictly established linear relation between the scalar indicator and the overlay displacement, as revealed in the fitting errors shown in Figs. 7 and 8.
Figures 11(c) and 11(f) indicate that, although the mean weighting approach does not work when using to measure overlay errors at ϕ = 90°, the PC weighting approach is still applicable to achieve good measurement results. Recalling the results presented in Fig. 8, we know that the PC weighting approach exhibits better linear performance than the mean weighting approach even at the azimuthal configuration of ϕ = 60°. We thereby want to examine whether or not we could achieve good measurement results using based on the PC weighting approach at ϕ = 60° or at other conical mountings, just as the indicator did in Fig. 11(a). Considering that the PC-based overlay indicator might have a better performance in the case of a small designed shift value, as illustrated in Fig. 10, we fixed d at 10 nm in the test. We assumed that the input overlay error ε = 5 nm. Obviously, the total overlay displacement δ (δ = d + ε or δ = d − ε) falls in the linear range, as indicated in Fig. 8(a). We then used based on the PC weighting approach to obtain the measured overlay error at ϕ = 60°. It was found that the measured overlay error = −24.560 nm, which is obviously wrong! The wrong overlay error achieved at ϕ = 60° might be because of the large intercept in the fitted linear curve, as revealed in Fig. 8(a), induced by the nonzero off-diagonal block element values of the Mueller matrix at ϕ = 60° even when δ = 0.
Considering the fact that the off-diagonal block elements of the Mueller matrix will not be zero even when the azimuthal angle ϕ has a minor offset from 0° or 90° in the case of δ = 0, we want to further examine whether or not we could achieve good measurement results using based on the PC weighting approach at the azimuthal configurations that slightly deviate from 90°. We still fixed the designed shift d at 10 nm and assumed that the input overlay error ε = 5 nm. We performed the test at the azimuthal angles varied from 89 to 90° with an increment of 0.1°. We first used the indicator to obtain the measured overlay errors at different azimuthal angles. As expected, the results shown in Fig. 12 indicate that even a small offset of 0.1° from 90° in the azimuthal angle will lead to a large measurement error of nearly 1 nm (a relative measurement error of nearly 20%). We then replaced the original indicator with a corrected one to subtract the nonzero when δ is absent from the corresponding value at δ ≠ 0, namely . As can be observed from Fig. 12, the corrected indicator leads to a much better measurement results than the original one.
We also used the corrected indicator to obtain the measured overlay error at ϕ = 60°. The result was found to be = 4.641 nm, which is also much better than the previous result of −24.560 nm obtained by using the original indicator . However, we should note that the corrected indicator involves the calculation of the off-diagonal block element value of the Mueller matrix at δ = 0, which is unrealizable based on the DBO target given in Fig. 2. It therefore suggests that if we want to use the off-diagonal block elements of the Mueller matrix as the overlay indicator, we can only perform overlay measurement at ϕ = 90°. Moreover, the azimuthal angle should be accurately aligned; otherwise large measurement error will be generated. Accurate alignment of azimuthal angle at ϕ = 90° might be achieved according to Property 2. In addition, we need to adopt LB′ as the overlay indicator, since it shows to be more robust than the off-diagonal block elements of the Mueller matrix even when ϕ has a large offset from 90°, as indicated in Fig. 10(a), and 11(a), and 11(d).
In this work, the differential Mueller matrix calculus was first introduced to investigate the Mueller matrices of a double-patterned grating structure, which helps to reveal the six elementary optical properties, i.e., LB, LD, LB′, LD′, CB and CD, hidden in the Mueller matrices. We found that among the six elementary optical properties only LB′ and LD′ exhibit the sensitivity to both the direction and amplitude of the overlay displacement δ. We thereby adopted the weighted sum of LB′ (LD′ could also be adopted), namely , as the scalar overlay indicator. To make a comparison, we also adopted the weighted sum of the Mueller matrix elements (we took m13 as an example) of the 2 × 2 off-diagonal blocks, namely , as the scalar overlay indicator. Two weighting approaches were used to calculate the indicator values. One is the commonly used mean weighting approach, and another is our proposed PC weighting approach based on PCA. Thereinto, the indicator using the weighting approach was also the commonly adopted scalar indicator for overlay measurement in the reported literature. The performances of the overlay indicators were examined at azimuthal angles ϕ varied from 0 to 90°, and we found that
- (1) The overlay indicator can only be used at ϕ = 90°, and moreover even at ϕ = 90°, only the PC weighting approach can achieve good measurement results while the mean weighting approach that was commonly adopted in the reported literature will lead to inaccurate results.
- (2) In addition, when using to measure overlay error based on the PC weighting approach at ϕ = 90°, it is required to accurately align the azimuthal angle since a minor offset of 0.1° from ϕ = 90° will lead to a large relative measurement error of nearly 20%.
- (3) The reason why the overlay indicator can only be used at ϕ = 90° was attributed to that the Mueller matrix elements of the 2 × 2 off-diagonal blocks will not be equal to zero any more at a general conical mounting (0° < ϕ < 90°) even when δ = 0.
- (4) In contrast, the overlay indicator can achieve good measurement results using both PC and mean weighting approaches even at a general conical mounting, since when δ = 0, we have LB′ = 0 for any azimuthal configuration, while when δ ≠ 0, we always have LB′ ≠ 0 for any conical mounting.
The above achievements consequently lead to a conclusion that LB′ (or LD′) has a better performance than the 2 × 2 off-diagonal block elements of the Mueller matrix to be the overlay indicator. We thereby suggest replacing the off-diagonal block elements of the Mueller matrix with LB′ (or LD′) for robust overlay measurement. It is also worth pointing out that the above conclusions are not limited to the double-patterned structures; it is also valid for the overlay between features at different layers. In addition, although the results given in Section 3.4 are achieved by simulation, the derived conclusions in this paper could be readily applied to experimental data for actually designed DBO targets. For experimental Mueller matrices, it is recommended to perform the sum decomposition first and then perform the differential decomposition to avoid possible unphysical differential Mueller matrices. Finally, we could use Eq. (15) to obtain the actual overlay error.
Appendix A Proof of Property 2
Besides the Stokes-Mueller formalism, we can also use the Jones matrix formalism to describe the change of polarization state of incident light after interaction with a sample. Under the Jones matrix formalism, the Jones matrix J associated with the zeroth-order diffracted light of a grating sample, which connects the incident Jones vector with the reflected one, can be written as21]
When the azimuthal angle ϕ = 0°, all diffraction orders are within the plane of incidence (this configuration is usually called planar diffraction) and the cross-polarization coefficients rps and rsp are always zero for any periodic nanostructure. According to Eq. (17), we have . When ϕ = 90°, all diffraction orders are on the surface of a cone with revolution symmetry around the direction of grating lines (x-direction shown in Fig. 1). According to Theorem 3 in , we have rps = rsp at this special symmetrical conical mounting. Moreover, when the overlay error δ = 0, we will have rps = rsp = 0, while when δ ≠ 0, we will have rps = rsp ≠ 0. Therefore, according to Eq. (17), we always have C = 0 whether δ is zero or not. That’s the conclusion given by Property 2. Here, it is also worth pointing out that at oblique incidence C will not be zero any more when ϕ is not equal to 0° or 90°, as illustrated in Fig. 4. We should note that the nonzero C does not necessarily suggest that the grating sample exhibits chirality or bianisotropy, which might be induced by the linear anisotropy along the p-s and ±45° axes. Nevertheless, the nonzero C at oblique incidence can be adopted as a useful indicator to align the grating sample with a nonzero overlay displacement, especially at the azimuthal configuration of ϕ = 90°.
National Natural Science Foundation of China (NSFC) (51405172, 51475191, 51575214, and 51525502), Natural Science Foundation of Hubei Province of China (2015CFA005) and the 111 Project (B16019).
References and links
1. M. Maenhoudt, J. Versluijs, H. Struyf, J. Van Olmen, and M. Van Hove, “Double patterning scheme for sub-0.25 k1 single damascene structures at NA = 0.75, λ = 193 nm,” Proc. SPIE 5754, 1508–1518 (2005). [CrossRef]
3. S. H. Park, D. O. Shin, B. H. Kim, D. K. Yoon, K. Kim, S. Y. Lee, S. H. Oh, S. W. Choi, S. C. Jeon, and S. O. Kim, “Block copolymer multiple patterning integrated with conventional ArF lithography,” Soft Matter 6(1), 120–125 (2010). [CrossRef]
5. J. Finders, M. Dusa, B. Vleeming, B. Hepp, M. Maenhoudt, S. Cheng, and T. Vandeweyer, “Double patterning lithography for 32 nm: critical dimensions uniformity and overlay control considerations,” J. Micro/Nanolith. MEMS MOEMS 8(1), 011002 (2009). [CrossRef]
6. W. Yang, R. Lowe-Webb, S. Rabello, J. Hu, J. Y. Lin, J. Heaton, M. Dusa, A. den Boef, M. van der Schaar, and A. Hunter, “A novel diffraction based spectroscopic method for overlay metrology,” Proc. SPIE 5038, 200–207 (2003). [CrossRef]
8. A. J. den Boef, “Optical wafer metrology sensors for process-robust CD and overlay control in semiconductor device manufacturing,” Surf. Topogr.: Metrol. Prop. 4(2), 023001 (2016). [CrossRef]
9. S. Peterhänsel, M. L. Gödecke, V. F. Paz, K. Frenner, and W. Osten, “Detection of overlay error in double patterning gratings using phase-structured illumination,” Opt. Express 23(19), 24246–24256 (2015). [CrossRef] [PubMed]
10. Y. N. Kim, J. S. Paek, S. Rabello, S. Lee, J. Hu, Z. Liu, Y. Hao, and W. McGahan, “Device based in-chip critical dimension and overlay metrology,” Opt. Express 17(23), 21336–21343 (2009). [CrossRef] [PubMed]
11. J. Li, Y. Liu, P. Dasari, J. Hu, N. Smith, O. Kritsun, and C. Volkman, “Advanced diffraction-based overlay for double patterning,” Proc. SPIE 7638, 76382C (2010). [CrossRef]
12. C. Fallet, T. Novikova, M. Foldyna, S. Manhas, B. H. Ibrahim, A. De Martino, C. Vannuffel, and C. Constancias, “Overlay measurements by Mueller polarimetry in back focal plane,” J. Micro/Nanolith. MEMS MOEMS 10(3), 033017 (2011). [CrossRef]
13. C. J. Raymond, “Scatterometry for Semiconductor Metrology,” in Handbook of Silicon Semiconductor Metrology (Marcel Dekker Inc., 2001).
14. V. F. Paz, S. Peterhänsel, K. Frenner, and W. Osten, “Solving the inverse grating problem by white light interference Fourier scatterometry,” Light Sci. Appl. 1(11), e36 (2012). [CrossRef]
15. M. A. Henn, H. Gross, F. Scholze, M. Wurm, C. Elster, and M. Bär, “A maximum likelihood approach to the inverse problem of scatterometry,” Opt. Express 20(12), 12771–12786 (2012). [CrossRef] [PubMed]
17. J. Li, J. J. Hwu, Y. Liu, S. Rabello, Z. Liu, and J. Hu, “Mueller matrix measurement of asymmetric gratings,” J. Micro/Nanolith. MEMS MOEMS 9(4), 041305 (2010). [CrossRef]
18. T. Novikova, P. Bulkin, V. Popov, B. H. Ibrahim, and A. De Martino, “Mueller polarimetry as a tool for detecting asymmetry in diffraction grating profiles,” J. Vac. Sci. Technol. B 29(5), 051804 (2011). [CrossRef]
19. D. J. Dixit, V. Kamineni, R. Farrell, E. R. Hosler, M. Preil, J. Race, B. Peterson, and A. C. Diebold, “Metrology for block copolymer directed self-assembly structures using Mueller matrix-based scatterometry,” J. Micro/Nanolith. MEMS MOEMS 14(2), 021102 (2015). [CrossRef]
20. X. Chen, C. Zhang, S. Liu, H. Jiang, Z. Ma, and Z. Xu, “Mueller matrix ellipsometric detection of profile asymmetry in nanoimprinted grating structures,” J. Appl. Phys. 116(19), 194305 (2014). [CrossRef]
21. X. Chen, H. Jiang, C. Zhang, and S. Liu, “Towards understanding the detection of profile asymmetry from Mueller matrix differential decomposition,” J. Appl. Phys. 118(22), 225308 (2015). [CrossRef]
22. R. M. A. Azzam, “Propagation of partially polarized light through anisotropic media with or without depolarization: A differential 4×4 matrix calculus,” J. Opt. Soc. Am. 68(12), 1756–1767 (1978). [CrossRef]
27. S. R. Cloude, “Conditions for the physical realizability of matrix operators in polarimetry,” Proc. SPIE 1166, 177–185 (1990). [CrossRef]
28. D. G. M. Anderson and R. Barakat, “Necessary and sufficient conditions for a Mueller matrix to be derivable from a Jones matrix,” J. Opt. Soc. Am. A 11(8), 2305–2319 (1994). [CrossRef]
29. O. Arteaga, “Useful Mueller matrix symmetries for ellipsometry,” Thin Solid Films 571, 584–588 (2014). [CrossRef]
30. Z. Sekera, “Scattering matrices and reciprocity relationships for various representation of the state of polarization,” J. Opt. Soc. Am. 56(12), 1732–1740 (1966). [CrossRef]
32. W. Ludwig and C. Falter, Symmetries in Physics: Group Theory Applied to Physical Problems (Springer, 1988).
33. I. T. Jolliffe, Principal Component Analysis, 2nd ed. (Springer, 2002).
34. M. G. Moharam, E. B. Grann, D. A. Pommet, and T. K. Gaylord, “Formulation for stable and efficient implementation of the rigorous coupled-wave analysis of binary gratings,” J. Opt. Soc. Am. A 12(5), 1068–1076 (1995). [CrossRef]
35. L. Li, “Formulation and comparison of two recursive matrix algorithms for modeling layered diffraction gratings,” J. Opt. Soc. Am. A 13(5), 1024–1035 (1996). [CrossRef]
36. X. Chen, S. Liu, H. Gu, and C. Zhang, “Formulation of error propagation and estimation in grating reconstruction by a dual-rotating compensator Mueller matrix polarimeter,” Thin Solid Films 571, 653–659 (2014). [CrossRef]
37. S. Liu, X. Chen, and C. Zhang, “Development of a broadband Mueller matrix ellipsometer as a powerful tool for nanostructure metrology,” Thin Solid Films 584, 176–185 (2015). [CrossRef]