Expand this Topic clickable element to expand a topic
Skip to content
Optica Publishing Group

Interfacial surface roughness determination by coherence scanning interferometry using noise compensation

Open Access Open Access

Abstract

The capability of coherence scanning interferometry has been extended recently to include the determination of the interfacial surface roughness between a thin film and a substrate when the surface perturbations are less than 10nm in magnitude. The technique relies on introducing a first-order approximation to the helical complex field (HCF) function. This approximation of the HCF function enables a least-squares optimization to be carried out in every pixel of the scanned area to determine the heights of the substrate and/or the film layers in a multilayer stack. The method is fast but its implementation assumes that the noise variance in the frequency domain is statistically the same over the scanned area of the sample. This results in reconstructed surfaces that contain statistical fluctuations. In this paper we present an alternative least-squares optimization method, which takes into account the distribution of the noise variance-covariance in the frequency domain. The method is tested using results from a simulator and these show a significant improvement in the quality of the reconstructed surfaces.

Published by The Optical Society under the terms of the Creative Commons Attribution 4.0 License. Further distribution of this work must maintain attribution to the author(s) and the published article's title, journal citation, and DOI.

1. INTRODUCTION

Surface metrology of form, flatness, roughness, and smoothness are important for quality assurance in many types of manufacturing. It is a particular issue for optical components or optical coatings where control of features in the nanometer or sub-nanometer range is required. Stylus-based surface profilometry is the conventional technique used to provide two-dimensional surface metrology. However, this technique can cause modification of the surface under measurement. Coherence scanning interferometry (CSI), previously referred to as scanning white light interferometry, is a well established, non-contact method that provides true three-dimensional measurements [1]. Optical microscopy provides lateral images without height information. Scattermeters measure the proportion of specular to diffuse reflection to calculate the root mean square roughness [2]. In comparison, CSI measures absolute heights at each pixel in the field of view with sub-nanometer vertical resolution [3]. This allows all the various surface roughness parameters measured using stylus profilometry to be computed using CSI [4].

CSI measures surface topography by locating and connecting peak positions in the interference pattern, referred to as an interferogram, at each pixel over the scanned area to reconstruct the measured surface. One of the prerequisites for the test surface is that the surface should have identical amplitude reflection coefficients over the field of view; otherwise, a phase shift on reflection occurs, which results in an erroneous vertical profile. Even if the refractive index is unchanged over the measurement area, problems can occur in the CSI measurement for transparent/semi-transparent thin films with thicknesses of <1.5μm. This is because the interferogram has multiple peaks corresponding to the interfacial surfaces in the thin film assembly. The peaks may be superimposed depending on the thin film structure. In the case where the films are >1.5μm, it is possible to detect and separate the peaks to reproduce the interfacial topographies [57]. However, this is not the case with thin films of a few hundred nanometers in thickness. A Fourier transform of the interferogram can be performed for investigations in this thickness regime. Subtle changes in phase and amplitude are compared with those synthesized mathematically [811].

One of the methods defined in the frequency domain approach uses a theory based on the helical complex field (HCF) function [12,13] and its extensions [14,15]. Interfacial surface roughness (ISR) has been determined using this method by introducing a first-order approximation to the HCF function, which enables fast real-time computation [3,16]. However, the method can introduce spurious surface roughness, as shown in a previous study [15].

This paper presents a methodology to reduce the spurious roughness which can occur with the existing ISR method and improves its computation stability for a wide variety of samples with no additional hardware, no changes in measurement procedure, and little extra computational effort.

2. THEORY

A. Helical Complex Field Function

The determined and synthesized HCF functions, which have a reflection coefficient averaged over the incident angle of the light of amplitude r¯ [15], are expressed as follows:

HCFd(ν;d)=r¯ref(ν)·F[I]SB+F[Iref]SB+,HCFs(ν,d)=r¯(ν,d)·exp(j4πνΔzHCFcosθ¯),
where the thin film assembly consists of a substrate represented by its subscript sub and the thickness of the L film layers expressed by d=[dsub,d1,,dL]. Note that dsub is not used in Eq. (1) but Δdsub is considered in the following discussion. The determined HCF function, HCFd, is given by the positive sideband of the Fourier transform of the actual interference signal I obtained from a test sample divided by that of a known reference material Iref. Whereas the synthesized HCF function, HCFs, is derived from a mathematical model of the test thin film structure [12]. Note that given any signal s, the positive sideband of the Fourier transform of the signal is termed F[s]SB+. The averaged incident angle of the CSI instrument is denoted by θ¯ and ν represents the light frequency. The unknown parameter ΔzHCF is associated with the surface height [15].

Assuming that the thin films are completely flat over the field of view, the set of film thicknesses is numerically determined to be d=d^ together with ΔzHCF by minimizing the squared error between the two functions in Eq. (1). Normally the interference signals over M test sample pixels and Mref reference sample pixels, typically a few hundred, are averaged to have each signal with less noise; i.e., I=1/MiIi is effectively used in Eq. (1) and so is Iref=1/MrefiIrefi.

B. First-Order Approximation to the Synthesized HCF Function

Let εpx be the noise induced in the interference signal at any pixel Ipx with its interfacial surface perturbation Δd, then the determined and the synthesized HCF functions are, respectively,

HCFpxd=r¯ref(ν)·F[Ipx+εpx]SB+F[Iref]SB+,HCFpxs(ν;d^+Δd)HCFd¯+j4πνcosθ¯·HCFd¯{Δdsub+l=1LGl(ν;d^)Δdl},
where
Gl(ν;d^)=1+14πνcosθ¯χ(d^)dl,arg(r¯)=χ,HCFd¯=r¯ref(ν)·F[E[Ipx+εpx]]SB+F[Iref]SB+.

Note that the condition E[Ipx+εpx]=E[Ipx] holds. It follows that HCFd¯ is smooth over the wavelength range of interest, as shown in Fig. 1(a).

 figure: Fig. 1.

Fig. 1. Determined HCF function of a 520 nm SiO2 thin film on a Si substrate: (a) The global determined HCF function HCFd¯, obtained from the full 21×21 matrix of four pixels; (b) the HCF function HCFpxd determined from four pixels at the edge of the measurement area; (c) the locally determined HCF function HCFpxd at the center of the measurement area.

Download Full Size | PDF

For the computations based on Eq. (2), the expressions for the HCF functions need to be re-written in a spectrally discrete manner (ν=[ν1,ν2,,νm]) as follows:

HCFpxd=[HCFpxd(ν1),HCFpxd(ν2),,HCFpxd(νm)],HCFpxsHCFd¯+Diag[HCFd¯]GΔd.

Accordingly the expression is re-written as a linear inverse problem with noise ϵo in the frequency domain:

HCFpxdHCFd¯+Diag[HCFd¯]GΔd+ϵo,
where
G=j4πcosθ¯[ν1ν1·G1(ν1)ν1·GL(ν1)ν2ν2·G1(ν2)νmνm·G1(νm)νm·GL(νm)],Diag[HCFd¯]=[HCFd¯(ν1)000HCFd¯(ν2)000HCFd¯(νm)],HCFd¯=[HCFd¯(ν1),HCFd¯(ν2),,HCFd¯(νm)],ϵo=[ϵo1,ϵo2,,ϵom]N(ϵo|0,σo2I).

Note that the noise ϵoi existing in the frequency domain is assumed to follow a normal distribution N(ϵoi|0,σo2) with the mean at zero and variance σo for iN. Given a random vector a, the operators E[a] and E[(aE[a])(aE[a])] are understood to be the ensemble and the variance-covariance matrix of the random vector respectively. The variance-covariance matrix of the noise ϵo is assumed to be σo2I, where I is the identity matrix.

C. Interfacial Surface Roughness Determination by the Present ISR Method

Using the discrete expressions introduced in the previous section, a merit function Jpx=HCFpxdHCFpxs2 is minimized for every pixel with respect to Δd such that ϵoN(ϵo|0,σo2I). The solution Δ^d is equivalent to that established by the maximum likelihood estimation under the assumption that each element of the noise ϵo is stochastically independent and has the same variance, i.e., the relation E[(ϵoE[ϵo])(ϵoE[ϵo])]=σo2I holds. Then, the solution of the linear inverse problem in Eq. (4) is given analytically by [15,17,18]

Δ^d=(GG)1GTu,
where
u={Diag[HCFd¯]}1[HCFpxdHCFd¯].

As expressed in Eq. (5), the vector u is an observed signal while Δd is an unknown original signal to be estimated. The problem can be re-expressed as

u=GΔd+ϵ,whereϵ={Diag[HCFd¯]}1ϵo,
where the noise ϵ and its variance σ2 have been re-defined for simplification. Accordingly, the problem and the merit function are simplified such that minJpx=uGΔd2 subject to ϵN(ϵ|0,σ2I).

D. ISR Methodology with Noise Compensation

Although the existing ISR method [15,16,19] has been used to determine the surface topography of a layer buried under a transparent thin oxide film, the method can induce spurious roughness caused by system and environmental noise in the signal. This is illustrated in Fig. 1 for data taken from a 520 nm SiO2 thin film. If the optimization process averages over the full scanned area as in Fig. 1(a), the result is a smoothly varying HCF function. However, if the HCF function is determined locally, as shown in Figs. 1(b) and 1(c), there is a different functionality.

Consider that we have a variance-covariance matrix Σ for the noise, then the optimal solution given by the least-squares error method in Eq. (5) is not valid. This is because the probability distribution of an observed signal u now follows N(u|GΔd,Σ). It follows that we need to modify the merit function to deal with this different probability distribution, otherwise the least-squares method would lead to an erroneous solution or become unstable. Without the modification, the ISR method measures a higher interfacial surface roughness.

1. ISR with Noise Compensation (ISR-NC) Determination

Let p(u) be the probability density function (PDF) of the observed signal u, then using the assumption that the spectral noise ϵ follows the normal distribution N(ϵ|0,Σ), the PDF also is a normal distribution. Therefore, the PDF and its log likelihood function L(Δd) are expressed as follows [18]:

p(u)=1(2π)m2|Σ|12exp[12(uGΔd)Σ1(uGΔd)],L(Δd)=logp(u)=12(uGΔd)Σ1(uGΔd)+C,
where C is a constant independent of u and Δd. Maximization of the log-likelihood function with respect to Δd is equivalent to minimization of the following merit function Jpx:
minimizeΔdJpx=(uGΔd)Σ1(uGΔd),subjecttoϵN(ϵ|0,Σ).

As with the existing ISR method, the optimal solution for this linear inverse problem Δ^d is obtained analytically [18] with the variance-covariance matrix as follows [17,18,20]:

Δ^d=(GΣ1G)1GΣ1u.

In statistical signal processing, this method is often referred to as pre-whitening [18]. One of the benefits of this method is that the multi-correlation (covariance) of the noise is also considered when minimizing the merit function. This means that the ISR with noise compensation (ISR-NC) method puts more importance on the wavelength domains with smaller noise variance when determining an optimal solution. The way in which the variance-covariance matrix is calculated is described in the following section.

2. Determination of the Variance-Covariance Matrix of the Noise

The variance-covariance matrix of the noise Σ is determined from the reference measurement with a known material. First the reference global interferogram Iref is computed by averaging over the interferogram at the i-th pixel Irefi resulting in HCFrefd¯. The HCF function at the i-th pixel on the reference sample HCFrefd,i is also obtained.

Defining ϵi=HCFrefd,iHCFrefd¯ as the noise at the i-th pixel its variance-covariance matrix Σo can be computed to match the expression in Eq. (9) as shown in Eq. (10). Note that Σo is determined by computing the sample variance and covariance over an area corresponding to Mref pixels (normally from20×20to50×50):

Σo=E[(HCFrefdHCFrefd¯)(HCFrefdHCFrefd¯)]1Mrefi=1Mref(HCFrefd,iHCFrefd¯)(HCFrefd,iHCFrefd¯),ΣDiag[HCFd¯]1Σo{Diag[HCFd¯]1},
where
HCFrefd¯(ν)=r¯ref(ν)·F[Iref]SB+F[Iref]SB+=r¯ref(ν),HCFrefd,i(ν)=r¯ref(ν)·F[Irefi]SB+F[Iref]SB++ϵi,Iref1Mrefi=1MrefIrefi.

Note that no additional process is required to obtain Σ since the measurement of a reference sample is a prerequisite for the existing HCF-based techniques [1315].

Figures 2 and 3 show the actual variance-covariance matrix of the noise and its variances (diagonal elements) obtained from a flat silicon surface used as a reference. It is clear that the noise variance is larger as the wavelength approaches its limits for both the real and imaginary parts. As expected, the noise variance is not constant over the spectral range of interest.

 figure: Fig. 2.

Fig. 2. Noise variance-covariance matrix Σo from a silicon reference sample with Mref=21×21pixels (actual CSI measurement): (a) real part, (b) imaginary part (color available online).

Download Full Size | PDF

 figure: Fig. 3.

Fig. 3. Noise variance in the frequency domain (actual CSI measurement). The diagonal element of the (a) real and (b) imaginary parts of the noise variance-covariance matrix illustrated in Fig. 2.

Download Full Size | PDF

3. COMPUTER SIMULATION

In this section a comparison is made between the existing ISR method and the ISR-NC method. Due to the approximations made there could be a discrepancy between the approximated HCF function and the original even if the interference signals are free from noise. Thus, the performance of each method is compared as follows: the ISR method free of noise (ISR-NF) the ISR method with noise (ISR), and the ISR method with noise compensation (ISR-NC).

A. Simulation Setup

For the model, we assume that a nanometer-sized feature is buried under a thin film as shown in Fig. 4. The number of pixels used for the reference measurement Mref is fixed as 32×32=1024 throughout the simulations. White Gaussian noise is added to the interferogram (time domain) and the light intensity is tuned to produce profiles similar to those obtained from the experimental data (Figs. 2 and 3) resulting in the noise variances shown in Fig. 5. Comparing the actual noise variances with those simulated as shown in Figs. 3 and 5, the Gaussian noise used in the simulations is considered to reasonably reproduce the actual noise in the frequency domain. Note that the wavelength range for the model is set from 400 to 730 nm, similar to that used previously [15].

 figure: Fig. 4.

Fig. 4. Schematic drawing of the model. The number of the pixels in the measurement area is M together with the pixels having the feature is Mf. Note that the global film thickness d^d.

Download Full Size | PDF

 figure: Fig. 5.

Fig. 5. Noise variance in the frequency domain: The diagonal element of the (a) real and (b) imaginary parts of the noise variance-covariance matrix given to the simulations. Note that the signal-to-noise (S/N) ratio is 2000.

Download Full Size | PDF

Table 1 shows the results from all the models tested. To simulate a high-performance instrument having a 4M pixel camera, such as the CCI HD (Taylor Hobson), the signals will be averaged over every four pixels to create an interference signal, so that 1024 signals in each measurement become 256 averaged signals.

Tables Icon

Table 1. Simulation Conditions Together with the Number of Pixels Allocated for the “Global” and “Featured” Areas

B. Results and Analysis

Comparisons between the noise-free ISR (ISR-NF), the ISR, and the noise robust ISR (ISR-NC) methods are made by examining the height of the buried substrate and the surface roughness (Sq) of the top surface and buried layer interface. Figure 6 shows three-dimensional images of the resulting computations using the three methods. The ISR-NC method yields the smoothest surfaces, which is also the case for all the models tested since the method is free from noise.

 figure: Fig. 6.

Fig. 6. Comparisons between the three computational methods on the sample: SiO2 (thicknes s=514nm) on a Si substrate; the feature height is 5 nm: (a) the ISR method (with noise), (b) the ISR-NC method (with noise), (c) the ISR-NF method (noise free). The S/N ratio is set at 102 to correspond to Sim 3-1 in Table 1 (color available online).

Download Full Size | PDF

1. Simulation 1: Performance Sensitivity to Variation in Thin Film Thickness

Thin films, of SiO2 and ZrO2, of varying thickness were investigated while other parameters remained unchanged, as shown in Table 1. The thicknesses in Table 1 correspond to odd integer multiples (3, 5, 7, 9) of the quarter-wavelength optical thickness (QWOT).

Although the feature heights determined by the ISR and ISR-NC methods were very similar, the height variance of the ISR-NC method was always less than that provided by the ISR method.

The surface roughness (Sq) of the top surface and the substrate determined by the ISR-NC method are smoother than those of the ISR method, as shown in Fig. 7. In addition, one simulation (Sim 1-2 with a 610 nm ZrO2 film) did not work properly with the ISR method due to an inaccurate approximation. This is discussed later in Section 4.

 figure: Fig. 7.

Fig. 7. Surface roughness (Sq) as a function of film thickness: Red circles, top ISR; red squares, Sub ISR; blue circles, top ISR-NC; blue squares, sub ISR-NC.

Download Full Size | PDF

2. Simulation 2: The Effect of Feature Height on Performance

The ISR methods, including ISR-NC, use a first-order Taylor expansion to the HCF function to make the problem linear [15,16,19]. This requires that the perturbation of the interfacial surface topography is “small.” In this experiment, the noise is fixed at S/N=103 and we evaluate the performance of the methods as a function of feature height with the fixed thin film thicknesses of 514 nm for SiO2 and 339 nm for ZrO2.

Figure 8 shows that the ISR methods with or without noise work well up to 10nm in feature height for the SiO2 film and up to 5nm for the ZrO2 film. The ISR-NC method gives a reasonable approximation up to 10nm for ZrO2 film. These results, however, do not necessarily prove the superiority of the ISR-NC method. The root cause of the deterioration in performance, which is basically proportional to the feature height, is the quality of the first-order approximation to the HCF function of interest. The HCF function is poorly approximated in some wavelength regions which have a high noise variation, as shown in Fig. 5.

 figure: Fig. 8.

Fig. 8. Feature height sensitivity as a function of feature height: black circles, ISR-NF; red triangles, ISR; blue squares, ISR-NC.

Download Full Size | PDF

3. Simulation 3: Noise Compensation Performance

In this set of simulations, the performance of each method as a function of the signal-to-noise (S/N) ratio is investigated. The other parameters remain unchanged, as shown in Table 1.

The ISR-NF, ISR, and ISR-NC methods give similar mean feature height values regardless of the noise level for the SiO2 film but this is not the case for ZrO2 film. The ISR method determines the height as 4.3 nm compared to the actual value of 5 nm, as shown in Fig. 9(b). As in the previous Section 3.B.2, this is due to a poor first-order approximation of the amplitude reflection coefficient in the smaller variance wavelength regions.

 figure: Fig. 9.

Fig. 9. Signal-to-noise ratio sensitivity to the determined feature height: black circles, ISR-NF; red triangles, ISR; blue squares, ISR-NC.

Download Full Size | PDF

The thin film and substrate surface roughnesses determined by the ISR and ISR-NC methods are proportional to the increase in the S/N ratio, as shown in Fig. 10. However, the level of surface roughness determined by the ISR-NC method is lower for all noise levels.

 figure: Fig. 10.

Fig. 10. Surface roughness (Sq) as a function of the S/N ratio: red circles, top ISR; red squares, sub ISR; blue circles, top ISR-NC; blue squares, sub ISR-NC.

Download Full Size | PDF

4. Simulation 4: Effect of Substrate Materials

All the variables except for the substrate material are unchanged in the simulations 4-1 and 4-2, as shown in Table 1. The substrate materials used are Si, SiC, BK7 glass, and Ge.

Similar to previous results in Sections 3.B.13.B.3, the ISR-NC method resulted in a smaller variance in feature height in both simulations 4-1 and 4-2. The film and substrate surfaces determined by the ISR-NC method are about an order of magnitude smoother than those from the ISR method.

5. Simulation 5: Effect of the Type of Deposited Film

Simulations 5-1 and 5-2 shown in Table 1 investigate the effect of different film materials for 350 nm and 700 nm thickness films. The ISR-NC method again provided more accurate buried surface topographies together with smaller variances. The reconstructed surfaces were about an order of magnitude smoother irrespective of the thin film material using the ISR-NC method.

Table 2 shows the effective QWOT values of the films used in the simulations. Usually films with thickness greater than QWOT×3 are considered to have enough features in the frequency domain for the HCF theory to work [15]. It follows that there should not be much difference in the simulated performance of the various films unless the first-order approximation to the HCF function is sufficiently accurate. However, the ISR method did not work for the Ta2O5 film in simulation 5-1 while the ISR-NC method did. This issue will be discussed in the following Section 4.

Tables Icon

Table 2. Corresponding Effective Quarter-Wavelength Optical Thickness Values at the Wavelength of 600 nm

4. DISCUSSION

As shown in Sections 3.B.1, 3.B.2 and 3.B.5, the ISR method does not work optimally resulting in erroneous buried feature heights such as those from simulation 2-2 with a feature height >10nm, simulation 1-2 with a ZrO2 600 nm film, and simulation 5-1 with a Ta2O5 350 nm film, as shown in Figs. 8(b) and 11, respectively. All these simulations show that the corresponding surfaces determined by the ISR-NC method are more accurately represented than those using the noise-free ISR method. The root cause of this problem lies in an inaccurate approximation to the HCF function. The lack of accuracy of the HCF function arises when the perturbation (feature height) of the interfacial topography is too large (>10nm) and when the approximated spectral amplitude reflection coefficient locally deviates from the true value resulting in a spike.

 figure: Fig. 11.

Fig. 11. Erroneous determined feature height (originally set as 5 nm): black circles, ISR-NF; red triangles, ISR; blue squares, ISR-NC.

Download Full Size | PDF

Consider first the simulation 2-2, which has a 20 nm feature height. If we compare the true HCF function with its first-order approximation, then Fig. 12(a) shows that the first-order approximation does not hold, especially in the wavelength region between 400 and 475 nm. Fig. 12(b) shows the difference between the true HCF function HCFpxd (without noise) and the approximated estimates by each method HCFpxs (in the presence of noise). Prior knowledge of the noise variance-covariance matrix Σ allows the ISR-NC method to put less importance on the value of the HCF function in the specific wavelength domains where the noise is large, i.e., from 400 to 450nm and from 700 to 730nm, as shown in Fig. 5. This is not the case for the ISR method and is the reason why the ISR-NC method provides more accurate determinations.

 figure: Fig. 12.

Fig. 12. HCF functions generated at the feature pixel (simulation 2-2 with 20 nm feature height): (a) true HCF function (without noise) denoted by “Org” and its first-order approximation by “aprx”; (b) spectral difference between the true HCF function HCFpxd and the HCF functions produced by each method HCFpxs (noise occurs in both ISR and ISR-NC, and NF stands for ISR-NF) (color available online).

Download Full Size | PDF

The second cause of inaccurate surface reconstruction observed in simulations 1-2 and 5-1 is due to an inaccurate approximation to the amplitude reflection coefficient. Consider simulation 5-1 using the Ta2O5 thin film. The first-order approximation to the HCF function is successful, as shown in Fig. 13(a), except for a spike observed at 440nm wavelength denoted by “aprx”. The solution provided by the ISR method defined in Eq. (5) is influenced by this spike, which reduces the fitting performance as shown in Fig. 13(b). The residual of HCFpxsHCFpxd2 at 435 nm wavelength is relatively small for the ISR method whereas that given by the ISR-NC method is large. It follows that the ISR-NC method does not attempt to fit the spike feature due to the noise variance-covariance matrix Σ.

 figure: Fig. 13.

Fig. 13. HCF functions generated at the feature pixel (simulation 5-1 with Ta2O5 film): (a) the true HCF function (without noise) denoted by “Org” and its first-order approximation by “aprx”; (b) the spectral difference between the true HCF function HCFpxd and the HCF functions produced by each method HCFpxs (noise exists for ISR and ISR-NC, and NF stands for ISR-NF); (c) the spectral difference between the real and imaginary parts of the true amplitude reflection coefficient and its first-order approximation. Note that the dotted lines (black and pink) represent the maximum deviations of the real (Re[rraprx]), imaginary (Im[rraprx]), and the reflectivity R, respectively (color available online).

Download Full Size | PDF

To confirm this further, an improvement in the performance of the ISR method was achieved by reducing the wavelength region used for numerical optimization to avoid the region in which the spikes occur.

To achieve a good fit between the determined and synthesized HCF functions in the frequency domain, there are two options: (1) using the ISR method with wavelength domains having less noise variance, such as from 430 to 700 nm in the examples above, or (2) using the ISR-NC method. The latter option enables the measurement of thinner films to be more stable owing to the wider wavelength domain for curve-fitting, irrespective of the noise characteristics.

5. CONCLUSIONS

Present methods for interfacial surface roughness measurement using CSI can be classified into two types: those that compute surface topographies in the time domain and those that determine surface topographies in the frequency domain. The methods belonging to the first group are used for films over 1.5μm in thickness whereas those in the second group are able to deal with thin films less than 1.5μm. The frequency domain methods usually use the least-squares optimization to fit the mathematical model to the measurement signal. However, the basic assumption for the method is that the noise is normally distributed and thus least-squares is not always suitable. In the examples above, the noise variance of the HCF function was not constant over the wavelength domain from 430 to 730 nm. The noise variance is always larger at the ends of the spectral region of interest where the light intensity is low. Therefore the noise variance-covariance matrix should be used in the numerical optimization of the ISR method. Such a matrix is obtained from the measurement of a known flat reference material and will vary depending on the environmental situation and the particular light source used.

Although the ISR method using the HCF function successfully determined the roughness of the thin film top surfaces and buried surfaces [15,16], spurious surface roughness in the determined substrate surfaces could be observed. This paper has presented an effective solution to that problem by introducing the noise variance-covariance matrix, which only involves a small computation when measuring the reference surface. Measurement of the reference surface is required anyway to counteract unknown changes in the phase and amplitude of the light provided by the optical system of the CSI instrument. Using these signals at the same time for the noise analysis is a beneficial side effect.

The reproducibility of the ISR-NC method was better than the existing ISR method for all the computer simulations in the presence of noise for determination of interfacial topography and surface roughness (Sq). The method was also effective over a wide wavelength range, thus allowing use of more features of the HCF function for the curve-fitting and hence better reproducibility. The noise used in the computer simulations is realistic since the noise variance-covariance matrix obtained from the flat silicon surface in Figs. 2 and 3 is similar to the noise in Fig. 5. Incorporation of noise compensation to the ISR method will improve the measurement accuracy.

Funding

Engineering and Physical Sciences Research Council (EPSRC) (EP/J017361/1, EP/M014797/1).

Acknowledgment

Hirokazu Yoshino is grateful to Taylor Hobson Ltd. for funding a studentship at Loughborough University.

REFERENCES

1. P. de Groot, “Coherence scanning interferometry,” in Optical Measurement of Surface Topography (Springer, 2011), Chap. 9, pp. 187–208.

2. T. R. Thomas, “Other measurement topics,” in Rough Surfaces, 2nd ed. (Imperial College, 1999), Chap. 3, pp. 35–61.

3. H. Yoshino, R. Smith, J. M. Walls, and D. Mansfield, “The development of thin film metrology by coherence scanning interferometry,” Proc. SPIE 9749, 97490P(2016).

4. F. Blateyron, “The areal field parameters,” in Characterisation of Areal Surface Texture (Springer, 2013), pp. 15–43.

5. B. S. Lee and T. C. Strand, “Profilometry with a coherence scanning microscope,” Appl. Opt. 29, 3784–3788 (1990). [CrossRef]  

6. P. J. De Groot and X. C. de Lega, “Transparent film profiling and analysis by interference microscopy,” Proc. SPIE 7064, 70640I (2008).

7. A. Bosseboeuf and S. Petitgrand, “Application of microscopic interferometry techniques in the MEMS field,” Proc. SPIE 5145, 1–16 (2003).

8. S. Kim and G. Kim, “Method for measuring a thickness profile and a refractive index using white-light scanning interferometry and recording medium therefor,” U.S. patent 6,545,763 (8  April  2003).

9. P. J. de Groot, “Interferometry method for ellipsometry, reflectometry, and scatterometry measurements, including characterization of thin film structures,” U.S. patent 7,403,289 (22  July  2008).

10. Y. Ghim and S. Kim, “Spectrally resolved white-light interferometry for 3D inspection of a thin-film layer structure,” Appl. Opt. 48, 799–803 (2009). [CrossRef]  

11. I. Abdulhalim, “Spectroscopic interference microscopy technique for measurement of layer parameters,” Meas. Sci. Technol. 12, 1996–2001 (2001). [CrossRef]  

12. D. Mansfield, “Apparatus for and a method of determining characteristics of thin-layer structures using low-coherence interferometry,” WO patent PCT/GB2005/002,783 (19  January  2006).

13. D. Mansfield, “The distorted helix: thin film extraction from scanning white light interferometry,” Proc. SPIE 6186, 61860O (2006).

14. H. Yoshino, P. M. Kaminski, R. Smith, J. M. Walls, and D. Mansfield, “Refractive index determination by coherence scanning interferometry,” Appl. Opt. 55, 4253–4260 (2016). [CrossRef]  

15. H. Yoshino, A. Abbas, P. M. Kaminski, R. Smith, J. M. Walls, and D. Mansfield, “Measurement of thin film interfacial surface roughness by coherence scanning interferometry,” J. Appl. Phys. 121, 105303 (2017). [CrossRef]  

16. D. Mansfield, “Extraction of film interface surfaces from scanning white light interferometry,” Proc. SPIE 7101, 71010U (2008).

17. T. Lewis and P. Odell, “A generalization of the Gauss-Markov theorem,” J. Am. Stat. Assoc. 61, 1063–1066 (1966). [CrossRef]  

18. K. Sekihara, Introduction to Statistical Signal Processing (Kyouritsu Shuppan, 2011).

19. D. I. Mansfield, “Apparatus for and a method of determining surface characteristics,” U.S. patent 13/352,687 (12 July 2012).

20. A. Albert, “The Gauss-Markov theorem for regression models with possibly singular covariances,” SIAM J. Appl. Math. 24, 182–187 (1973). [CrossRef]  

Cited By

Optica participates in Crossref's Cited-By Linking service. Citing articles from Optica Publishing Group journals and other participating publishers are listed here.

Alert me when this article is cited.


Figures (13)

Fig. 1.
Fig. 1. Determined HCF function of a 520 nm SiO 2 thin film on a Si substrate: (a) The global determined HCF function HCF d ¯ , obtained from the full 21 × 21 matrix of four pixels; (b) the HCF function HCF px d determined from four pixels at the edge of the measurement area; (c) the locally determined HCF function HCF px d at the center of the measurement area.
Fig. 2.
Fig. 2. Noise variance-covariance matrix Σ o from a silicon reference sample with M ref = 21 × 21 pixels (actual CSI measurement): (a) real part, (b) imaginary part (color available online).
Fig. 3.
Fig. 3. Noise variance in the frequency domain (actual CSI measurement). The diagonal element of the (a) real and (b) imaginary parts of the noise variance-covariance matrix illustrated in Fig. 2.
Fig. 4.
Fig. 4. Schematic drawing of the model. The number of the pixels in the measurement area is M together with the pixels having the feature is M f . Note that the global film thickness d ^ d .
Fig. 5.
Fig. 5. Noise variance in the frequency domain: The diagonal element of the (a) real and (b) imaginary parts of the noise variance-covariance matrix given to the simulations. Note that the signal-to-noise (S/N) ratio is 2000.
Fig. 6.
Fig. 6. Comparisons between the three computational methods on the sample: SiO 2 (thicknes s = 514 nm ) on a Si substrate; the feature height is 5 nm: (a) the ISR method (with noise), (b) the ISR-NC method (with noise), (c) the ISR-NF method (noise free). The S/N ratio is set at 10 2 to correspond to Sim 3-1 in Table 1 (color available online).
Fig. 7.
Fig. 7. Surface roughness ( S q ) as a function of film thickness: Red circles, top ISR; red squares, Sub ISR; blue circles, top ISR-NC; blue squares, sub ISR-NC.
Fig. 8.
Fig. 8. Feature height sensitivity as a function of feature height: black circles, ISR-NF; red triangles, ISR; blue squares, ISR-NC.
Fig. 9.
Fig. 9. Signal-to-noise ratio sensitivity to the determined feature height: black circles, ISR-NF; red triangles, ISR; blue squares, ISR-NC.
Fig. 10.
Fig. 10. Surface roughness ( S q ) as a function of the S/N ratio: red circles, top ISR; red squares, sub ISR; blue circles, top ISR-NC; blue squares, sub ISR-NC.
Fig. 11.
Fig. 11. Erroneous determined feature height (originally set as 5 nm): black circles, ISR-NF; red triangles, ISR; blue squares, ISR-NC.
Fig. 12.
Fig. 12. HCF functions generated at the feature pixel (simulation 2-2 with 20 nm feature height): (a) true HCF function (without noise) denoted by “Org” and its first-order approximation by “aprx”; (b) spectral difference between the true HCF function HCF px d and the HCF functions produced by each method HCF px s (noise occurs in both ISR and ISR-NC, and NF stands for ISR-NF) (color available online).
Fig. 13.
Fig. 13. HCF functions generated at the feature pixel (simulation 5-1 with Ta 2 O 5 film): (a) the true HCF function (without noise) denoted by “Org” and its first-order approximation by “aprx”; (b) the spectral difference between the true HCF function HCF px d and the HCF functions produced by each method HCF px s (noise exists for ISR and ISR-NC, and NF stands for ISR-NF); (c) the spectral difference between the real and imaginary parts of the true amplitude reflection coefficient and its first-order approximation. Note that the dotted lines (black and pink) represent the maximum deviations of the real ( Re [ r r aprx ] ), imaginary ( Im [ r r aprx ] ), and the reflectivity R , respectively (color available online).

Tables (2)

Tables Icon

Table 1. Simulation Conditions Together with the Number of Pixels Allocated for the “Global” and “Featured” Areas

Tables Icon

Table 2. Corresponding Effective Quarter-Wavelength Optical Thickness Values at the Wavelength of 600 nm

Equations (14)

Equations on this page are rendered with MathJax. Learn more.

HCF d ( ν ; d ) = r ¯ ref ( ν ) · F [ I ] S B + F [ I ref ] S B + , HCF s ( ν , d ) = r ¯ ( ν , d ) · exp ( j 4 π ν Δ z HCF cos θ ¯ ) ,
HCF px d = r ¯ ref ( ν ) · F [ I px + ε px ] S B + F [ I ref ] S B + , HCF px s ( ν ; d ^ + Δ d ) HCF d ¯ + j 4 π ν cos θ ¯ · HCF d ¯ { Δ d sub + l = 1 L G l ( ν ; d ^ ) Δ d l } ,
G l ( ν ; d ^ ) = 1 + 1 4 π ν cos θ ¯ χ ( d ^ ) d l , arg ( r ¯ ) = χ , HCF d ¯ = r ¯ ref ( ν ) · F [ E [ I px + ε px ] ] S B + F [ I ref ] S B + .
HCF px d = [ HCF px d ( ν 1 ) , HCF px d ( ν 2 ) , , HCF px d ( ν m ) ] , HCF px s HCF d ¯ + Diag [ HCF d ¯ ] G Δ d .
HCF px d HCF d ¯ + Diag [ HCF d ¯ ] G Δ d + ϵ o ,
G = j 4 π cos θ ¯ [ ν 1 ν 1 · G 1 ( ν 1 ) ν 1 · G L ( ν 1 ) ν 2 ν 2 · G 1 ( ν 2 ) ν m ν m · G 1 ( ν m ) ν m · G L ( ν m ) ] , Diag [ HCF d ¯ ] = [ HCF d ¯ ( ν 1 ) 0 0 0 HCF d ¯ ( ν 2 ) 0 0 0 HCF d ¯ ( ν m ) ] , HCF d ¯ = [ HCF d ¯ ( ν 1 ) , HCF d ¯ ( ν 2 ) , , HCF d ¯ ( ν m ) ] , ϵ o = [ ϵ o 1 , ϵ o 2 , , ϵ o m ] N ( ϵ o | 0 , σ o 2 I ) .
Δ ^ d = ( G G ) 1 G T u ,
u = { Diag [ HCF d ¯ ] } 1 [ HCF px d HCF d ¯ ] .
u = G Δ d + ϵ , where ϵ = { Diag [ HCF d ¯ ] } 1 ϵ o ,
p ( u ) = 1 ( 2 π ) m 2 | Σ | 1 2 exp [ 1 2 ( u G Δ d ) Σ 1 ( u G Δ d ) ] , L ( Δ d ) = log p ( u ) = 1 2 ( u G Δ d ) Σ 1 ( u G Δ d ) + C ,
minimize Δ d J px = ( u G Δ d ) Σ 1 ( u G Δ d ) , subject to ϵ N ( ϵ | 0 , Σ ) .
Δ ^ d = ( G Σ 1 G ) 1 G Σ 1 u .
Σ o = E [ ( HCF ref d HCF ref d ¯ ) ( HCF ref d HCF ref d ¯ ) ] 1 M ref i = 1 M ref ( HCF ref d , i HCF ref d ¯ ) ( HCF ref d , i HCF ref d ¯ ) , Σ Diag [ HCF d ¯ ] 1 Σ o { Diag [ HCF d ¯ ] 1 } ,
HCF ref d ¯ ( ν ) = r ¯ ref ( ν ) · F [ I ref ] S B + F [ I ref ] S B + = r ¯ ref ( ν ) , HCF ref d , i ( ν ) = r ¯ ref ( ν ) · F [ I ref i ] S B + F [ I ref ] S B + + ϵ i , I ref 1 M ref i = 1 M ref I ref i .
Select as filters


Select Topics Cancel
© Copyright 2024 | Optica Publishing Group. All rights reserved, including rights for text and data mining and training of artificial technologies or similar technologies.