Four-level transmission-type surface relief grating profiles with nearly flat efficiency over a spectral octave are designed by rigorous electromagnetic diffraction theory. Parametric optimization of the relief depths and transition points of the profile steps of these leads to efficiencies in the range 50–60% over the entire octave if the ratio of the grating period and the mean spectral wavelength is greater than ~ 3.
© 2006 Optical Society of America
There is increasing demand for compact and low-cost spectrometers in, e.g., environmental, biological, and chemical sensing, and in process analysis [1–4]. High spectral resolution is not necessarily required — sometimes it is sufficient to resolve only tens of adjacent spectral channels or to monitor just a few characteristic spectral features — but the dispersion must typically be moderately high to achieve a compact design. The spectral range should often be wide (perhaps extending over an octave), and reasonably uniform spectral (and spatial) response is much preferred, particularly in pushbroom imaging spectrometers, to obtain fast and accurate spectral measurements [5–8].
There are several good reasons to employ transmission gratings rather than reflection gratings in compact imaging spectrometers [9,10]. Transmission gratings allow the construction of compact, low-cost, low-f-number instruments  and even direct-vision GRISM  based constructions . However, the optimization of transmission grating profiles for high and uniform efficiency over a wide spectral range has received only limited attention [6, 8, 12]. Such designs, with a flat octave-wide spectral response, are provided here. Since the spectrum is usually converted to digital form, variations in the spectral response diminish the accuracy of the analysis. Hence responses independent of the wavelength are of interest e.g., in imaging spectrometers.
To obtain dispersion values of interest in compact spectrometers, the ratio of the grating period d and the center wavelength of the spectral response is chosen to be 3–10. Four-level profile shapes considered because they have a sufficient number of degrees of freedom for optimization and because they are compatible with an accurate microlithographic fabrication process.
The paper is organized as follows. In sect. 2 we introduce the type of profile to be considered, including the free parameters to be optimized, and justify this choice. The design method based on parametric optimization and rigorous electromagnetic grating theory is presented in sect. 3 along with the merit functions used in the optimization. In sect. 4 we provide specific design results for selected values of the ratio d/ . Finally, conclusions are drawn in sect. 5.
2. Choice of grating profile type
During the past two decades great advances in diffractive optics [13, 14] have been achieved by applying techniques such as electron beam lithography, thin-film deposition, and reactive ion etching to fabricate microrelief profile in dielectric substrates. In grating fabrication these techniques facilitate far greater control over the profile shape than the more traditional ruling and holographic exposure  methods do; however, they are not inherently ideal for high-resolution spectroscopic applications because of ghosts introduced by stitching of the writing fields of electron beam exposure systems and the difficulty in generating large gratings on curved substrates. Fortunately these are not primary issues in our applications, and therefore we use the freedoms in profile shape allowed by lithographic fabrication. On the other hand, the relatively high dispersion required to achieve compact devices requires d/ ratios of the order of 3–10. One then speaks of resonance-domain gratings, which must be analyzed by rigorous electromagnetic theory [16, 17]. We use the Fourier Modal Method (FMM) with fast factorization  in the design process.
The choice of the grating profile type to be considered is a critical issue. On one hand it should be realizable at high precision because profile-shape errors of just tens of nanometers degrade the performance substantially in the resonance domain. On the other hand, a sufficient number of parameters should be available for successful optimization. To put the latter aspect into perspective we note that high efficiencies (above 96% for unpolarized light) can be achieved even with binary profiles if Bragg incidence is used and the profile depth as well as the fill factor are optimized . However, because of the Bragg selectivity, this is possible only over a severely limited spectral range. Similarly, triangular profiles can yield high efficiencies if the profile height (and possibly also the angle of incidence) is optimized; however, even if a ‘split” triangle is used to provide an additional degree of freedom , adequate spectral flattening over a wide spectral range can not be achieved.
Returning to the issue of feasibility of precise lithographic fabrication, we refer to a double-exposure electron beam lithographic process introduced by David , which is available to us in a somewhat modified form . This process is capable of generating four-level profiles of the type shown in Fig. 1 with nearly vertical sidewalls, sharp and exactly placed transitions, and highly precise depth levels. The free parameters for optimization are now the transition point locations xj and the profile depths zj, with j = 1,2,3, and also the angle of incidence θ if so desired. This choice of general profile shape is motivated not only by feasibility of fabrication, but also by our experience in designing diffractive elements with spatially variable local period for monochromatic light of wavelength λ0[20–22]: it was observed that not much can be gained by using more than four levels if the transition points xj are optimized. The results of Refs. [20–22] are not applicable here since we are not optimizing the profile for one value of the ratio d/λ0 at a time, but for an extended range Δλ with a fixed ratio d/ .
3. Design method
Parametric optimization with the Nelder-Mead simplex method is employed to search for optimum values of xj and Zj. In order to evaluate the merit of each configuration, FMM is used to calculate the spectral efficiency curves ηTE(λ) and ηTM(λ) of diffraction order - 1 at N sample values λn of the wavelength λ within the chosen spectral interval (the subscripts TE and TM denote TE and TM polarization, respectively). The number diffraction orders M used during optimization was typically M ~ 15 in TE polarization and M ~ 20 in TM polarization to ensure a satisfactory trade-off between convergence and computational cost (which increases ∝ M 3 in FMM), and the final results were verified using a larger number of orders. The most natural starting point of optimization is a regular four-level profile with equally spaced values of xj and Zj, but since the Nelder-Mead algorithm converges to the nearest local minimum several initial distributions with modified values of xj and Zj may have to be tried to find an acceptable solution of this nonlinear optimization problem.
In the first stage of the design procedure we use a least-squares-type merit function of the form
is the mean value of the efficiencies at λ = λn for unpolarized light. The mean value is allowed to evolve freely during this optimization stage, i.e., no attempt is yet made to ensure a high overall efficiency. As a result a flat response is obtained fairly rapidly but typically reduces to values of the order of 20%. The aim of the second optimization stage is the improve the efficiency without sacrificing the achieved uniformity of the spectral response. To this end one can, for example, replace with some goal value ηg, which is gradually raised during optimization until no further improvement is possible without a substantial increase of MF. Alternatively, some other form of merit function can be used during the second stage, as long as it is designed to push up the value of one way or the other.
Keeping in mind the dispersion requirements in our applications, we are primarily interested in values of d/ in the range 3–10. We choose a spectral range extending over a full octave in the near-infrared region, namely 1 μm < λ < 2 μm, and provide specific designs for d = 5 μ and d = 11 μm. This spectral range is of interest in numerous applications of pushbroom spectrometers, but designs can easily be obtained for other values of by appropriate scaling and using correct values for the (wavelength-dependent) refractive index of the substrate (incidence from an SiO2 substrate to air is assumed here, and the conditions z 3 > z 2 > z 1 are maintained).
The main results are collected in Figs. 2–4 and Tables 1 and 2. In addition to spectral flattening (green lines in Fig. 2), we also optimized the profiles for maximum efficiency using defined by Eq. (2) as a merit function (red lines in Fig. 2). This is seen to improve the efficiency substantially if d = 5 μm, but not when d = 11 μm. In both cases the flattened efficiency curves are rather similar for unpolarized light, with the efficiency varying in the range 52–60%. Even though the optimization result in the maximum-efficiency case at d = 11 μm is nearly the same as the non-optimized result, the transition points are shifted noticeably, and flattening of the efficiency curve changes especially the last step height. With d = 5 μm the lateral and vertical shifts of the parameters are even more remarkable. Finally, fabrication errors of ±20 nm in xj or Zj lead to efficiency degradation of the order of 2%.
It is remarkable that the efficiency curves in the two cases, d = 11 μm and d = 5 μm, are so similar. However, this is true for unpolarized light only, as illustrated in Fig. 4. Here the efficiency curves are plotted separately for TE and TM polarized incident fields, and the average curve for unpolarized light (already shown in Fig. 3) is given for comparison. Clearly, if d = 11 μm, the operation of the grating is nearly polarization-independent, which is understandable because the grating period-to-wavelength ratio approaches the ‘scalar regime’ d/λ ≫ 1. Even though the structure is nearly polarization independent, analysis with scalar theory yields response strongly dependent on the wavelength thus proving that the scalar theory is not valid yet. However, if d = 5 μm, the response is strongly polarization-sensitive (as is typical in the resonance domain) but the TE and TM contributions compensate for each other.
The results have been presented explicitly for only two values of d, but these are representative in the entire range d > 3. If d > 11 μm, the optimized values of xj/d and zj given in Table 1 for the flattened response with d = 11 μm still give a rather good results. If the period is reduced from 11 μm to 5 μm, the values of the parameters required for flat response change, the efficiency curve for unpolarized light remain rather unchanged, but the polarization sensitivity increases. The tabulated results for d = 5 μm or d = 11 μm can be used as a starting point for optimization at nearby values of the period.
We have used parametric optimization and rigorous electromagnetic grating theory to design multilevel transmission-grating profiles with thus-far unparalleled spectral flatness over a one-octave range. The profile type was chosen to have the degrees of freedom offered by a specific lithographic fabrication process. When used in optical configurations that also provide sufficient spatial uniformity in, e.g., pushbroom imaging spectrometers, these grating profiles are ideal for numerous spectral sensing and monitoring applications.
The work of T. Vallius and J. Turunen was supprted by the Academy of Finland (projects 106410 and 207523). The European Union Network of Excellence on Micro-Optics (NEMO, www.micro-optics.org) and discussions with Pasi Laakkonen are acknowledged.
References and links
1. B. Braam, J. Okkonen, M. Aikio, K. Makisara, and J. Bolton, “Design and first test results of the Finnish airborne imaging spectrometer for different applications, AISA”, in Imaging Spectrometry of the Terrestial Environment, G. Vane, ed., Proc. SPIE 1937, 142–151 (1993). [CrossRef]
2. S. H. Kong, D. D. L. Wijngaards, and R. F. Wolffenbuttel, “Infrared micro-spectrometer based on diffraction gratings,” Sensors and Actuators A 92, 88–95 (2001). [CrossRef]
3. F. Salem and M. Kafatos, “Hyperspectral image analysis for oil spilling mitigation,”, in Proceedings of 22nd Asian Conference on Remote Sensing, (CRISP, Singapore, 2001) pp. 748–753.
4. E. Herrala and J. Okkonen, “Imaging spectrograph and camera solutions for industrial applications,” Int. J. Pattern Recogn. Artif. Intellig. 10, 43–54 (1996). [CrossRef]
5. R. O. Green, “Spectral calibration requirement for Earth-looking imaging spectrometers in the solar-reflected spectrum,” Appl. Opt. 37, 683–690 (1998). [CrossRef]
6. P. Mouroulis, D. W. Wilson, P. D. Maker, and R. E. Muller, “Convex grating types for concentric imaging spectrometers,” Appl. Opt. 37, 7200–7208 (1998). [CrossRef]
7. P. Mouroulis, “Spectral and spatial uniformity in pushbroom imaging spectrometers,” in Imaging Spectrometry VJ. B. Rafert, W. J. Slough, C. A. Rohde, A. Pilant, L. J. Otten, A. D. Meigs, A. Jones, and E. W. Butler, eds. Proc. SPIE3753, 133–141 (1999). [CrossRef]
8. T. Hyvarinen, E. Herrala, and A. Dall’Ava, “Direct sight imaging spectrograph: a unique add-on component brings spectral imaging to industrial applications,” in Digital Solid State Cameras: Design and Applications, G. M. Williams, ed., Proc. SPIE 3302, 165–175 (1998). [CrossRef]
9. E. Herrala, J. Okkonen, T. Hyvarinen, M. Aikio, and J. Lammasniemi, “Imaging spectrometer for process industry applications,” in Optical Measurements and Sensors for the Process Industries, C. Gorecki and R. W. Preater, eds., Proc. SPIE 2248, 33–40 (1994). [CrossRef]
10. D. E. Battey and J. B. Slater, “Compact holographic imaging spectrograph for process control applications,” in Optical Methods for Chemical Process Control, S. Farquharson, ed., Proc. SPIE 2069, 60–64 (1997). [CrossRef]
11. E. Cianci, V. Foglietti, F. Vitali, D. Lorenzetti, A. Notargiacomo, and E. Giovine “Micromachined silicon grisms: high resolution spectroscopy in the near infrared,” Microelectron. Eng. 53, 543–546 (2000). [CrossRef]
12. P. Laakkonen, M. Kuittinen, J. Simonen, and J. Turunen, “Electron-beam-fabricated asymmetric transmission gratings for microspectrometry,” Appl. Opt. 39, 3187–3191 (2000). [CrossRef]
13. H. P. Herzig, ed., Micro-optics: Elements, Systems and Applications (Taylor & Francis, London, 1997).
14. J. Turunen and F. Wyrowski, eds., Diffractive Optics for Industrial and Commercial Applications (Wiley-VCH, Berlin, 1997).
15. M. C. Hutley, Diffraction Gratings (Academic Press, Orlando, 1982).
16. R. Petit, ed., Electromagnetic Theory of Gratings (Springer, Berlin, 1980). [CrossRef]
17. J. Turunen, M. Kuittinen, and F. Wyrowski, “Diffractive optics: electromagnetic approach,” in Progress in Optics, E. Wolf, ed., vol. XL, chap. V (Elsevier, Amsterdam, 2000).
18. L. Li, “Use of Fourier series in the analysis of discontinuous periodic structures,” J. Opt. Soc. Am. A 13, 1870–1876 (1996). [CrossRef]
19. E. Noponen and J. Turunen, “Binary high-frequency-carrier diffractive optical elements: electromagnetic theory” J. Opt. Soc. Am. A 11, 1097–1109 (1994). [CrossRef]
21. E. Noponen, J. Turunen, and A. Vasara, “Electromagnetic theory and design of diffractive-lens arrays,” J. Opt. Soc. Am. A 10, 434–443 (1993). [CrossRef]
22. K. Blomstedt, E. Noponen, and J. Turunen, “Surface-profile optimization of diffractive imaging lenses,” J. Opt. Soc. Am. A 18, 521–525 (2001). [CrossRef]
23. C. David “Fabrication of stair-case profiles with high aspect ratios for blazed diffractive optical elements,”Microelectron. Eng. 53, 677–680 (2000). [CrossRef]
24. K. Jefimovs, Ph.D. Thesis (University of Joensuu, 2003).