Expand this Topic clickable element to expand a topic
Skip to content
Optica Publishing Group

Back-of-the-envelope image quality estimation using the national image interpretability rating scale

Open Access Open Access

Abstract

For an optical imaging system, the critical output is the image itself, and therefore, the quality of that image is of utmost importance. To estimate or predict the image quality (IQ), a simulation/model is typically created to yield an output image, given an imaging system and an object/scene. The IQ is typically graded based on the imaging system along with the scene. Developing an imaging simulation and creating input scenes to produce IQ results can be time-consuming, leading to a desire for a simple method to estimate the predicted IQ. This work develops a national image interpretability rating scale (NIIRS) IQ value based on simplifying assumptions for remote-sensing purposes. While the results are on the optimistic side, this back-of-the-envelope IQ estimation allows the process of developing a new imaging system to move forward more in parallel rather than in series, i.e., developing an imaging full simulation/model in parallel with designing/procuring hardware.

© 2019 Optical Society of America

Corrections

Jason Mudge, "Back-of-the-envelope image quality estimation using the national image interpretability rating scale: erratum," Appl. Opt. 58, 8839-8839 (2019)
https://opg.optica.org/ao/abstract.cfm?uri=ao-58-32-8839

1. INTRODUCTION

Imaging systems have been in existence for centuries and for a variety of purposes. The desire for various types of imaging systems gives rise to the need for an image quality (IQ) estimation or prediction of the particular imaging system. This naturally leads to adjusting the IQ by changing aspects of the imaging system for either improved IQ or to gain in cost reduction by reducing IQ. In a sense, the image produced should be correct, not too good or not too poor, for the particular application. Unfortunately, there is not a perfect method to judge, predict, or measure, for that matter, the IQ. However, there are some generally accepted metrics [1]. The IQ metric applied for this work is the national image interpretability rating scale (NIIRS), which appears to have gained the most attention [1,2] in remote sensing. For this type of sensing, a simplified system is provided and shown in Fig. 1.

 figure: Fig. 1.

Fig. 1. Simplified imaging system and object/scene (Lambertian).

Download Full Size | PDF

There are several versions of the NIIRS equation, known as the general image quality equation (GIQE), in existence for the visible spectrum. In this work, GIQE version 3.0 is utilized due to its simpler and possibly more accurate [3] form and is

NIIRS=11.81+3.32log10(0.0254)+3.32log10(RERGSD)1.48HGSNR,
where GSD is the ground sampled distance (in meters, not inches), RER is the relative edge response, SNR is the signal-to-noise ratio, H is the overshoot-height due to a sharpening filter, and G is a noise gain term again associated with a sharpening filter. These terms are defined in depth in Refs. [1,2] for the visible spectrum (technically the RER and GSD are actually the geometric means, but if the values are square, then it is the same as in any one space. Along with this, the skewness in RER tends to cancel with the GSD skewness in version 3.0). Typically, a simulation is generated to evaluate each of these terms in Eq. (1), producing an NIIRS IQ score, but this can be time-consuming. This paper details how to initially circumvent a lengthy simulation development to give an effective (albeit optimistic) back-of-the-envelope IQ estimate (back-of-the-envelope calculation or Fermi question implies a simple calculation typically performed on the back side of an envelope). This can be thought of as the next higher level calculation to the simple radiometric calculation.

In place of a complicated image generating simulation, we propose a simplified technique to estimate the components in Eq. (1) and generate a design area for the desired/required IQ. With this, the hardware design can progress forward, and the initial vendor identification and procurement process of required hardware can begin, which typically has a long lead time. At the same time, a detailed simulation can be created to fine-tune the potential design at a later date [4]. In the following section, approximations are made to yield such an estimated IQ for an initial stab at an imaging system design from an IQ perspective.

2. IQ ESTIMATION

Estimation of the GIQE components is the first step in generating a back-of-the-envelope imaging system design. For this analysis, it is assumed that there is not a sharpening filter turned on, i.e., H=1 and G=1, but this could be added to the analysis should the desire be there. As stated, the application in the work is for remote sensing, leading to the approximation that the object is far away, i.e., zf. For GSD, the value is

GSD=|1zf|ppzfpp,
where f is the imaging system’s effective focal length, z is the distance to the object from the imaging system (see Fig. 1), and pp is the pixel pitch. Next, the signal-to-noise ratio (SNR) is calculated based on Refs. [2,5], and the result is
SNRπ4QE(r2r1)LAt(f/#)21(π4QE(r2)LAt(f/#)2)+(π4QE(r1)LAt(f/#)2)=π4QELAt(f/#)2(r2r1)r2+r1,
where QE is the quantum efficiency, r2 and r1 are the reflectivities of two Lambertian extended surfaces assumed to be 15% and 7%, respectively [2], L is the radiance on the scene (typically the Sun in many cases), A is the pixel size in area, t is the detector integration time, and f/#=fD (f-number) where D is the effective aperture diameter (entrance pupil). The noise (more accurately, the noise standard deviation) is the quadrature addition of the object reflectivities (15% and 7%, but the 15% dominates the noise calculation) since the two signals are added or subtracted in this particular case. The imaging system is assumed to be shot-noise-limited, which is very often the case and is done for ease (although, noise could be dominated by read noise in low light conditions).

Finally, the last component required in Eq. (1) is RER, and this is a bit trickier to estimate. Using the definition of the Fourier transform given in Goodman [6], it is assumed the optical transfer function (OTF) is a triangle function, which implies a square aperture in place of the more traditional circular aperture for mathematical ease and is diffraction-limited for this same ease. We are concerned with an approximate IQ estimate, so we leave out all other distortions and aberrations again (that is the job of a detailed simulation). The OTF is

OTF(σ)=1|σ1λf/#|,
where σ is a one-dimensional special frequency (working in one spatial dimension, either σx- or σy-, simplifies the problem and is not limiting), and λ is the optical wavelength (typically this is the average wavelength). It should also be noted that a square aperture and circular aperture have the same incoherent cutoff frequency of 1λf/# when the length of the sides of the square equals the diameter of the circular aperture. However, the circular aperture drops down a shade below the square aperture OTF in between the zero frequency and the cutoff [6]. To generate the RER, the object being imaged is an edge or step function also known as the Heaviside function. The Fourier transform of the Heaviside function is
O(σ)=F{H(x)}=12(δ(σ)+1jπσ),
where F{·} is the Fourier transform operator and H(x)=12(1+sgn(x)). The sgn(x) is the sign function and is defined in Ref. [6] but should call out that sgn(0)=0, which is equivalent to H(0)=12. Again, the Heaviside function represents an edge object in one dimension (effectively a slanted edge object in two dimensions) that, along with the OTF, determines the edge response. Multiplying the Fourier transformed object with the OTF in the frequency domain gives the Fourier transform of the image as
I(σ)=OTF(σ)·O(σ)=12(1|σ1λf/#|)(δ(σ)+1jπσ),
and the image of this one-dimensional edge is
i(x)=F1{I(σ)}=12(1+sgn(x)λf/#π2x).
Some care should be given to Eq. (7), as it is valid only in the limit according to Bracewell [7]. The transform is not valid at or even very near x=0, but only in the limit. And in this case, from the physics of the problem, it is only valid when 12(1+sgn(x)λf/#π2x)<12 for x<0 and 12(1+sgn(x)λf/#π2x)>12 for x>0 or, equivalently, λf/#π2<|x|. This is because the image value cannot be above the halfway mark for x<0 nor can it be below the halfway mark for x>0. If this were not the case, it would violate causality. The RER can now be calculated and is [2]
RER=i(pp2)i(pp2)=(12λf/#π2pp).
Putting the GIQE together gives
NIIRS=5.03+3.32log10(12λf/#π2ppzfpp)1π4QELAt(f/#)2·(r2r1)r2+r1,
and with some algebra, Eq. (9) reduces to
NIIRS=5.03+3.32log10(fzpp(12λf/Dπ2pp))2f/Dr2+r1πQELt·pp(r2r1),
where it has been assumed that pp=A, i.e., the pixel pitch equals the square root of a square pixel size. This implies a fill factor of 100% and a square pixel, which is not particularly unreasonable. Equation (10) is the key to an initial design of an imaging system and is a simple calculation, e.g., a spreadsheet calculation. The main IQ parameters are exposed and can be traded to yield a desired IQ.

Before moving to applying the above formula with an example, there is some optimization that should be examined or at least thought through. For most of the optical parameters, it is clear how the adjustments affect the IQ. For example, increasing D improves IQ. This increases the SNR and RER, yielding improved IQ with respect to NIIRS (although practically, this is to a point in that depth of focus can become an issue, and creating a diffraction-limited system with large diameter optics has its difficulties). On the other hand, the system focal length plays a role in the GSD, RER, and SNR. Increasing the focal length reduces the GSD and increases the RER, both of which are good for IQ, but it has the ill effect of reducing the SNR. These values can be played against each other to reach an optimum. To do this, the first derivative is taken with respect to the system focal length and set equal to zero

dNIIRSdf0.
This equation is then solved for the optimum focal length. After some calculus and algebra, the optimum focal length is not analytically attainable because it is a transcendental equation in focal length (as is the case for a pixel pitch optimization). Nonetheless, in the following section Eq. (10) is plotted as a function of focal length (or f/#), which shows a maximum numerically.

3. APPLICATION

As is customary, a simple example to show the usefulness of Eq. (10) is provided, and it is well worth the effort in this case. Using values for a low-Earth orbit (LEO) nadir looking satellite for a visible wavelength gives the parameters as z=800km, λ=600nm, pp=4μm, D=0.25 m, r2=0.15, r1=0.07, QE=0.6, t=1msec, and L=1.7×1019ph/sm2·sr(a slightly overcast day on Earth). As for the focal length chosen, a set of reasonable focal lengths are spanned (0.25mf8m). For these focal lengths explored, the largest lower limit of x for a valid Eq. (7) is 1.88 μm, and the value pp2 is evaluated at 2 μm. This meets the constraint discussed above in the paragraph after Eq. (7), i.e., x is large enough such that Eq. (7) is valid.

Each of the components that goes into computing the NIIRS score is plotted in Figs. 24, and as is traditionally done, the values are plotted as a function of f-number (f/#). The first plot is shown in Fig. 2 for the GSD. The RER is fundamentally bounded between zero and unity, and the plot in Fig. 3 abides by this fact.

 figure: Fig. 2.

Fig. 2. GSD plotted as a function of the imaging system’s f/#. The GSD is monotonically decreasing in f/#.

Download Full Size | PDF

 figure: Fig. 3.

Fig. 3. Imaging system’s RER as a function of f/#. The RER is monotonically decreasing in f/# between zero and one.

Download Full Size | PDF

 figure: Fig. 4.

Fig. 4. SNR as a function of the imaging system’s f/# as defined in Refs. [1,2]. The SNR is monotonically decreasing in f/#.

Download Full Size | PDF

The SNR is reduced as the f-number is increased (shown in Fig. 4) and is the last component for the GIQE. It should be recognized that the equation used from Ref. [5, see p. 84] has a small angle approximation (high f-numbers) imbedded within it. This implies the improvement of SNR as the f-number is reduced and is not to the extent shown in Fig. 4 or given in Eq. (3) at very low f-numbers.

Finally, Fig. 5 shows the NIIRS values for a string of f/#’s. Notice that none of the components that make up NIIRS has a maximum or minimum, but the NIIRS IQ score does. For the optimum with respect to IQ purposes, the NIIRS score is 4.1 for an f/15 system. From Ref. [2], the ability to identify, detect, etc., is detailed out for various NIIRS scores. Given the 4.1 NIIRS value for this example, the performance can be evaluated based on Ref. [2], and an example of a NIIRS value greater than or equal to 4 and less that 5 is “Detect an open missile silo door.”

 figure: Fig. 5.

Fig. 5. Composite NIIRS score shown as a function of the imaging system’s f/#. For this system, the optimum or peak NIIRS value is 4.1 for an f/ 15 (or focal length of 3.75 m).

Download Full Size | PDF

However, by looking at Eq. (10), we find another interesting value to plot the components of NIIRS as well as NIIRS itself against instead of the traditional f-number. The value is dimensionless and is f/pp, which is equal to GSD/z (similar triangles), which is basically the system angular sampling and is a sole quantity (f-number is not because the function is not strictly a function of the ratio of focal length to aperture diameter). This is shown to be the case in Eq. (12) below,

NIIRS=5.03+3.32log10(1z(fpp)(12λ/Dπ2(fpp)))2/Dr2+r1πQELt·(r2r1)(fpp).
The NIIRS values are shown in Fig. 6 as a function of f/pp and show a peak for the ratio value of 950,000.

 figure: Fig. 6.

Fig. 6. NIIRS as a function of the imaging system’s f/pp. NIIRS value at 4.1 gives an optimum f/pp of 950,000.

Download Full Size | PDF

4. DISCUSSION

Without question, this is an optimistic IQ predictor/estimator. It is for a square aperture, not the more typical circular aperture [6], and the result does not include wavefront aberrations, atmospheric turbulence, focus error, motion blur, and pixel blurring [4], to name a few distortions that comprise the total OTF (or RER). With regard to the SNR, there has been no consideration to transmission losses due to the atmosphere and optical throughput, which would find their way into a more detailed simulation.

Another popular method to help determine imaging performance is the Rayleigh criterion. This criterion for a square aperture is λf/# (or 1.22λf/# for a circular aperture) when diffraction-limited. This value represents the minimum distance the imaging system can separate two points sources (provides a minimum angular resolution). This presumes sufficient sampling (minimum of two and preferably three samples across the two incoherently summed point spread function (PSF) peaks) and enough SNR. In some respects, the criterion can be thought of as a single value that represents the entire OTF. We have not discussed sampling or “Q” [4] for that matter, but they are important when it comes to IQ. In fact, increasing IQ can be traded for an amount of acceptable aliasing [4]. Continuing with Q, Eq. (8) can be rewritten in terms of Q=λf/#pp, giving,

RER=(12Qπ2).
Extending the example above, the NIIRS value can be plotted as a function of Q and is done in Fig. 7. There is a nice discussion on the “Story of Q” given in Ref. [4], but in this work, we wanted to show how NIIRS varies with Q with this back-of-the-envelope solution.

 figure: Fig. 7.

Fig. 7. NIIRS as a function of the “Q.” The Q value for a peak NIIRS is 2.2.

Download Full Size | PDF

The methodology used in the visible can be readily applied in the infrared. The analogous reference for the infrared is again written by Leachtenauer et al., provided in Ref. [8], where the infrared GIQE is defined. There are some differences, since the “a” and “b” values along with other constants are not the same as they are in the visible 3.0 equation [8]. The GIQE in the infrared is more like the visible version 4.0 than the 3.0 and is

NIIRS=10.751+3.32log10(0.0254)+log10(RERaGSDb)1.48H0.344GSNR,
where again the GSD is in meters, not inches. Substituting in the RER, GSD, and SNR calculated above gives
NIIRS=5.455+log10((12λf/Dπ2pp)a(zfpp)b)0.688f/DL2+L1πQEt·pp(L2L1),
whereas before, we have assumed the sharpening filter is not on. For infrared, the high and low radiance values are defined by the scene thermal emission at a high and low temperature and not by reflectivities, as in the visible spectrum. The value for the high radiance, L2, is determined based on emission radiance of a blackbody at 302 K and the low is L1 at 300 K [9]. The equation is for photon counting detector only and not a thermal detector, e.g., microbolometer. To convert to a thermal detector, the signal and noise must be again adjusted.

Lastly, only IQ is considered in this work, and the focal length optimum is also with respect to just IQ. But in a complete design the field-of-view (FoV) IQ also must be considered. Another important aspect not considered is size, weight, and power consumption, known as SWaP.

5. SUMMARY

A first-cut IQ design tool for an imaging system is provided as given in Eq. (10) to generate a general design area. The tool provides an optimistic NIIRS IQ score without the need for a complicated numerical simulation/model. This work provides an upper bound on the IQ, allowing a vendor’s search for hardware procurement and detailed simulation to progress more in parallel than in a series. Using this tool, an optimum focal length (or f/#) or f/pp ratio can be determined numerically for IQ.

Acknowledgment

I would like to thank many of the individuals who were encouraging over the many years of working in the area of optical sciences and other technical areas. These conversations were not necessarily related to this topic in particular but simply a seed. Without these intelligent individuals in my life such as Richard L. Kendrick of Raytheon Space and Airborne Systems, this work would not have been possible. Thanks to all in my life particularly Georgina Baca. Parts of this research were supported by Golden gate Light Optimization, LLC. In memory of Theodore Tarbell.

REFERENCES

1. J. C. Leachtenauer and R. G. Driggers, Surveillance and Reconnaissance Imaging Systems: Modeling and Performance Prediction (Artech House, 2001).

2. J. C. Leachtenauer, W. Malila, J. Irvine, L. Colburn, and N. Salvaggio, “General image-quality equation: GIQE,” Appl. Opt. 36, 8322–8328 (1997). [CrossRef]  

3. S. T. Thurman and J. R. Fienup, “Analysis of the general image quality equation,” Proc. SPIE 6978, 69780F (2008). [CrossRef]  

4. R. D. Fiete, Modeling the Imaging Chain of Digital Cameras (SPIE, 2010).

5. J. E. Greivenkamp, Field Guide to Geometric Optics, Vol. FG01 of SPIE Field Guides (SPIE, 2004).

6. J. W. Goodman, Introduction to Fourier Optics, 2nd ed. (McGraw-Hill, 1996).

7. R. N. Bracewell, The Fourier Transform and Its Applications, 2nd ed. (McGraw-Hill, 1986), pp. 130–131.

8. J. C. Leachtenauer, W. Malila, J. Irvine, L. Colburn, and N. Salvaggio, “General image-quality equation for infrared imagery,” Appl. Opt. 39, 4826–4828 (2000). [CrossRef]  

9. M. T. Eismann and S. D. Ingle, Utility Analysis of High-Resolution Multispectral Imagery (Environmental Research Institute of Michigan, 1995), Vol. 3, p. 36.

Cited By

Optica participates in Crossref's Cited-By Linking service. Citing articles from Optica Publishing Group journals and other participating publishers are listed here.

Alert me when this article is cited.


Figures (7)

Fig. 1.
Fig. 1. Simplified imaging system and object/scene (Lambertian).
Fig. 2.
Fig. 2. GSD plotted as a function of the imaging system’s f/#. The GSD is monotonically decreasing in f/#.
Fig. 3.
Fig. 3. Imaging system’s RER as a function of f/#. The RER is monotonically decreasing in f/# between zero and one.
Fig. 4.
Fig. 4. SNR as a function of the imaging system’s f/# as defined in Refs. [1,2]. The SNR is monotonically decreasing in f/#.
Fig. 5.
Fig. 5. Composite NIIRS score shown as a function of the imaging system’s f/#. For this system, the optimum or peak NIIRS value is 4.1 for an f/ 15 (or focal length of 3.75 m).
Fig. 6.
Fig. 6. NIIRS as a function of the imaging system’s f/pp. NIIRS value at 4.1 gives an optimum f/pp of 950,000.
Fig. 7.
Fig. 7. NIIRS as a function of the “Q.” The Q value for a peak NIIRS is 2.2.

Equations (15)

Equations on this page are rendered with MathJax. Learn more.

NIIRS=11.81+3.32log10(0.0254)+3.32log10(RERGSD)1.48HGSNR,
GSD=|1zf|ppzfpp,
SNRπ4QE(r2r1)LAt(f/#)21(π4QE(r2)LAt(f/#)2)+(π4QE(r1)LAt(f/#)2)=π4QELAt(f/#)2(r2r1)r2+r1,
OTF(σ)=1|σ1λf/#|,
O(σ)=F{H(x)}=12(δ(σ)+1jπσ),
I(σ)=OTF(σ)·O(σ)=12(1|σ1λf/#|)(δ(σ)+1jπσ),
i(x)=F1{I(σ)}=12(1+sgn(x)λf/#π2x).
RER=i(pp2)i(pp2)=(12λf/#π2pp).
NIIRS=5.03+3.32log10(12λf/#π2ppzfpp)1π4QELAt(f/#)2·(r2r1)r2+r1,
NIIRS=5.03+3.32log10(fzpp(12λf/Dπ2pp))2f/Dr2+r1πQELt·pp(r2r1),
dNIIRSdf0.
NIIRS=5.03+3.32log10(1z(fpp)(12λ/Dπ2(fpp)))2/Dr2+r1πQELt·(r2r1)(fpp).
RER=(12Qπ2).
NIIRS=10.751+3.32log10(0.0254)+log10(RERaGSDb)1.48H0.344GSNR,
NIIRS=5.455+log10((12λf/Dπ2pp)a(zfpp)b)0.688f/DL2+L1πQEt·pp(L2L1),
Select as filters


Select Topics Cancel
© Copyright 2024 | Optica Publishing Group. All rights reserved, including rights for text and data mining and training of artificial technologies or similar technologies.