Stochastic inversion of ocean color data using the cross-entropy method

Mhd. Suhyb Salama; Fang Shen

doi:10.1364/OE.18.000479

1. Introduction

The aim of ocean color data inversion is to determine inherent optical properties of the water upper-layer from observed remote sensing reflectance. Theoretically, inherent optical properties (IOPs) can be derived from the radiance distribution and its depth derivative [1]. Ocean color data provide, however, the radiance at the surface in a few directions only. Therefore, semi-analytical models were developed to facilitate the inversion of ocean color data. These models are based on approximations that link remote sensing reflectance and the IOPs [2–6]. The general form of these models is that water remote sensing reflectance is proportional to the backscattering coefficient and inversely proportional to the absorption coefficient. Inversion of ocean color data using semi-analytical models has been investigated in many studies [7–12]. The scientific procedure to derive IOPs from ship/space borne ocean color data can be divided into three steps: i- forward modeling, use a semi-analytical ocean color model; ii- parametrization, define the minimal set of IOPs whose values completely characterize the observed remote sensing reflectance using the forward model; iii- inversion, use of ocean color observations to infer the actual values of IOPs. While the first two steps are mainly inductive, the third step is deductive. This paper is devoted to the third step, i.e. explain, implement and validate an inversion method for ocean color data. It is out of the scope of this work to review the literature on ocean color inversion methods and compare them, one may consult [13] for more details on this subject.

Generally, inversion of ocean color data falls under one of two methods, namely analytical deterministic or stochastic methods. Deterministic methods are based on gradient or pseudo-gradient techniques and have been extensively used for ocean color inversion [11, 14–16]. The main drawback of gradient-based methods is that they do not properly handel non-convex objective functions or many local optima. On the other hand, stochastic methods are less prone to be trapped in local optima and can deal with non-convex functions. The basic idea of stochastic methods is to systematically partition the region of feasible solutions into smaller subregions and move between them using random search techniques. Stochastic optimization techniques have been recently adopted for ocean color inversion, e.g. genetic algorithms [17, 18]. Maritorena et al. [11] used simulated annealing to optimizing for the parameters in their semi-analytical ocean color model. On the other hand, Kempeneers et al. [19] employed simulated annealing to derive ocean constituents. Swarm optimization [20] and ant colony method [21] were also used to derive water optical constituents from remote sensing reflectance. Salama and Stein [22] used entropy-based method to decompose and quantify the errors of derived IOPs.

There are other stochastic methods that have not been investigated yet for ocean color inversion, e.g. threshold acceptance [23], stochastic comparison method [24], tabu search [25] and cross-entropy [26]. Cross-entropy is one of the most significant developments in stochastic optimization and simulation in recent years [27]. It is a stochastic iterative method that searches for sequence of solutions which converges probabilistically to the optimal solution. The main objective of this paper is to develop a stochastic inversion algorithm for ocean color data based on the cross-entropy method. The performance of the algorithm and its stability to noise will be analyzed using simulated data. Validation exercises will be carried out against ocean color data of in-situ measurements and satellite match-up.

The reminder of this paper is organized as follow: in Section 2 we describe the ocean color paradigm: used ocean color model and its parametrization. The principles of cross-entropy are introduced in Section 3.1, followed by mathematical derivations for ocean color inversion in Section 3.2. The implementation of the inversion algorithm is presented in Section 3.3. In-situ measurements and ocean color satellite match-up data are described in Section 4 along with the employed initial values and statistical analysis. Inversion’s performance and stability are analyzed in Section 5, followed by extensive validation exercise with in-situ data in Section 6. Thoroughly discussions of the developed algorithm, its results, limitations and possible extension are presented in Section 7. Main conclusions of this work are listed in Section 8.

2. Semi-analytical ocean color model

Remote sensing reflectance leaving the water surface can be related to physical and biological properties of water constituents using the model [7]:

R s_{w} (λ) = \frac{t}{n_{w}^{2}} \sum_{i = 1}^{2} g_{i} {(\frac{b_{b} (λ)}{b_{b} (λ) + a (λ)})}^{i}

where, Rs _w(λ) is remote sensing reflectance leaving the water surface at wavelength λ;g_i are constants taken from [7]; t and n _w are the sea-air transmission factor and water index of refraction, respectively. Their values are taken from [7, 11,28]. The parametersb_b(λ) anda(λ) are the bulk backscattering and absorption coefficients of the water column, respectively.

Four independently-varying constituents are considered to affect the optical properties of the water column, namely: phytoplankton green pigment i.e. chlorophyll-a(Chla), dissolved organic matter or gelbstoff, detritus and suspended particulate matter (SPM). The bulk absorption a(λ)and backscattering b_b(λ) coefficients are modeled as being the sum of absorption and backscattering effects from water constituents:

a (λ) = a_{w} (λ) + a_{chl a} (λ) + a_{dg} (λ)

b_{b} (λ) = b_{b, w} (λ) + b_{b, spm} (λ)

where, the subscripts denote the contribution of: water (w), chlorophyll-a(chla), combined effects of detritus and glebstoff (dg) and suspended particulate matter (spm). The absorption and backscattering coefficients of water molecules, a _w andb _b,w, were obtained from [29, 30],respectively. The total absorption of chlorophyll-aa _chla is approximated as [14]:

a_{chl a} (λ) = (a_{0} (λ) + a_{1} (λ) \ln a_{chl a} (440)) a_{chl a} (440)

where a ₀(λ) anda ₁(λ) are empirical coefficients. The absorption effects of detritus and gelbstoff are combined due to the similar spectral signature [11] and approximated using the model[31]:

a_{dg} (λ) = a_{dg} (440) \exp [- s (λ - 440)]

where s is the spectral exponent. The backscattering coefficient of SPMb _b,spm is parameterized as [32]:

b_{b, spm} (λ) = b_{b, spm} (550) {(\frac{550}{λ})}^{y}

where y is the spectral shape parameter of backscattering. The scattering phase function of SPM was assumed to follow the Petzold’s San Diego Harbor scattering phase function [33].

Derived IOPs are called the set of IOPs and expressed in a vector notation asiop [16]:

iop = [a_{chl a} (440), a_{dg} (440), b_{b, spm} (550), y, s]

3. Inversion of ocean color data using the cross-entropy method

3.1. Entropy and cross-entropy

Entropy is a numerical measure of information associated with probability distribution of derived IOPs or any hydrological parameter [34].For a population with N sets of IOPs it is expressed as the Shannon entropy [35]:

𝓗 (g) = - 𝔼 \ln g (iop) \approx - \sum_{1}^{N} g (iop) \cdot \ln g (iop)

where iop is the set of derived IOPs [Eq. (7)]; 𝔼 is the expectation;g(iop) is the probability distribution function (pdf) of the IOPs. The base of the logarithm is taken as e in which case the entropy is measured in “nats”. Shannon’s entropy calculated by Eq. (8) is defined to be the average amount of information contained in the IOPs. It should be noted that the entropy of IOPs does not depend on the actual values of IOPs, but only on its distributiong(iop).

The joint-entropy between two pdfs g and f is:

𝓗 (g, f) = - \sum_{1}^{N} g (iop) \cdot \ln f (iop)

The cross-entropy is the Kullback-Leibler distance [36] which measures the divergence between the two distributiong and f as:

𝓓 (g, f) = - \sum_{1}^{N} g (iop) \ln \frac{g (iop)}{f (iop)}

Based on Eq. (10), the lower the expected cross-entropy, the closer is the distribution f to g.Therefore, minimizing the cross-entropy leads to the maximal similarity between the two distributions and vice versa. Eq. (10) canbe rewritten in terms of Eqs. (8) and (9) as:

𝓓 (g, f) = 𝓗 (g, f) - 𝓗 (g)

where, 𝓗(g) and 𝓗(g,f) are the entropy of g and the joint-entropy betweeng and f, respectively.

3.2. Inversion

The unknown IOPs [Eq. (7)] can be derived by matching modeled reflectance [Eq. (1)] to observed water remote sensing reflectance. The sought solution iop is the set of IOPs that produces the best-fit spectrum to the observed spectrum,Rs _w(λ). The best-fit spectrum can be searched by designing a performance function that measures the agreement between the observed and modeled reflectance. This function ϕ(iop)is arranged so that small values represent a close agreement, i.e. least-square:

ϕ (iop) = \sum_{i = 1}^{m} {[R s_{w} (i) - R s_{w_{m}} (i)]}^{2}

where, m is the number of spectral bands;Rs _w(i) andRs _{w_m}(i) are the observed and modeled water remote sensing reflectance at the ith wavelength, respectively. The objective now is to search for a set of IOPs, iop, that minimizes Eq. (12) to a very small valueε _min, such:

ε_{min} = min ϕ (iop)

The basic idea of cross-entropy method [27,37] is to generate a family of probability distribution functions pdfs for the IOPs, f, and then converge them to an optimal pdfg. The optimal distribution g has all of its mass concentrated around the sought solution of the inversion problems iop, i.e. the variance of the optimal pdf is zero.

We start by converting the deterministic problem in Eq. (12) to a random one. The randomization can be performed by computing the probability of ϕ(iop) as being less than a certain value ε such:

ℓ = ℙ (ϕ (iop) \leq ε)

Equation (14) can be associated with anestimation problem of the form [38]:

ℓ = 𝔼_{f} I_{{ϕ (iop) \leq ε}} = \sum_{i = 1}^{N} I_{{ϕ ({iop}_{i}) \leq ε}} f ({iop}_{i}, {iop}^{*})

where 𝔼_f is the expectation with respect to the pdff;I _{{ϕ(iop_i)≤ε}} is the indicator function, i.e. indicating thatϕ(iop _i) has values ≤ε;f(iop _i,iop ^*) are the discrete probability densities of IOPs;iop _i is a randomly generated set of IOPs fromf(iop _i,iop ^*) using initial mean iop ^*. Equation (15) is generally called the associated stochastic problem (ASP).

In importance sampling method [39], the ASP (15)can be rewritten using the optimal density g as:

ℓ = 𝔼_{g} I_{{ϕ (iop) \leq ε}} \frac{f (iop, {iop}^{*})}{g (iop)}

The change of measure with density in Eq. (16) is [26]:

g^{*} (iop) = \frac{I_{{ϕ (iop) \leq ε}} f (iop, {iop}^{*})}{ℓ}

The idea now is to choose the reference parameter iop ^* such that the distance between f and g ^* is minimal. A suitable measure is the cross-entropy in Eqs. (10) and (11). The optimal pdf can be obtained by minimizing 𝓓(g ^*,f) in Eq. (11) which is equivalent to maximizing -𝓗(g ^*, f) inEq. (9). Substituting Eq. (17) in (9) and maximizing it for the reference parametersiop ^* we will have:

iop = \arg max_{{iop}^{*}} = 𝔼_{f} I_{{ϕ (iop) \leq ε}} \ln f (iop, {iop}^{*})

For pdfs belonging to the natural exponent family solution of (18) can analytically be derived as [26,27]:

{iop}^{*} = \frac{\sum_{i = 1}^{N} I_{{ϕ ({iop}_{i}) \leq ε}} {iop}_{i}}{\sum_{i = 1}^{N} I_{{ϕ ({iop}_{i}) \leq ε}}}

Equation (19) can be used as an update formula of iop ^* in our iterative procedure as described in the next Section 3.3. Equation (19) is basically the average of IOP vectorsiop _i=1,…N that produced best-fit spectra to the observed spectrum, thusϕ_i≤ε.

The proofs of Eqs. (15) to (19) are out the scope of this work. The detailed derivations of the cross-entropy method and various applications are given in [26,27]. Their symbols are used, as possible, in this manuscript. De Boer et al. [40] provides an excellent tutorial on the cross-entropy method. Its application to continuous multi-extremal optimization is given in [37] with approachable examples. Many links to references and examples on cross-entropy method can be found athttp://www.cemethod.org.

3.3. Algorithm

Practically, the IOPs are derived using iterative procedure such thatε approaches ε _min and the probability of the solution iop approximates 1, i.e. a degenerated pdf aroundiop with zero variance.

The algorithm is implemented in the following steps:

For the first iteration, t = 0, choose the initial values of the meanμ ₀ = iop ^* ₀ and standard deviation σ ₀. Sections 4.2 and 7.6 give more details on initial values.
Generate IOPs vectors,iop ₁,…,iop _N fromf(iop _i,iop ^*), e.g. normal distribution N(μ _t,σ _t). Accept or reject each generated IOP depending whether its value is within a predefined physical bounds, i.e. constraints.
Forward the generated IOPs vectors to spectra using Eq. (1). Keep the reference between each IOPs set and its remote sensing reflectance.
Compute the performance functionϕ(iop _i) in Eq. (12).
Set ε equal to the sample quantile (1 - ρ),where ρ is predefined value, e.g. ~10^-2 and evaluate the indicator functionI _{ϕ(iop_i)≤ε}. This is simply achieved by ordering the values ofϕ(iop _i) and selecting the elite samples that belong to the sample quantile (1 - ρ).
Corresponds these elite samples to their “elite” IOPs.
Derive an updated value of iop ^* from Eq. (19) and compute the new value ofσ _t+1 from the elite sample of IOPs.
Forward the updated IOPs set iop ^* using Eq. (1) and evaluate the least square in Eq. (12).
If ε = ε _min andσ _t ≤ γ, whereγ is a predefined small value ~ 10^-5, terminate, otherwise set t = t+1 and iterate from step 2 to9. The criterion σ _t ≤γ is equivalent to ℓ ~ 1 in Eq. (16). This stopping criterium is adjusted for noisy data to cope with fluctuated values of ε as follow. Keep track of the last best ten candidates which have smallest values of ε.Iterate while the variance of these ten candidates is larger than, say 10^-5.

4. Materials and analysis

4.1. Data sets

Evaluation and validation of the proposed inversion method is carried out using four data sets: simulated, noisy-simulated, in-situ measured and ocean color match-up data sets. Simulated data consists of radiative transfer simulations, at 30°sun zenith, of synthesized IOPs [28, IOCCG dataset]. IOCCG spectra were simulated assuming the solar irradiance model of Gregg and Carder[41] and a cloud free sky. A wind speed of 5 m/s is applied, and the water body is assumed homogeneous. Spectral bands were set from 400 nm to720 nm, with a spacing of 10 nm. Inelastic scattering, such as Raman scattering, chlorophyll fluorescence, etc. were excluded from the simulations. Noisy-simulated data is basically the simulated IOCCG spectra with random noises added to them.In-situ measured data of water radiance and IOPs are taken from the NOMAD data set, version 2.a [42, NOMAD data set]. This version of the NOMAD data set was developed in support of the Ocean Optics XIX, IOP Algorithm Workshop (2008). Ocean color match-up data consist of observations from the Sea Wide Field-of-view Sensor (SeaWiFS) that were concurrent with the NOMAD data set, version1.3 [42, SeaWiFS match-up data set]. More details on NOMAD data sets can be found on SeaWiFS Bio-optical Archive and Storage System (SeaBAAS):http://seabass.gsfc.nasa.gov/seabasscgi/nomad.cgi.

4.2. Analysis

The generated IOPs are constrained to their physical bounds as described in 3.3. The constraints are set to 10^-4 and 100 m^-1 for{a _chla(440),a _dg(440),b _spm(550)}, and between 10^-4 and 2.5 for y and between 10^-4 and 0.03 nm^-1 for s. The initial values of μ ₀ = iop are taken from Lee etal. [14] with s = 0.011nm^-1, and briefly described in appendix (A). In case of limited number of bands, mostly the red bands in NOMAD and SeaWiFS match-up, we used a fixed initial value forb_b, as 0.025 m^-1 instead of Eq. (A.3). Initial values ofσ ₀ are set as ϛμ ₀,where ϛ is a factor varying between 2 and 10 at step 2 interval.This choice was to make the algorithm self contained and to limit the freedom ofinitialization to μ ₀. Optimal values ofσ ₀ are then searched using simulated annealing method[43]. Energy function is set to Eq. (15), with the updating in Eq. (19), and “temperature” parameters is set to σ ₀. The procedure iterates through a range of values and selects σ ₀ that has the minimalε. Using simulated annealing to initialize σresults in an algorithm with two loops, an outer loop iterating throughσ ₀ and an inner loop, which is basically the algorithm in(3.3). A normal distribution, N(μ, σ), is used to generate the pdfs of IOPs. This, however, does not imply that the actual variability of IOPs follows the normal distribution, because the variance of the final pdf is zero. The number of samples in the generated pdfs is set to 100 and maximum number of iteration is set equal to 100.

Goodness-of-fit parameters (slope, intercept, bias and R ²)between derived and known values are computed using model-II regression [44] for log-transformed data. The slope and intercept are for a model-II regression line between derived and known values. Perfect fit leads to unity slope and zero intercept and bias. The intercept is computed for the derived values on the Y axis. The values of R ² is computed as the squared correlation coefficient between derived and known values. The bias is estimated for the log transformed data as:

bias = 𝔼 (\log {iop}_{known} - \log {iop}_{derived})

Computing the bias as shown in Eq. (20)means that negative bias indicates overestimation while positive bias indicates underestimation. The root mean square of error RMSE is calculated using Eq. (2.1) from[28] as:

RMSE = {[{(n - 2)}^{- 1} \sum_{i}^{n} {(\log {iop}_{known} - \log {iop}_{derived})}^{2}]}^{0.5}

where n is the number of data points. The fraction of valid retrievalf_r is also computed during the statistical analysis. Derived IOP is considered un-valid if its value falls outside the defined constraints or trapped tozero solution. In our analysis we will use goodness-of-fit parameters obtained from inversion of noise-free IOCCG data set as a benchmark for comparison. In other words, we will compare goodness-of-fit parameters resulting from the inversion of noisy, NOMAD and SeaWiFS match-up data sets to those computed from the optimal situation of noise-free IOCCG data set.

5. Performance

5.1. IOCCG data set

Figure 1 shows derived versus known values of IOPs from the IOCCG simulated data set. Goodness-of-fit parameters are detailed in Table 1. Derived values of Chlaabsorption fits the measured values with 5% off-unity slope and positive intercept~0.32. The RMSE value is below 0.35 and, in general, retrieveda _chla(440) values are overestimated with negative bias -0.16 but with strong correlation to known values,R ²= 0.963. The slope of deriveda _dg(440) deviate from unity by up to 13% with intercept~0.16 and RMSE ~0.45. The a _dg(440) values are underestimated, positive bias of 0.15, with high R ²~0.97. The over/under estimation of respectivelya _chla(440) anda _dg(440) seem to be compensated in the computed to talabsorption. The slope of derived a(440) is 9% off-unity with positive intercept of 0.144. The RMSE ~0.19 and bias ~0.003 values are lower than that of individual absorption coefficients, i.e.a _chla(440) ora _dg(440). The opposite could be observed for the increasedR ² value up to 0.99. The best results, in term of the overall goodness-of-fit, are obtained for the backscattering coefficient with slope deviating from unity by 2% and negative intercept ~-0.16. RMSE value ofb _b,spm(550) is the lowest among derived IOPs. The positive bias also shows that, in general, the derived values ofb _b,spm(550) are underestimated with high correlation, R ² ~0.99, to known values however.

Table 1. IOCCG: RMSE and regression (type II) goodness-of-fit parameters between derived and known values of IOCCG data set. n is the number of data points.

View Table | View all tables in this article

5.2. Satiability to noise

The stability of the proposed inversion to sensor noise is analyzed by adding realistic values of noise-equivalent-radiance to simulated spectra of IOCCG data set. Recently reported noise-equivalent-radiance values of the Medium Resolution Imaging Spectrometer(MERIS) sensor [45] were used to simulate the noise. MERIS noise-equivalent-radiance were converted to remote sensing reflectance using the table of Neckel and Labs [46], hereafter called MERIS-NER. The noise were generated using the normal distribution with mean equal to MERIS-NER and standard deviation equal to that of IOCCG simulated spectra. Additional condition was imposed such that the generated values are within ±70% of the original signal. It is believed that a maximum of ±70% off the observed reflectance value isa realistic threshold for an acceptable noise level. This mechanism of adding noise will leverage an average noise level, i.e. noise to signal ratio, about ± 32% of the original reflectance. The average is calculated over all spectral bands and over all spectra. Table 2 shows the introduced noise level per wavelength averaged over the IOCCG spectra as obtained from the noise level of MERIS-NER and standard deviation of IOCCG reflectance.

Fig. 1. IOCCG: Derived versus known values of IOPs of the IOCCG data set. (a): absorption of Chla at 440 nm; (b): absorption ofdg at 440 nm; (c): the backscattering of SPM at 550 nm; and (d): the total absorption at 440 nm.
Download Full Size | PDF

Table 3 shows goodness-of fit parameters between derived and known IOPs as computed from the model-II regression. Derived values ofa _chla(440) are the most affected among other IOPs. The slope of the regression line is now off by 50% from the 1:1 line with large intercept ~1.7. The RMSE increased by five folds, bias by four folds and theR ² values is reduced to 0.712, in comparison to noise-free data(Table 1). Goodness-of-fit of deriveda _dg is also degraded due to noise, to a lesser extent thana _chla(440). The slope is ~40% off-unity and RMSE and bias both increased by three folds, whereas R ² is reduced to 0.834. Derived values of total absorption coefficient seem to be stable to noise. The slope, intercept and RMSE roughly increased by two-to-three folds in comparison to their counterparts in Table 1. The bias, is an order of magnitude larger than that in noise-free case with negative value however, indicating overestimation. R ² is slightly reduced to 0.96.Derived b _b,spm(550) values have 28% off-unity slope and 1.2 intercept. There is almost five folds increase in the RMSE value and two folds increase in the bias value. The R ² is reduced to 0.85 in comparison to 0.99 of the noise-free case. In general, IOPs values are underestimated at the lower-end, small values, and overestimated at upper end, i.e. large values.

Table 2. Noise level: Introduced noise level, per wavelength, averaged over the spectra of IOCCG data set. MERIS-NER and IOCCG standard deviation (std)were used to generate the noise from a normal distribution. MERIS-NER is shown as a percentage of the average original spectra. Only few wavelengths are shown.

View Table | View all tables in this article

Table 3. Noisy IOCCG: RMSE and regression (type II)goodness-of-fit parameters between known values and derived IOPs from noisy IOCCG spectra.n is the number of data points.

View Table | View all tables in this article

6. Validation

6.1. NOMAD data set

NOMAD data set consists of matches between measurements of remote sensing reflectance and IOPs: 1279 matches for a _chla(443), 1126matches for a_dg(443) and 369 matches for the back-scattering coefficient b_b,spm(405). The combined effect of detritus and gelbstoff, in Eq. (5), was assumed to be comparable to the sum of measured values of detritus and gelbstoff absorptions. Figure 2 shows derived versus measured values of IOPs andTable 4 details goodness-of-fit parameters.

The method is adequate to derive the absorption coefficients of Chla,dg and the total absorption. The R ² values are above 0.8 with slope and intercept that are of comparable magnitudes to the values in Table 1. The slope of deriveda _chla(440) is 4.5% off-unity with intercept of about -0.31. The RMSE value is merely twice, 0.64, as that of simulated IOCCG data while the bias is positive and about 0.18. Goodness-of-fit parameters of deriveda _dg(440) are slightly degraded when compared to the values inTable 1. The slope is off the 1:1 line by 20%with negative intercept ~-0.26. The main observation is that the derived values ofa _dg(440) are underestimated with positive bias of ~0.8.The fit of derived total absorption coefficient follows the same trend as that presented inTable 1, i.e. it has better fit to measured values than a _chla(440) ora _dg(440) alone with R ²~0.91. The slope is less than 4% off-unity with negative intercept. RMSE and bias values are, respectively, about 3 and 35 folds larger than those in Table 1.

Fig. 2. NOMAD: Derived versus measured values of IOPs of the NOMAD data set. (a): absorption of Chla at 440 nm; (b): absorption ofdg at 440 nm; (c): the backscattering of SPM at 550 nm; and (d): the total absorption at 440 nm.
Download Full Size | PDF

Derived values of backscattering coefficients are less accurate in comparison to other IOPs and Table 1. The values ofb _b,spm(550) are underestimated at the lower end and over estimated at the higher end. There are also 5% of non valid retrievals. The solution was, basically, trapped to zero in these data points. The regression line has aslope which is 34% off-unity and a large intercept up to 2. RMSE and bias values are two folds, 0.363 and -0.108, larger that their counterpart in Table 1. The negative bias ofb _b,spm(550) indicates that there is slight overestimation. R ² values is now 0.73 as compared to 0.99 inTable 1. Possible reasons for deriving less accurate IOPs from the validation data sets are discussed later in Section 7.

Table 4. NOMAD: RMSE and regression (type II) goodness-of-fit parameters between measured and derived IOPs IOPs values using the NOMAD data set.n is the number of data points.

View Table | View all tables in this article

6.2. SeaWiFS match-up data set

Match-up data set contains spectra of SeaWiFS and NOMAD-measured IOP(s) that have similar geographical location and sampling time. SeaWiFS match-up data set consisted of 132matches for the absorption coefficients and 29 matches for the backscattering coefficient.Figure 3 shows derived IOPs from SeaWiFS spectra versus measured values from NOMAD data set. Goodness-of-fit parameters are detailed inTable 5.

Derived values of absorption coefficient are relatively accurate withR ² being above 0.7. Especially chlorophyll-a absorption withR ² value above 0.85, 5% off-unity slope and small intercept value ~-0.2. The RMSE value is comparable to that in Table 1 and did not exceed 0.5. The magnitude of total bias, 0.064, is even smaller than its counterpart in Table 1, in absolute values. The positive bias of 0.064 indicates that thea _chla(440) are slightly underestimated. Goodness-of-fit parameters of derived a _dg(440) are comparable to those obtained from the optimal case (Table1). The slope is still ~15% off-unity with intercept of about -1. The RMSE value has increased to 1 and R ² is reduced to 0.7. Total absorption is derived with 8% off-unity slope and -0.4 intercept. The RMSE and bias values slightly increased, compared to Table 1, to 0.45and 0.25. There is also a strong correlation, R ² ~0.91between derived and measured values. Derived values of backscattering coefficient, similar to the NOMAD case, are less reliable. Model-II regression line has a slope that is 17%off-unity and large intercept of 0.93. The RMSE values slightly increased to 0.25 and theR ² value dropped to 0.49.

Table 5. SeaWiFS match-up: RMSE and regression (type II)goodness-of-fit parameters between measured and derived IOPs values using SeaWiFS match-up data set. n is the number of data points.

View Table | View all tables in this article

7. Discussions

7.1. Performance with simulated data

The developed inversion algorithm showed a good performance with simulated IOCCG data. The slope of model-II regression line was close to unity for all IOPs with intercept that did not exceed 0.35. The RMSE values were also acceptable and less than 0.5 whereas theR ² value were above 0.96 for all IOPs and reaching 0.99 for the total absorption and backscattering coefficients. The RMSE value, ~ 0.32, of Chla absorption coefficient is one eighth of RMSE ~ 2.26 value reported in Salama et al. [16].They used the same ocean color model (section 2) and IOCCG data set but employing a nonlinear minimization technique, i.e. constrained levenberg-marquardt.

Fig. 3. SeaWiFS match-up: Derived SeaWiFS IOPs versus measured IOPs of the NOMAD data set. (a): absorption of Chla at 440 nm;(b): absorption of dg at 440 nm; (c): the backscattering of SPM at 550 nm; and (d): the total absorption at 440 nm.
Download Full Size | PDF

The underestimation of a _dg(440) and overestimation ofa _chla(440) compensated each other when computing the total absorption coefficient resulting in the smallest bias 0.003 which is almost 20 folds less than that of the backscattering coefficient.

The results of Fig. 1 support the statement made in Section 4.2: assuming a normal distribution to carry out the inversion does not imply that the actual variability of IOPs follows the normal distribution, because the variance of the solution pdf is zero. The discreet distribution of known values in Fig. 1(a) is uniform and close to uniform for known values in Fig. 1(b) and Fig. 1(c).

7.2. Stability

Measurement errors are related to intrinsic noise of the sensor and to residuals from subsequent corrections. Each sensor, ship or space borne, has a noise level that is related to its specification and sensitivity losses over time. In case of noisy data the cost function [Eq. (12)] will have a noise component. Generated IOPs and the updating in Eq. (19) are, however, independent of sensor noise. Therefore it is expected that the inversion method will filter out observation’s noise as it was shown in Fig. 4. There is no mathematical prove for this noise-filtering effects but we could numerically demonstrate it in Table 3. The ability of cross-entropy method to filter noise during optimization was also demonstrated by Rubinstein and Kroese [27, chapter 6, pp.203–226] using numerical examples. This is one of the major advantages of the proposed method. It can basically filter the noise up to a maximum of ±70% of the observed signal, equivalent to 32% averaged over spectra and wavelengths (Table 2). Figure 4 is shown to give a visual perception of the introduced noise level and the stability of inversion from noisy data. In Fig. 4 we plot the best-fit spectra to the noisy-data from which we derived the IOPs, noisy data and original noise-free spectra. We selected 6 spectra, indexed in the data itself as 1, 100, 200, 300, 400, 500. Figure 4 clearly shows how the method was able to filter considerable range of the noise. The derived IOPs are also acceptable for the imposed noise level withR ² > 0.7 and RMSE < 0.6 (Table 3). Chlorophyll-a absorption coefficient is the most affected by the introduced noise. This is because noise can randomly introduces dips in reflectance at the absorptions bands of chlorophyll-a around 440 nm and 665 nm which might be interpreted as high chlorophyll-a absorption. Derived values of absorption coefficient seem to be stable to sensor noise as the sum a _dg(440) anda _chla(440) may compensates the over/underestimations in both parameters. The small value of bias in derivedb _b,spm can be explained by the random nature of noises. Equal under/over estimation might be introduced by noise, thus eliminating each others when computing the bias.

7.3. Validation

The develop inversion algorithm was validate using ocean color data ofin-situ measurements and satellite match-up. The derived values ofa _chla(440) and a(440) are in general reliable. The slope values of model-II regression were 3–6 % off-unity with small values of intercept and bias and RMSE values below 0.6. TheR ² exceeded 0.8 and 0.9 for Chla and total absorption coefficients respectively. Derived values ofa _dg(440) were less accurate in both data sets with slope values about 15–20% off-unity and large values of intercept, bias and RMSE. An overestimation of a _chla(440) is associated with underestimation of a _dg(440) and vice versa. This type of degeneracy, is due to the overlapped absorption peaks at 440 nm of Chla,detritus and glebstoff. This trend was clearly illustrated by increased accuracy of total absorption coefficient over the accuracy of individual components, i.e.a _chla(440) anda _dg(440). Although derived values of backscattering coefficient were accurate and somehow stable to noise (Tables1 and 3), less reliable results of SPM backscattering were obtained from NOMAD and Sea-WiFS match-up data. Model-II regression line deviated by up to 35% from unity with large intercept values. Moreover, the solution was trapped to zero in 5% of backscattering data. The R ² values were reduced to 0.7 for the NOMAD and to 0.49 for SeaWiFS match-up. The sample size of SPM backscattering in the SeaWiFS match-up was, however, small 29. The small sample number increased the vulnerability of statistics in Table5 to be influenced by outliers, i.e. the two upper-left points in Fig. 3(c). For instance, removing these two points will improved the goodness-of-fit of derivedb _b,spm(550) in SeaWiFS match-up data set to:[slope = 1.286, intercept = 1.532, RMSE = 0.143, bias = 0.024 andR ² = 0.855]. The fraction of valid retrieval in this case will be fr = 27/29 = 0.931. There are other sources of uncertainty than the overlapped blue-absorption feature ofa _chla(440) and a(440) and the small sample of backscattering in SeaWiFS match-up data, these are discussed in (7.4).

Fig. 4. Examples of inversion from noisy-IOCCG data set. Six spectra of the noisy-IOCCG spectra are shown: (a) 1st; (b) 100th; and to (f) the last spectrum number 500. The black lines are spectra derived from the noisy spectra (dark gray lines) using the proposed stochastic inversion. Original noise-free spectra are shown in light gray lines. Maximum noise level is kept to ±70% of the original values.
Download Full Size | PDF

7.4. Uncertainty

Ocean color data inversion is associated with many sources of uncertainty that may affect the accuracy of derived IOPs. These sources can be summarized in four major components://

i- Spectral characteristics: NOMAD and SeaWiFS match-up spectra have on average 10 and 13 spectral bands, respectively. Their bands cover the spectral range from411 nm to 683 nm. In IOCCG data set we used 33 bands covering the spectral range from 400nm to 720 nm at 10 nm interval. The spectral characteristics of observed remote sensing reflectance has direct effect on the accuracy of derived IOPs. Large number of spectral bands will increase inversion’s degree-of-freedom, i.e. number of band minus number of unknowns. In turn, there will be a higher probability in obtaining a better spectral fit to the observed spectrum, hence derived IOPs are less ambiguous. The extension of the spectral bands to 720 nm may improve the accuracy of derived Chla absorption and SPM backscattering coefficient. The red absorption peak of Chla round 665 nm is unique and facilitate a good separation ofa _chla(440) anda _dg(440) from the total absorption. Further in the red part of the spectrum, >680 nm, water remote sensing reflectance is almost a direct function of SPM backscattering coefficient [47]. This direct functionality of b _b,spm(550) andRs _w(> 680) may stabilize the inversion at this spectral range leading to more accurate values ofb _b,spm(550).

ii- Measurement error: We showed in Section 5.2 that although derived IOPs were stable to sensor noise, their accuracies were slightly degraded. In addition to sensor noise, measurement errors could be related to residuals from subsequent corrections. For example, ocean color satellite spectrum contains additional residuals from atmospheric correction (AC) and post-AC adjustments [48] that affect the retrieval of IOPs. The accumulation of sensor noise and subsequent correction error will increase the uncertainty of derived IOPs [22].

iii- Model approximation: Employed approximations in the forward-model(Section 2) may not precisely describe the optical processes that have caused the observed spectrum. The model and its parametrizations did not take into consideration the bidirectional effects of remote sensing reflectance. Assuming an isotropic angular distribution of the up-welling radiation may impose an additional error component that will propagate to the derived IOPs [49,50]. Moreover, each of the used parametrization has its own limitation. Equation (4) ignores the different phytoplankton species and the wide variability of Chlaabsorption as measured in nature [51–53]. Equation(5) combines the absorption effects of detritus and gelbstoff in one spectral shape and magnitude. In Eq. (6) the backscattering ratio of SPM is set equal to the Petzolds integrated volume scattering data ~0.0182 which may not represent the actual values of sea particles [54]. Moreover, the power law of the backscattering spectral shape as modeled in Eq. (6) is inaccurate in the presence of absorption from non algae particles [55].

iv- Uniqueness: There are many sets of IOPs that may have caused the observed spectrum [56]. The proposed method derived most of these sets as elite samples and update the next iteration. The final solution can, there for, be regarded as the optimal average of all probable sets of IOPs. This averaging on the one hand reduces the probability of having a spiked solution, i.e. large error, and on the other hand derives a smoothed solution between all possible sets of IOPs. Derived IOPs have, thus, an intrinsic error component that is associated with the used inversion method.

Spectral characteristics and measurement error are the major reasons of deriving more accurate IOPs values from IOCCG than from NOMAD and SeaWiFS match-up data set.

7.5. Convergence and processing time

The convergence of the method is guaranteed [27,proposition 3.19 page 83]. Table 6 gives insight about the average number of iterations that were needed for convergence. From Table 6 we can approximate the maximum number of iteration at 95% of confidence to be 31 iterations, i.e. 15+2×8. In Table 6 we, however, did not consider the preprocessing iterations to select the optimal initial value of σ ₀, as described in Sections 4.2. The values of Table 6can roughly be multiplied by factor of five to consider the total iterations that were needed to select the optimal initial values σ ₀ and derive the IOPs.

The total processing times averaged over spectra per data set are shown in Table 7. Total processing time was calculated as the overall time needed to generate the initial pdf, using simulated annealing, and derive the IOPs using cross-entropy. Processing time per spectrum seems to be a function of bands number and the degree of noise. On average, noisy-IOCCG spectrum has the largest processing time followed by noise-free IOCCG, NOMAD and SeaWiFS spectra as averaged over the corresponding data set. There might be a trade-off between processing time and number of bands. NOMAD data set has an average of 10 bands per spectrum and longer processing time in comparison to SeaWiFS match-up data set which has an average of 13 bands per spectrum and the shortest processing time. These observations (Tables6 and 7) are not conclusive and are meant to give the reader an idea about number of iterations and processing time as related to used data sets.

Table 6. Number of iteration: Statistical parameters for number of iterations per spectrum averaged over each data set. Average and standard deviation values were rounded up to the closest integer.

View Table | View all tables in this article

Table 7. Total processing time: Statistical parameters for total processing time (in seconds) per spectrum averaged over each data set.

View Table | View all tables in this article

7.6. Limitation

The major limitation of the presented inversion algorithm is its sensitivity to the starting pdf. Initializing a pdf from a normal distribution, N(μ,σ), requires the two parameters μ ₀ =iop and σ ₀. The initial values ofσ ₀ were set as ϛμ0 to reduce the freedom of initialization. We used simulated annealing [43] to define the optimal value of σ ₀.Defining the value of σ ₀ using simulated annealing will limit the initialization to μ ₀ on the cost of speed and increased dependency on μ ₀. Values ofμ ₀ were initialized using the expressions of Lee etal. [14] as briefed in appendix (A).

7.7. Algorithm extension

The proposed inversion algorithm can easily be extended to include the variability of Chla absorption that may correspond to different phytoplankton species. There are two approaches to derive the natural variability of Chlaabsorption: i- include one unknown for each Chlaabsorption curve; ii- find the Chla absorption curve that results in the best-fit. The first approach, i, is based on the assumption that Chla absorption varies within one observed spectrum or satellite pixel. This approach seems to be more suitable for coarse satellite pixel and /or productive coastal and estuarine waters, where different phytoplankton species may co-exist and/or the variability of Chla absorption is large. Whereas the second approach,ii, assumes that Chla absorption is constant for a spectrum, but vary between different spectra, or satellite pixels. This approach,ii, is more suitable for open ocean waters and do not require large increase of the number of unknowns as in the first approach i. We, therefore applied approach ii to derive the variability of Chla absorption from NOMAD and SeaWiFS match-up data sets. Hereafter, we give an explanation on how to derive the variability of Chla absorption from ocean color data using a modified version of our inversion algorithm (3.3).

We simply added another unknown to Eq. (7). This unknown, denoted as ι, is the index of a Chla absorption curve in the used data set. We used reported values of normalized Chla absorption curves [51–53]. Equation (4)is adapted to becomea _chla(λ) =a _chla(440)×ā _chla(λ), whereā _chla(λ) is the normalized absorption coefficient, i.e. Chla absorption normalized to its value at 440 nm. Values of ι are drawn from a uniform distribution such that each normalized absorption coefficient gets an equal chance to be selected during the inversion. The elite samples in step 6, Section 3.3, is adjusted forι such that only the best elite sample is selected. We applied the modified algorithm on NOMAD and SeaWiFS match-up data sets, only derived values ofa _chla(440) anda _dg(440) will be discussed.

Deriving the natural variability of Chla absorption coefficient has slightly improved the accuracy of a _chla(440)and a _dg(440) when derived from NOMAD and SeaWiFS match-up spectra. The RMSE between known and derived values of normalized Chlaabsorption coefficients is computed using similar form to Eq. (21). Figure5 shows the RMSE values ofā _chla(λ)variability as derived from NOMAD and SeaWiFS match-up data sets. Since we normalized bya _chla(440), small values of RMSE are expected in the vicinity of the blue absorption feature of Chla. RMSE values, however, increase from 440 nm to longer wavelength reaching to a maximum value around 560nm and gradually decrease to a local minimum around 675 nm, the red absorption feature of Chla. RMSE values in Figs. 5(a)and 5(b) approximately mirror a standard Chla absorption curve with RMSE values of NOMAD being larger than those of SeaWiFS, especially at the red part of the spectrum. This is due to the reduced number and extent of NOMAD’s spectral bands. The missing bands were mainly in the red spectral region which increased the uncertainty of derived Chla absorption at the red bands. The advantage of this modification is that we were able to derive an indication of Chla variability, rather than using the fixed regression coefficientsa ₀ and a ₁ in Eq. (4) or one curve of normalized Chla absorption coefficient.

Potential implication of including phase function variation was not considered in the current work. Future work should, however, include changes in phase function, especially spectral ones.

8. Conclusions

In this paper we developed a stochastic inversion algorithm based on the cross-entropy method to derive inherent optical properties from ocean color data. The proposed inversion method was validated using four data sets: IOCCG and its noisy version, NOMAD and SeaWiFS match-up data sets. The followings are concluded:

Fig. 5. RMSE between derived and known values of normalized Chla absorptioncoefficient,ā _chla(λ) of(a): NOMAD data set and (b): SeaWiFS match-up data set.
Download Full Size | PDF

The method is self contained and easy to implement with far fewer inputs, e.g. information about curvature is not needed. The only needed input is the observed reflectance and some initial values, in case the initializations of Lee etal. [14] can not be applied.
In all validation exercises, the derived IOPs have acceptable accuracy,R ² > 0.7 (after removing the outliers) and RMSE did not exceed 1.1.
The derived IOPs are stable to sensor noise. The inversion can filter noise up to a maximum of ±70% of observed remote sensing reflectance.
The method can easily be modified to derive the variability of chlorophyll-a absorption coefficient that may correspond to different phytoplankton species.
The developed inversion was validated against variety of data sets, simulated, measured and satellite match-up. With careful initialization, the proposed inversion could be used for global derivation of IOPs from ocean color data.

A. Initial values

The initial values of μ ₀ = iop were adopted from the work of Lee et al. [14]and used to initialize the pdfs of IOPs from N(μ ₀,σ ₀):

a_{chl a} (440) = 0.072 r_{1}^{- 1.62}

a_{dg} (440) = a_{chl a} (440)

b_{b, spm} (550) = 30 a_{w} (640) R s_{w} (640)

y = 3.44 [1 - 3.17 \exp (- 2.01 r_{2})]

where, r ₁ and r ₂ are the ocean color ratios: r ₁ =Rs _w(440)/Rs _w(550) andr ₂ =Rs _w(440)/Rs _w(490), respectively. The values of σ ₀ were related toμ ₀ by a scaling factor ϛ and computed using simulated annealing.

Acknowledgement

The authors would like to thank NASA Ocean Biology Processing Group and individual data contributors for maintaining and updating the SeaBASS database; the European Space Agency (ESA) for supporting this research through the DRAGON-II project, Nr: 5351. This research was financed by the SKLEC grant, Nr. 2008KYYW04 from the State Key Laboratory of Estu-arine and Coastal Research (SKLEC), East China Normal University (ECNU). The authors are indebted to Yunxuan Zhou from SKLEC for providing scientific and technical supports. Two anonymous reviewers are acknowledged for improving the quality of the manuscript.

References and links

1. J. Zaneveld, “New developments of the theory of radiative transfer in the ocean,” in “Optical Aspects of Oceanography,” N. Jerlov, ed. (Academic Press,, London,1973), pp. 121–134.

2. S. Duntley, “Light in the sea,” J.Opt. Soc. Am. 53, 214–233(1963). [CrossRef]

3. H. Gordon, O. Brown, and M. Jacobs, “Computed relationship between the inherent and apparent optical properties of a flat homogeneous ocean,”Appl. Opt. 14,417–427 (1975). [CrossRef] [PubMed]

4. A. Morel and L. Prieur, “Analysis of variation in ocean color,” Limnology and Oceanography 22, 709–722(1977). [CrossRef]

5. R. Walker, Marine Light Field Statistics, Wiley serie on pure and Appl. Opt. (John Wiley & Sons, INC., NW,1994).

6. J. Kirk, “The relationship between the inherent and apparent optical properties of surface waters and its dependence on the shape of the volume scattering function,” (Oxford University Press, 1994), p. 283.

7. H. Gordon, O. Brown, R. Evans, J. Brown, R. Smith, K. Baker, and D. Clark, “A semianalytical radiance model of ocean color,” J. Geophys. Res. 93,10909–10924 (1988). [CrossRef]

8. S. Garver and D. Siegel, “Inherent optical property inversion of ocean color spectra and its biogeochemical interpretation I. Time series from the sargasso sea,” J. Geophys. Res. 102,18607–18625 (1997). [CrossRef]

9. F. E. Hoge and P. E. Lyon, “Satellite retrieval of inherent optical properties by linear matrix inversion of ocean radiance models: An analysis of model and radiance measurement errors,” J. Geophys. Res. 101, 16631–16648(1996). [CrossRef]

10. Z. Lee, K. Carder, C. Mobley, R. Steward, and J. Patch, “Hyperspectral remote sensing for shallow waters. 1.A semianalytical model,” Appl. Opt. 37, 6329–6338(1998). [CrossRef]

11. S. Maritorena, D. Siegel, and A. Peterson, “Optimization of a semianalytical ocean color model for global-scale applications,” Appl. Opt. 41, 2705–2714(2002). [CrossRef] [PubMed]

12. Z. Lee, K. Carder, and R. Arnone, “Deriving inherent optical properties from watercolor: A multiband quasian-alytical algorithm for optically deepwaters,” Appl. Opt. 41,5755–5772 (2002). [CrossRef] [PubMed]

13. H. Gordon, “Inverse methods in hydrologic optics,” Oceanologia 44,9–58 (2002).

14. Z. Lee, K. Carder, C. Mobley, R. Steward, and J. Patch, “Hyperspectral remote sensing for shallow waters: 2.Deriving bottom depths and water properties by optimization,”Appl. Opt. 38,3831–3843 (1999). [CrossRef]

15. A. Albert and P. Gege, “Inversion of irradiance and remote sensing reflectance in shallow water between 400 and 800 nm for calculations of water and bottom properties,” Appl. Opt. 45,2331–2343 (2006). [CrossRef] [PubMed]

16. M. S. Salama, A. Dekker, Z. Su, C. Mannaerts, and W. Verhoef, “Deriving inherent optical properties and associated inversion-uncertainties in the Dutch lakes,” Hydrology and Earth System Sciences 13,1113–1121 (2009). [CrossRef]

17. H. Zhan, Z. Lee, P. Shi, C. Chen, and K. Carder, “Retrieval of water optical properties for optically deepwaters using genetic algorithms,” IEEE Trans. Geosci. Remote Sens. 41, 1123–1128(2003). [CrossRef]

18. M. Chami and D. Robilliard, “Inversion of oceanic constituents in case i and ii waters with genetic programming algorithms,” Appl. Opt. 41, 6260–6275(2002). [CrossRef] [PubMed]

19. P. Kempeneers, S. Sterckx, W. Debruyn, S. De Backer, P. Scheunders, Y. Park, and K. Ruddick, “Retrieval of oceanic constituents from ocean color using simulated annealing,” in “Geoscience and Remote Sensing Symposium,” Vol. 8 of IGARSS (IEEE International, 2005), vol. 8 ofIGARSS, pp. 5651–5654.

20. W. Slade, H. Ressom, M. Musavi, and R. Miller, “Inversion of ocean color observations using particle swarm optimization,” IEEE Trans. Geosci. Remote Sens. 42, 1915–1923(2004). [CrossRef]

21. R. Souto, H. Campos Velho, S. Stephany, and M. Kampel, “Chlorophyll concentration profiles from in situ radiances by ant colony optimization,” in “4th AIP International Conference and the 1st Congress of the IPIA,” Vol. 124of Journal of Physics (2008), pp.1–12.

22. M.S. Salama and A. Stein, “Error decomposition and estimation of inherent optical properties,” Appl. Opt. 48,4947–4962 (2009). [CrossRef] [PubMed]

23. G. Dueck and T. Scheur, “Threshold accepting: A general purpose optimization algorithm appearing superior to simulated annealing,” Journal of Computational Physics 90,161–175 (1990). [CrossRef]

24. W. Gong, Y. Ho, and W. Zhai, “Stochastic comparison algorithm for discrete optimization with estimation,” in “Proceedings of the 31st IEEEConference,” Vol. 1 of Decision and Controle (1992), pp. 795–800.

25. F. Glover, “Tabu search: A tutorial,”Interfaces 20, 74–94(1990). [CrossRef]

26. R. Rubinstein, “The cross-entropy method for combinatorial and continuous optimization,” Methodology and Computing in Applied Probability 2,127–190 (1999). [CrossRef]

27. R. Rubinstein and D. Kroese, The Cross-Entropy Method: A unified approach to combinatorial optimization, Monte-Carlo simulation, and machine learning, Information Science and Statistics (Springer, New York,2004). [PubMed]

28. Z. Lee, “Remote sensing of inherent optical properties: Fundamentals, tests of algorithms, and applications,” Tech. Rep. 5, International Ocean-Colour Coordinating Group(2006).

29. R. Pope and E. Fry, “Absorption spectrum (380–700nm) of pure water: II, Integrating cavity measurements,” Appl. Opt. 36, 8710–8723(1997). [CrossRef]

30. C. Mobley, Light and water radiative transfer in natural waters(Academic Press, 1994).

31. A. Bricaud, A. Morel, and L. Prieur, “Absorption by dissolved organic-matter of the sea(yellow substance) in the UV and visible domains,” Limnology and Oceanography 26, 43–53(1981). [CrossRef]

32. O. Kopelevich, “Small-parameter model of optical properties of seawaters,” in “Ocean Optics,” Vol. 1Physical Ocean Optics , A. Monin, ed. (Nauka, 1983), pp.208–234.

33. T. Petzold, “Volume scattering functions for selected ocean waters,” in “Light in the Sea,” Vol. 12, J. Tyler, ed. (Dowden, Hutchinson and Ross, Stroudsburg, Pa. USA, 1977), pp.150–174.

34. V. Singh, Entropy-based parameter estimation in Hydrology, Vol. 30of Water Science and Technology Library (Kluwer Academic Publishers, Dordrecht, 1998).

35. C. Shannon, “A mathematical theory of communication,” Bell Syst. Tech. J. 27, 379–423,623-656 (1948).

36. S. Kullback and R. Leibler, “On information and sufficiency,” The Annals of Mathematical Statistics 22, 79–86(1951). [CrossRef]

37. D. Kroese, S. Porotsky, and R. Rubinstein, “The cross-entropy method for continuous multi-extremal optimization,” Methodology and Computing in Applied Probability 8, 383–407 (2006). [CrossRef]

38. R. Rubinstein and D. Kroese, Simulation and the Monte Carlo Method, Wiley Series in Probability and Statistics (2008), 2nd ed.

39. R. Srinivasan, Importance sampling: Applications in communications and detection (Springer-Verlag, Berlin, 2002).

40. T. De Boer, D. Kroese, S. Mannor, and R. Rubinstein, “A tutorial on the cross-entropy method,” Annals of Operations Research 134, 19–67 (2005). [CrossRef]

41. W. Gregg and K. Carder, “A simple spectral solar irradiance model for cloudless maritime atmospheres,” Limnology and Oceanography 35, 1657–1675 (1990). [CrossRef]

42. J. Werdell and S. Bailey, “An improved in-situ bio-optical data set for ocean color algorithm development and satellite data product validation,” Remote Sensing of Environment 98, 122–140 (2005). [CrossRef]

43. S. Kirkpatrick, C. Gelatt, and M. Vecchi, “Optimization by simulated annealing,” Science 220, 671–680 (1983). [CrossRef] [PubMed]

44. E. Laws, Mathematical Methods for Oceanographers: An introduction (John Wiley and Sons, New York, 1997).

45. R. Doerffer, “Analysis of the signal/noise and the water leaving radiance finnish lakes,” Tech. Rep., Brockmann Consult (2008).

46. H. Neckel and D. Labs, “Improved data of solar spectral irradiance from 0.33 to 1.25 mm,” Solar Physics 74, 231–249 (1981). [CrossRef]

47. D. Doxaran, M. Babin, and E. Leymarie, “Near-infrared light scattering by particles in coastal waters,” Opt. Express 15, 12834–12849 (2007). [CrossRef] [PubMed]

48. J. Werdell, B. Franz, S. Bailey, L. Harding, and G. Feldman, “Approach for the long-term spatial and temporal evaluation of ocean color satellite data products in a coastal environment,” in “Proceedings of SPIE, the International Society for Optical Engineering,” Vol. 6680 of Coastal ocean remote sensing, (2007), pp. 66800G.1–66800G.12.

49. A. Morel and B. Gentili, “Diffuse reflectance of oceanic waters II. bidirectional aspects,” Appl. Opt. 32, 6864–6879 (1993). [CrossRef] [PubMed]

50. A. Morel and B. Gentili, “Diffuse reflectance of oceanic waters .3. Implication of bidirectionality for the remote-sensing problem,” Appl. Opt. 35, 4850–4862 (1996). [CrossRef] [PubMed]

51. A. Bricaud, M. Babin, A. Morel, and H. Claustre, “Variability in the chlorophyll-specific absorption coefficients of naturnal phytoplankton: Analysis and parameterization,” J. Geophys. Res. 100, 13,321–13,332 (1995). [CrossRef]

52. A. Bricaud, A. Morel, M. Babin, K. Allali, and H. Claustre, “Variations of light absorption by suspended particles with chlorophyll a concentration in oceanic (case 1) waters: Analysis and implications for bio-optical models,” J. Geophys. Res. 103 (1998). [CrossRef]

53. K. Carder, F. Chen, Z. Lee, S. Hawes, and D. Kamykowski, “Semianalytical moderate-resolution imaging spectrometer algorithms for chlorophyll-a and absorption with bio-optical domains based on nitrate-depletion temperature,” J. Geophys. Res. 104, 5403–5421. (1999). [CrossRef]

54. A. Whitmire, E. Boss, T. Cowls, and W. Pegau, “Spectral variability of particulate backscattering ratio,” Opt. Express 15, 7019–7031 (2007). [CrossRef] [PubMed]

55. D. Doxaran, K. Ruddick, D. McKee, B. Gentili, D. Tailliez, M. Chami, and M. Babin, “Spectral variations of light scattering by marine particles in coastal waters, from visible to near infrared,” Limnology and Oceanography 54, 1257–1271 (2009). [CrossRef]

56. M. Sydor, R. Gould, R. Arnone, V. Haltrin, and W. Goode, “Uniqueness in remote sensing of the inherent optical properties of ocean water,” Appl. Opt. 43, 2156–2162 (2004). [CrossRef] [PubMed]

IOPs, n = 500	fr	slope	intercept	RMSE	bias	R²
a _chla(440)	1	1.053	0.313	0.322	-0.160	0.963
a _dg(440)	1	1.134	0.159	0.451	0.150	0.965
a(440)	1	1.086	0.144	0.189	0.003	0.992
b _b,spm(550)	1	0.977	-0.161	0.150	0.058	0.991

wavelength [nm]	400	440	490	560	670	690
MERIS-NER noise±[%]	4.330	2.097	1.786	1.594	5.629	5.408
std of IOCCG [sr^-1]	0.390	0.324	0.266	0.498	0.199	0.224
introduced noise ±[%]	32.69	32.11	29.20	33.19	32.28	32.93

IOPs, n = 500	fr	slope	intercept	RMSE	bias	R²
a_chla (440)	1	1.499	1.665	1.168	-0.221	0.712
a_dg (440)	1	1.398	0.325	1.264	0.592	0.834
a (440)	1	1.191	0.361	0.423	-0.035	0.963
b_b,spm(550)	1	1.284	1.204	0.752	0.092	0.846

IOPs	n	fr	slope	intercept	RMSE	bias	R²
a_chla (440)	1279	1	0.955	-0.306	0.644	0.175	0.808
a_dg (440)	1126	1	1.196	-0.261	1.085	0.792	0.807
a (440)	1125	1	1.036	-0.312	0.549	0.382	0.913
b_b,spm (550)	369	0.951	1.344	2.025	0.363	-0.108	0.726

IOPs	n	fr	slope	intercept	RMSE	bias	R²
a_chla (440)	132	1	0.954	-0.207	0.459	0.064	0.866
a_dg (440)	131	1	0.849	-1.006	1.009	0.529	0.704
a (440)	131	1	0.938	-0.392	0.445	0.2531	0.912
b_b,spm (550)	29	1	1.165	0.932	0.250	-0.028	0.491

Stochastic inversion of ocean color data using the cross-entropy method

Abstract

1. Introduction

2. Semi-analytical ocean color model

3. Inversion of ocean color data using the cross-entropy method

3.1. Entropy and cross-entropy

3.2. Inversion

3.3. Algorithm

4. Materials and analysis

4.1. Data sets

4.2. Analysis

5. Performance

5.1. IOCCG data set

5.2. Satiability to noise

6. Validation

6.1. NOMAD data set

6.2. SeaWiFS match-up data set

7. Discussions

7.1. Performance with simulated data

7.2. Stability

7.3. Validation

7.4. Uncertainty

7.5. Convergence and processing time

7.6. Limitation

7.7. Algorithm extension

8. Conclusions

A. Initial values

Acknowledgement

References and links

Cited By

Figures (5)

Tables (7)

Equations (25)

Optics Express

data set	μ	σ	min	max
IOCCG	11	5	3	29
noisy-IOCCG	15	8	4	80
NOMAD	17	6	4	91
SeaWiFS match-up	12	3	6	23

data set	μ	σ	min	max
IOCCG	2.121	0.600	1.174	4.625
noisy-IOCCG	3.150	1.146	1.362	10.947
NOMAD	1.017	0.369	0.500	3.636
SeaWiFS match-up	0.762	0.202	0.493	1.289