Expand this Topic clickable element to expand a topic
Skip to content
Optica Publishing Group
  • Applied Spectroscopy
  • Vol. 67,
  • Issue 11,
  • pp. 1332-1338
  • (2013)

Agricultural Case Studies of Classification Accuracy, Spectral Resolution, and Model Over-Fitting

Not Accessible

Your library or personal account may give you access

Abstract

This paper describes the relationship between spectral resolution and classification accuracy in analyses of hyperspectral imaging data acquired from crop leaves. The main scope is to discuss and reduce the risk of model over-fitting. Over-fitting of a classification model occurs when too many and/or irrelevant model terms are included (i.e., a large number of spectral bands), and it may lead to low robustness/repeatability when the classification model is applied to independent validation data. We outline a simple way to quantify the level of model over-fitting by comparing the observed classification accuracies with those obtained from explanatory random data. Hyperspectral imaging data were acquired from two crop-insect pest systems: (1) potato psyllid (<i>Bactericera cockerelli</i>) infestations of individual bell pepper plants (<i>Capsicum annuum</i>) with the acquisition of hyperspectral imaging data under controlled-light conditions (data set 1), and (2) sugarcane borer (<i>Diatraea saccharalis</i>) infestations of individual maize plants (<i>Zea mays</i>) with the acquisition of hyperspectral imaging data from the same plants under two markedly different image-acquisition conditions (data sets 2a and b). For each data set, reflectance data were analyzed based on seven spectral resolutions by dividing 160 spectral bands from 405 to 907 nm into 4, 16, 32, 40, 53, 80, or 160 bands. In the two data sets, similar classification results were obtained with spectral resolutions ranging from 3.1 to 12.6 nm. Thus, the size of the initial input data could be reduced fourfold with only a negligible loss of classification accuracy. In the analysis of data set 1, several validation approaches all demonstrated consistently that insect-induced stress could be accurately detected and that therefore there was little indication of model over-fitting. In the analyses of data set 2, inconsistent validation results were obtained and the observed classification accuracy (81.06%) was only a few percentage points above that obtained using random data (66.7-77.4%). Thus, our analysis highlights a potential risk of model over-fitting and emphasizes the importance of testing for this important aspect as part of developing reliable and robust classification models.

PDF Article
More Like This
Using continous wavelet analysis for monitoring wheat yellow rust in different infestation stages based on unmanned aerial vehicle hyperspectral images

Qiong Zheng, Wenjiang Huang, Huichun Ye, Yingying Dong, Yue Shi, and Shuisen Chen
Appl. Opt. 59(26) 8003-8013 (2020)

Robustness of analyses of imaging data

Christian Nansen
Opt. Express 19(16) 15173-15180 (2011)

A moderate-spectral-resolution transmittance model based on fitting the line-by-line calculation

Heli Wei, Xiuhong Chen, Ruizhong Rao, Yingjian Wang, and Ping Yang
Opt. Express 15(13) 8360-8370 (2007)

Cited By

You do not have subscription access to this journal. Cited by links are available to subscribers only. You may subscribe either as an Optica member, or as an authorized user of your institution.

Contact your librarian or system administrator
or
Login to access Optica Member Subscription

Select as filters


Select Topics Cancel
© Copyright 2024 | Optica Publishing Group. All rights reserved, including rights for text and data mining and training of artificial technologies or similar technologies.