We study a cascade of linear shift-invariant processing modules (correlators), each augmented with a nonlinear threshold as a means to increase the performance of high-speed optical pattern recognition. This configuration is a special class of multilayer, feed-forward neural networks and has been proposed in the literature as a relatively fast best-guess classifier. However, it seems that, although cascaded correlation has been proposed in a number of specific pattern recognition problems, the importance of the configuration has been largely overlooked. We prove that the cascaded architecture is the exact structure that must be adopted if a multilayer feed-forward neural network is trained to produce a shift-invariant output. In contrast with more generalized multilayer networks, the approach is easily implemented in practice with optical techniques and is therefore ideally suited to the high-speed analysis of large images. We have trained a digital model of the system using a modified backpropagation algorithm with optimization using simulated annealing techniques. The resulting cascade has been applied to a defect recognition problem in the canning industry as a benchmark for comparison against a standard linear correlation filter, the minimum average correlation energy (MACE) filter. We show that the nonlinear performance of the cascade is a significant improvement over that of the linear MACE filter in this case.
© 2001 Optical Society of AmericaFull Article | PDF Article