
Classification of steel using laser-induced breakdown spectroscopy combined with deep belief network


Abstract

The identification of steels is a crucial step in the process of recycling and reusing steel waste. Laser-induced breakdown spectroscopy (LIBS) coupled with machine learning is a convenient method for classifying material types. LIBS can generate characteristic spectra of various samples in real time as input variables for steel classification. However, the performance of a classification model is limited by the complexity of the input, owing to the similar chemical compositions of the samples and the nonlinearity between spectral intensities and elemental concentrations. In this study, we developed a method combining LIBS with a deep belief network (DBN), which is well suited to nonlinear problems, to classify 13 brands of special steel. The performance on the training and validation sets was used as the criterion for optimizing the structure of the DBN. For different inputs, namely the intensities of full-spectra signals and of characteristic spectral lines, the accuracies of the optimized DBN model on the training, validation, and test sets were all over 98%. Moreover, compared with self-organizing maps (SOM), linear discriminant analysis (LDA), k-nearest neighbor (KNN), and back-propagation artificial neural networks (BPANN), the optimized DBN model performed second best (98.46%) on the test set among all methods when characteristic spectral lines were used as input. With full-spectra signals as input, the test accuracy of the DBN model reached 100%, whereas the maximum accuracies of the other methods ranged from 62.31% to 96.16%. This study demonstrates that DBN can extract representative feature information from high-dimensional input, and that LIBS coupled with DBN has great potential for steel classification.

© 2022 Optica Publishing Group under the terms of the Optica Open Access Publishing Agreement

1. Introduction

Steel is an important material applied in many industrial fields and can be made into different alloys with special properties to meet various industrial needs. Worldwide consumption of crude steel amounted to 1,878 million tons in 2020, with an average growth rate of 3% over the last five years [1]. A large amount of steel scrap has been generated as a result of this high consumption. The classification rate of Cu, Al, and Mg materials is only 86–95% using an optical sorting method [2]. Traditional steel sorting has low precision and is time-consuming, which will affect the quality of steel products made from scrap in the future [3]. To save steel resources and ensure product quality, rapid identification of steel is a crucial step in the process of recycling and reusing steel waste. However, traditional analytical methods, such as atomic absorption spectroscopy (AAS) [4], inductively coupled plasma-mass spectrometry (ICP-MS) [5], and spark discharge optical emission spectroscopy (SD-OES) [4,5], require relatively long analysis times, which is not conducive to the rapid detection of steel in industrial fields [6]. Therefore, a more convenient, precise, and rapid classification method for steel is necessary.

Laser-induced breakdown spectroscopy (LIBS) is an atomic emission spectroscopy technique that uses a laser pulse to create a plasma; the composition of a sample can then be analyzed from the emission lines of the plasma [7,8]. LIBS has great potential for real-time detection in industrial fields because of its many advantages, including little or no sample pretreatment, fast analysis speed, multi-element analysis capability, and minimal damage to samples [9,10]. In recent years, several research groups have combined LIBS with machine learning (ML) methods for material classification and obtained meaningful results [11]. For example, Zhang et al. developed LIBS assisted by random forest (RF) to detect nine steel grades and achieved high performance [12]. Lin et al. used partial least-squares discriminant analysis (PLS-DA) and support vector machine (SVM) classification methods to identify 40 steel alloys, with identification accuracies of 96.25% and 95%, respectively [13]. Tang et al. used LIBS combined with self-organizing maps (SOM) and the K-means algorithm to identify 20 kinds of polymers with an accuracy of 99.2% [14]. During PLS-DA classification model training, Kim et al. applied baseline removal and root-mean-square-based normalization (BR-RMSN) to mitigate the noise in LIBS signals, thereby improving the classification accuracy from 91.2% to 95.5% for all five metal types [15]. Eden et al. proposed a metal scrap classification method based on LIBS coupled with pre-cluster-based regression and PLS-DA for same-base alloys; the results indicated that the proposed method outperforms conventional classification algorithms [16]. Campanella et al. applied back-propagation artificial neural networks (BPANN) to LIBS analysis of non-ferrous automotive scrap and obtained satisfactory performance for the on-line classification of Al alloys [17]. Evelyne et al. used soft independent modeling of class analogy (SIMCA) to classify alloy species, and the results indicated that the model was robust [18]. Jeyne et al. achieved discrimination of 80 metal samples by applying a multivariate normalization method combined with k-nearest neighbor (KNN) [19]. However, a limitation remains: most of the aforementioned methods need to identify the composition of the samples in advance in order to determine and select feature information as input, using various selection principles.

In general, classification accuracy is greatly affected by the quality of the input. However, some uncertainty factors of the spectral signal remain in the sample measurement process, such as self-absorption [20] and spectral interference [21]. Meanwhile, only nuanced differences are observed in the intensities of spectral lines because of the compositional similarity of steel products, which increases the uncertainty of LIBS signals; this uncertainty may lead to poor accuracy of a multivariable model [22]. In some cases, to avoid the problems of difficult spectral line selection, spectral interference, and insufficient input information, LIBS full-spectra signals with a large number of available variables have been used as input for classification models [23]. For example, Soichi et al. proposed a signal preprocessing method that transforms the full spectra of pelletized hydrothermal deposits into an appropriate form as input for an artificial neural network (ANN), improving the identification accuracy to 90.1% [24]. However, besides the spectral lines of the analyzed elements, full-spectra signals also contain much redundant information, such as spectral noise and background, which may increase the difficulty of extracting feature information. Thus, the complex nonlinear properties of LIBS signals need to be handled by an ML method with strong adaptive capacity and high precision.

Deep learning can automatically extract important feature information from large and complex datasets by utilizing hierarchical architectures, which makes it a more accurate and efficient approach [25]. As a representative deep learning method, the deep belief network (DBN) is well suited to nonlinear classification problems in industrial applications, with performance that depends on the model structure. Composed of several restricted Boltzmann machines (RBMs), a DBN is a randomly initialized neural network that can learn a probability distribution from input data [26]. In general, adding hidden layers allows high-level features to be learned from the input and improves DBN performance, but high-level features are not easy to encode in a DBN with deeper hidden layers [27], which may increase the computational cost and degrade classification performance. Recently, Vrábel et al. compared the performance of principal component analysis (PCA) and RBM for dimension reduction on a large dataset of soil powders mixed with gypsum; the results showed that RBM outperforms PCA in terms of shorter training time, applicability to large datasets, and extensibility [28]. Zhao et al. applied LIBS combined with DBN to classify contaminated soil and achieved satisfactory classification [29]. However, to the best of our knowledge, publications on LIBS combined with the DBN algorithm for steel classification are rare.

In this study, the structure of the DBN model was designed and optimized to improve classification accuracy. To evaluate the performance of LIBS coupled with DBN, the prediction accuracies for 13 steel types obtained by DBN were compared with those of other methods (SOM, LDA, KNN, and BP) when full-spectra signals (8400 variables) were used as input. The same evaluation procedure was then conducted using the intensities of characteristic spectral lines (84 variables) as input. The results for the two different inputs indicate that LIBS coupled with DBN has good application prospects in steel classification.

2. Experimental setup and methodology

2.1 Experimental setup

The schematic of the mobile LIBS setup used in this study is illustrated in Fig. 1(a). A compact Q-switched Nd:YAG laser (Ultra 50, Bigsky Co. Ltd., US) was operated at a center wavelength of 532 nm and a repetition rate of 10 Hz, with a maximum energy of 29 mJ per pulse and a pulse duration of 7 ns. The laser beam is delivered through a special optical fiber (core diameter: 1 mm), reflected by a dichroic mirror, and then focused onto the surface of the sample by a plano-convex lens (focal length = 100 mm, Φ = 25.4 mm) to generate a plasma. The emission of the LIBS plasma was collected into a compact spectrometer (AvaSpec-ULS2048-6-USB2) by an optical fiber (core diameter: 400 µm). The spectrometer, equipped with six 2048-pixel CCD cameras (Sony 554), has six channels with a spectral resolution of 0.08 ∼ 0.11 nm in the wavelength range of 291 ∼ 1020 nm. The delay time between the laser pulse and the spectrometer was set to 1.3 µs, and an integration time of 1.1 ms was used in this work. The mobile LIBS setup, shown in Fig. 1(b), is composed of a main case and a probe. The compact laser, spectrometer, circuit system, optical system, control system, and power system are integrated into the main case (size: 40 × 50 × 70 cm, weight: 50 kg). The probe (size: 22 × 15 × 10 cm, weight: 2 kg) contains the lens, optical fibers, and dichroic mirror. Computational environment: Intel Core i5-6200U CPU (2.30 GHz) with 8 GB of memory.


Fig. 1. LIBS setup: (a) schematic and (b) prototype


2.2 Sample preparation

A total of 39 samples from 13 different brands of special steel (Inner Mongolia North Heavy Industries Group Corp. Ltd.) were used in this study. The matrix element of all samples is Fe, with a concentration higher than 90%. The reference concentrations of trace elements in the 13 steel brands are listed in Table 1, as measured by the manufacturer's physicochemical laboratory using inductively coupled plasma-atomic emission spectrometry (ICP-AES) and ICP-MS. For each sample, 30 spectra were collected from different positions on the surface, and each spectrum was an average of 10 measurements, resulting in 1170 spectra from the 39 samples of the 13 steel brands for LIBS analysis.


Table 1. Concentration information (wt.%) of certified elements in 13 brands of steel tube

2.3 Deep belief network (DBN)

A DBN is trained in two procedures: (1) training RBMs layer by layer to construct the deep network preliminarily, and (2) fine-tuning the weight parameters of the entire network by back propagation (BP). The structure of the DBN is adjusted according to the features of the input data. The number of neurons in the output layer is equal to the number of steel brands. The schematic of the DBN is shown in Fig. 2(a). The DBN model was implemented in MATLAB ver. R2017a (MathWorks Corporation, USA).


Fig. 2. Schematic of (a) DBN and (b) RBM


Input: The intensity value at each wavelength of LIBS data is used as an input variable.

Pre-training: The purpose of the pre-training procedure is to obtain proper initial weights for the entire network. The DBN, which utilizes RBMs as learning modules, represents a significant advance in deep learning [26]. The RBM is an energy-based model directly inspired by statistical physics. It is a two-layer network with visible neurons vi = {0,1} and hidden neurons hj = {0,1}. The schematic of the RBM is shown in Fig. 2(b). The energy function of the RBM can be written as follows:

$$E(v,h) = -\sum_{i=1}^{n} a_i v_i - \sum_{j=1}^{m} b_j h_j - \sum_{i=1}^{n}\sum_{j=1}^{m} W_{ij} v_i h_j$$
where Wij (i = 1, 2, . . ., n; j = 1, 2, . . ., m) is the weight between vi and hj; n and m are the numbers of visible and hidden neurons, respectively; and ai and bj are the biases of vi and hj, respectively. The RBM is trained to reconstruct the inputs and minimize the reconstruction error between the original and reconstructed inputs. In addition, Hinton et al. [30,31] introduced a method called contrastive divergence (CD), which requires only k Gibbs sampling steps to reach an approximate convergence state when training the RBM and thus reduces the reconstruction error effectively. In the training step, the feature information from the previous RBM is used as input to the next RBM. Each RBM layer is trained separately to ensure that feature information is retained as much as possible when feature variables are mapped to different feature spaces. For LIBS data, pre-training the DBN can prevent overfitting and enhance model generalization.
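To make the pre-training step concrete, the sketch below implements the RBM energy function of Eq. (1) and one contrastive-divergence (CD-1) update for a binary RBM. This is an illustrative Python/NumPy sketch: the paper's model was implemented in MATLAB, and the function names and learning rate here are our own choices, not the authors' code.

```python
import numpy as np

rng = np.random.default_rng(0)

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def rbm_energy(v, h, W, a, b):
    # E(v, h) = -sum_i a_i v_i - sum_j b_j h_j - sum_ij W_ij v_i h_j
    return -(a @ v) - (b @ h) - (v @ W @ h)

def cd1_step(v0, W, a, b, lr=0.1):
    """One contrastive-divergence (CD with k = 1) update for a binary RBM.

    v0: binary visible vector (n,); W: weights (n, m); a, b: biases.
    Updates W, a, b in place and returns the reconstruction error.
    """
    # Positive phase: hidden activations given the data
    ph0 = sigmoid(b + v0 @ W)
    h0 = (rng.random(ph0.shape) < ph0).astype(float)
    # Negative phase: one-step reconstruction of the visible units
    pv1 = sigmoid(a + W @ h0)
    v1 = (rng.random(pv1.shape) < pv1).astype(float)
    ph1 = sigmoid(b + v1 @ W)
    # Gradient approximation: <v h>_data - <v h>_reconstruction
    W += lr * (np.outer(v0, ph0) - np.outer(v1, ph1))
    a += lr * (v0 - v1)
    b += lr * (ph0 - ph1)
    return np.mean((v0 - pv1) ** 2)  # reconstruction error
```

In a stacked DBN, the hidden activations `ph0` of one trained RBM would serve as the visible input of the next.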

Fine-tuning: As described above, a proper initialization of the DBN model is obtained in the pre-training step. To use the DBN as a classifier, an output layer is added to predict the labels of the training set. Then, all parameters in the different layers of the network are adjusted by a supervised learning algorithm such as BP. The dimension of the LIBS spectra is reduced by the DBN to obtain representative feature information from the training set, and BP is then applied to the classification model, which helps improve the classification accuracy. This fine-tuning process is performed over the training set for a certain number of epochs or until the classification accuracy no longer improves. Through this mechanism, features with high correlations between neurons in the lower layers can be captured in deeper layers of the DBN model [32], which is beneficial for extracting important feature information from large and complex sets of LIBS data.

Compared with an ANN, a DBN uses layer-by-layer unsupervised learning to pre-train the network, which helps set proper initial weights. The DBN thus has the advantage of improving model performance by avoiding over-fitting and enhancing generalization [33]. However, optimizing the structure of the DBN model is difficult. The calculation time is significantly influenced by the quality of the input and the structure of the DBN model, and the numbers of layers and neurons must be adjusted according to the features of the LIBS data. Therefore, to improve modeling efficiency, the number of hidden layers and the number of neurons in each layer are optimized in the following sections to establish a suitable DBN model.

3. Results and discussion

3.1 Data pre-treatment

As mentioned in Section 2.1, the LIBS full spectra range from 291 nm to 1020 nm, and each spectrum contains 11,022 intensity variables. The LIBS spectra are shown in Fig. 3; the spectral lines have weak intensity in the wavelength range above 800 nm. To remove irrelevant variables from the LIBS data and shorten the training time, the spectrum with 8400 variables in the range of 291 ∼ 797 nm is used as the full-spectra signal. To avoid obvious order-of-magnitude differences among input variables, the intensities of these signals were normalized to the 0–1 range. The normalization formula is as follows:

$$Y = (X - {X_{\min }})/({X_{\max }} - {X_{\min }})$$
where X and Y are the original and normalized values of the spectral line intensity, and Xmax and Xmin are the maximum and minimum values of X, respectively. One-ninth of the spectra of each steel brand was randomly chosen as the test set, and the remaining spectra were randomly divided into training and validation sets in a 4:1 proportion. The spectra from different steel samples were evenly distributed across the datasets. Figures 4(a) and (b) show an LIBS spectrum with specified lines and the first two principal components (PCs) from the training sets of the 13 brands, respectively. The first two PCs accounted for 79.81% of the total variability. It can be seen that the spectra of the different steel samples are difficult to classify. Accordingly, 832 spectra (training set), 208 spectra (validation set), and 130 spectra (test set) were used to evaluate the performance of the DBN classification model. The results were obtained after five-fold cross-validation.
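The normalization of Eq. (2) and the per-brand split described above (one-ninth as test set, the remainder divided 4:1 into training and validation; each brand contributes 3 samples × 30 spectra = 90 spectra, giving 832/208/130 overall) can be sketched as follows. This is an illustrative Python sketch; the helper names and random seed are our own, not the authors' code.

```python
import numpy as np

def minmax_normalize(spectrum):
    """Eq. (2): Y = (X - Xmin) / (Xmax - Xmin), scaling intensities to [0, 1]."""
    x_min, x_max = spectrum.min(), spectrum.max()
    return (spectrum - x_min) / (x_max - x_min)

def split_per_brand(n_spectra=90, seed=0):
    """Split one brand's spectra: 1/9 test, remaining divided 4:1 train/val.

    For 90 spectra per brand this yields 64 training, 16 validation,
    and 10 test spectra (x13 brands: 832 / 208 / 130).
    """
    rng = np.random.default_rng(seed)
    idx = rng.permutation(n_spectra)
    n_test = n_spectra // 9           # 10 test spectra
    test = idx[:n_test]
    rest = idx[n_test:]               # 80 remaining spectra
    n_val = len(rest) // 5            # 4:1 split -> 16 validation spectra
    return rest[n_val:], rest[:n_val], test   # train, val, test indices
```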


Fig. 3. LIBS spectra of various brands of special steel



Fig. 4. (a) LIBS spectrum with specified lines; (b) the score of the first two PCs from the training sets


3.2 Selection of input variables

The classification depends on differences either in the LIBS full spectra or in several spectral lines corresponding to the composition of the steels. However, the LIBS full spectra often contain much unnecessary redundant information, which degrades the quality of the input data and makes rapid classification more difficult. Moreover, Shin et al. [34] showed that, compared with feature information extracted from full-spectra signals by PCA, using the peak values of selected characteristic spectral lines as input could reduce the modeling time by a factor of 20 or more without losing any precision.

Thus, in this study, we compare the performance of the classification model under two different inputs (full-spectra signals and characteristic spectral lines at peak values). The results are used as a criterion to determine the suitable input for the LIBS-DBN model for steel classification. Regarding the selection of spectral lines, the lines of the matrix element Fe differ little between samples and are not helpful for classifying sample brands. In addition, the spectral peak intensities of Fe are much higher than those of the trace elements in LIBS measurements. To avoid obvious order-of-magnitude differences among input variables, Fe lines are not used in this work. Based on the National Institute of Standards and Technology (NIST) database, and considering little or no self-absorption and few spectral interferences, the intensities of 17 detected characteristic spectral lines of trace elements (including 5 data points around each spectral line) are selected as input variables, and the data dimension is reduced to 84. The spectral lines used for the classification are listed in Table 2.
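The windowing scheme described above (a few data points around each characteristic line) can be sketched as follows. The line-center wavelengths and window half-width below are placeholders for illustration, not the exact values of Table 2.

```python
import numpy as np

def extract_line_windows(wavelengths, intensities, line_centers, half_width=2):
    """Take a 5-point window (center pixel +/- 2) around each characteristic line.

    wavelengths, intensities: the full measured spectrum;
    line_centers: characteristic line wavelengths in nm (placeholders here).
    """
    features = []
    for line in line_centers:
        # nearest pixel to the tabulated line wavelength
        c = int(np.argmin(np.abs(wavelengths - line)))
        features.extend(intensities[c - half_width : c + half_width + 1])
    return np.asarray(features)
```

Concatenating the windows of all selected lines yields the low-dimensional input vector used in place of the 8400-variable full spectrum.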


Table 2. Main spectral lines of special steels

3.3 Influence of various DBN constructions

The DBN model can extract features that are more representative than the original input [35]. Under the limitations of the computer hardware, the performance of the DBN model is closely related to the numbers of hidden layers and of neurons per hidden layer. However, selecting the optimal parameters for a DBN is difficult, and the selection is heuristic or based on previous experience [28]. To explore the influence of the numbers of neurons and hidden layers on the DBN model, the accuracies on the three datasets and the training time, obtained after five-fold cross-validation, were adopted to evaluate model performance.

The number of hidden layers (1, 2, 3, 4, and 5) and the number of neurons (30, 40, 50, 60, and 70) in one hidden layer were set in turn for each experiment. The other parameters, namely the number of epochs, the momentum, and the learning rate, were set to 200, 0.05, and 0.5, respectively.
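The structure sweep described above can be organized as a simple loop over candidate widths with the stated hyperparameters (200 epochs, momentum 0.05, learning rate 0.5). The `train_dbn` and `evaluate` callables below are hypothetical stand-ins for the paper's MATLAB implementation; only the loop structure and hyperparameter values come from the text.

```python
import time

def grid_search(train_dbn, evaluate, X_tr, y_tr, X_val, y_val):
    """Sweep hidden-layer widths for a one-hidden-layer DBN.

    train_dbn(hidden_layers, X, y, epochs=..., momentum=..., learning_rate=...)
    and evaluate(model, X, y) are hypothetical stand-ins supplied by the caller.
    Returns {n_hidden: (validation_accuracy, training_time_s)}.
    """
    results = {}
    for n_hidden in (30, 40, 50, 60, 70):
        t0 = time.time()
        model = train_dbn([n_hidden], X_tr, y_tr,
                          epochs=200, momentum=0.05, learning_rate=0.5)
        results[n_hidden] = (evaluate(model, X_val, y_val), time.time() - t0)
    return results
```

The same loop, run over layer counts instead of widths, covers the hidden-layer experiment of Fig. 5(b).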

The performance of the DBN model with one hidden layer was evaluated for different numbers of neurons (30, 40, 50, 60, and 70), as shown in Fig. 5(a). The uncertainty of the results was obtained from the five-fold cross-validation and is represented by the error bars. Under some parameter settings, the calculated results do not change and the error bar is 0; the probable reason is that, after training with a large number of epochs, the DBN model reaches a classification accuracy that can hardly be optimized further. As the number of neurons in the hidden layer increased, the accuracy of the model generally improved while the training time increased. However, the DBN is constructed from RBMs, and the RBM is a probability distribution model. Owing to the randomness of the network during computation, the DBN model with 50 neurons might become trapped in relatively poor local minima, which led to a longer training time than the models with 60 and 70 neurons. The best accuracy across all datasets (training, validation, and test sets) was 99.61%, with a training time of 216.68 s. When the number of neurons exceeded 60, the performance of the DBN model hardly changed as the number of neurons grew. Increasing the number of neurons in the first hidden layer is beneficial for building a large network. Generally, the classification performance of the DBN model depends on the size of the network structure, i.e., the numbers of hidden neurons and hidden layers. A DBN model with a large and complex network structure can achieve high classification accuracy on training data but leads to a longer training time. Considering the dimension-reduction performance and the reliability of the DBN model, the number of neurons in the first hidden layer was set to 70 to guarantee that more representative features can be extracted from the high-dimensional full-spectra signals by a multilayer DBN model.


Fig. 5. Optimization of DBN model of different parameters: the number of (a) neurons and (b) hidden layers


Additionally, we evaluated different DBN structures (8400-70-13, 8400-70-40-13, 8400-70-60-40-13, 8400-70-60-40-30-13, and 8400-70-60-40-40-30-13) on the same datasets; the results are shown in Fig. 5(b). With the 8400-70-40-13 structure, the best accuracy (99.92%) was obtained and the training time was the shortest (234.6 s). Meanwhile, a larger number of hidden layers not only increases the training time but may also lead to unsatisfactory predictability of the deep network; for example, the accuracy of the DBN with 5 hidden layers decreased to 95.3%. A possible reason is that more hidden layers significantly increase the number of parameters and the difficulty of parameter adjustment during training, making it hard for the deep network to reach a proper convergence state. To balance accuracy and training time, the number of hidden layers was set to 2. In summary, compared with the best accuracies of the DBN with 1 hidden layer on the training, validation, and test sets, the accuracies of the DBN with 2 hidden layers improved from 99.69%, 99.61%, and 99.23% to 99.92%, 99.9%, and 100%, respectively. Moreover, the training time of the DBN with 2 hidden layers was not significantly longer than that with 1 hidden layer. The reason may be that the training time, most of which is spent in the fine-tuning stage, is influenced by the size of the DBN structure: although a larger network has more parameters, a more suitable pre-trained network can reduce the time needed for fine-tuning by the BP algorithm. Therefore, under specific conditions, the training time can decrease even with more neurons and more hidden layers. Using full-spectra data, the DBN with 2 hidden layers has good feature-expression ability, which is beneficial for the classification of unknown samples. Thus, 8400-70-40-13 was selected as the structure of the DBN model in this study.

3.4 Comparison with input of characteristic spectral lines

In this experiment, a comparison was conducted with new input variables, namely the spectral lines listed in Table 2. The same parameter-tuning procedure was followed, with the dimension of the input layer adjusted to 84 (the dimension of the characteristic spectral lines). Because the input dimension is significantly reduced, the training time over 500 epochs decreases to roughly 20 ∼ 30 s, and the differences among configurations are negligible. As shown in Fig. 6(a), the accuracy of the DBN model with one hidden layer reaches 99.88% and is little affected by the number of neurons in the range from 30 to 70. However, Fig. 6(b) shows that a DBN model with multiple hidden layers not only requires many epochs to improve the accuracy but also reduces the maximum accuracy from 99.88% to 91.54%. Moreover, the DBN model with 5 hidden layers is too complex for the low-dimensional input data and cannot converge to a proper state to predict the sample labels. Compared with full-spectra data, using the spectral lines as input allows the DBN model (structure: 84-30-13) to achieve high accuracy with a significantly reduced training time, which is beneficial for the classification of known samples with certified elements.


Fig. 6. Optimization of DBN model by using characteristic spectral lines as input: the number of (a) neurons and (b) hidden layers


The best result of the DBN model on the test set is presented in Fig. 7. The circles represent the actual labels, and the scatter points represent the labels predicted by the DBN model. The result shows that the spectra of 42CrMo (sample label: 10) and 42CrMoA (sample label: 5) were easily misclassified as each other. 42CrMoA is an upgraded version of 42CrMo and can be considered the same steel brand. The probable reason is that 42CrMo and 42CrMoA are the most similar types listed in Table 1 and have similar contents of the main trace elements (Mn, Cr, and Cu), which have rich spectral lines, making classification difficult.


Fig. 7. Best prediction of DBN model for the test set


3.5 Comparison with other ML methods

To further evaluate the predictive ability of the DBN for steel classification, its results were compared with those of other ML models (KNN, SOM, BP, and LDA), all of which are widely applied in LIBS analysis. The k value and distance metric of the KNN model were set to 10 and Euclidean, respectively. The number of neurons and iterations of the SOM model were set to 36 and 1000, respectively. The network structure of the BP model was set to 8400-70-40-13 and 84-30-13 for the two inputs, respectively. The Mahalanobis distance was used in the LDA model to classify the spectra. Then, 5-fold cross-validation was conducted, and the cross-validation results of the other methods for the two different inputs (full spectra: 8400 variables; spectral lines: 84 variables) are shown in Table 3. Using spectral lines as input, the accuracies of the other classification models improved significantly, as shown by the test results in Fig. 8. Figure 8(a) presents the result of the SOM, in which 16 spectra of 4 steel brands were misclassified; the test accuracy reached 87.69%, the worst performance among all methods. The accuracy of the LDA model improved to 92.31%, with 10 spectra of 3 steel brands misclassified, as shown in Fig. 8(b). In Figs. 8(c) and (d), the results of the KNN and BP models show that 1 spectrum and 2 spectra of 2 steel brands were misclassified, respectively. Although KNN reached high accuracy, the single misidentified spectrum of 15NiCuMoNb5 was labeled as 25#, even though the two brands have obvious differences in the contents of Mn (1.05% and 0.65%), Ni (1.17% and 0%), and Cu (0.605% and 0%). This suggests that KNN may discriminate poorly among certain steel brands. In particular, BP only confused the spectra of 42CrMoA and 42CrMo, a result consistent with the DBN model.
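As a point of reference for the KNN baseline (k = 10, Euclidean distance), a minimal majority-vote implementation looks like the following. This is an illustrative sketch, not the authors' code.

```python
import numpy as np

def knn_predict(X_train, y_train, X_test, k=10):
    """k-nearest-neighbour majority vote with Euclidean distance (k = 10 as in the text)."""
    preds = []
    for x in X_test:
        d = np.linalg.norm(X_train - x, axis=1)   # Euclidean distances to all training spectra
        nearest = y_train[np.argsort(d)[:k]]      # labels of the k closest spectra
        vals, counts = np.unique(nearest, return_counts=True)
        preds.append(vals[np.argmax(counts)])     # most common label among the neighbours
    return np.array(preds)
```

Because the vote is purely distance-based, spectra of compositionally distinct brands can still be confused when their overall intensity patterns are close, consistent with the 15NiCuMoNb5/25# misidentification noted above.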


Fig. 8. Prediction of other ML methods: (a) SOM, (b) LDA (c) KNN, and (d) BP



Table 3. Performance of different methods in two inputs

Figure 8 and Table 3 show that when characteristic spectral lines were selected as input, other ML methods, such as KNN and BP, also achieved good classification, and the performance of the DBN was not significantly different. When the full spectra were used as input, the DBN classified the test set perfectly, whereas the accuracy range of the other ML methods decreased from 87.69% ∼ 99.23% to 62.31% ∼ 96.16%. A similar phenomenon, in which identification models perform better with full spectra than with selected lines as input, has been reported in other studies [36,37]. The reason may be that the subtle differences between similar steel samples, such as unique matrix effects, can only be found in the full spectra with all variables, thereby leading to better performance [37]. These results demonstrate that the DBN can extract representative information from complex, high-dimensional LIBS datasets and has a strong ability to classify special steels with similar compositions. In summary, the DBN model achieved the best classification accuracy of all the methods compared.

4. Conclusions

In this study, a novel method for identifying 13 steel brands using mobile LIBS coupled with DBN was developed and applied. To improve modeling efficiency, different DBN structures were optimized and their identification performance evaluated. The optimization results show that the number of neurons in the first hidden layer has little influence on accuracy, and that classification performance can be improved by selecting an appropriate number of hidden layers for the DBN model; conversely, depending on the input dimension, too many hidden layers reduce accuracy and lengthen the training time. Using the two inputs (full-spectra signals and characteristic spectral lines), after five-fold cross-validation, the test accuracies of the DBN models with the optimized structures (8400-70-40-13 and 84-30-13) reached 100% and 98.46%, with training times of 234.6 s and 18.71 s, respectively. Moreover, compared with the accuracy range (62.31% ∼ 96.16%) of the other ML methods on the test sets, only the DBN model reached an accuracy over 99% on every dataset when the input was full-spectra signals. This result shows that the DBN has strong feature-extraction abilities for complex, high-dimensional LIBS datasets and exhibits better performance for steel classification. Furthermore, as a deep learning method, the DBN can be applied in transfer learning for similar samples by reusing the parameters of previously trained models, improving modeling efficiency in industrial applications, especially online platforms with large datasets.

Funding

Natural Science Foundation of Xiaogan City (XGKJ2021010003); Hubei Provincial Department of Education (T201617); Natural Science Foundation of Hubei Province (2021CFB607); National Natural Science Foundation of China (61705064).

Disclosures

The authors declare that there are no conflicts of interest related to this article.

Data availability

Data underlying the results presented in this paper are not publicly available at this time but may be obtained from the authors upon reasonable request.

References

1. https://worldsteel.org/wp-content/uploads/2021-World-Steel-in-Figures.pdf

2. S. P. Gundupalli, S. Hait, and A. Thakur, “A review on automated sorting of source-separated municipal solid waste for recycling,” Waste Manage. 60, 56–74 (2017). [CrossRef]  

3. S. Dworak, H. Rechberger, and J. Fellner, “How will tramp elements affect future steel recycling in Europe? – A dynamic material flow model for steel in the EU-28 for the period 1910 to 2050,” Resour. Conserv. Recycl. 179, 106072 (2022). [CrossRef]  

4. M. A. Khater, “Laser-induced breakdown spectroscopy for light elements detection in steel: State of the art,” Spectrochim. Acta, Part B 81, 1–10 (2013). [CrossRef]  

5. M. S. Afgan, Z. Y. Hou, and Z. Wang, “Quantitative analysis of common elements in steel using a handheld m-LIBS instrument,” J. Anal. At. Spectrom. 32(10), 1905–1915 (2017). [CrossRef]  

6. R. Noll, C. F. Begemann, M. Brunk, S. Connemann, C. Meinhardt, M. Scharun, V. Sturm, J. Makowe, and C. Gehlen, “Laser-induced breakdown spectroscopy expands into industrial applications,” Spectrochim. Acta, Part B 93, 41–51 (2014). [CrossRef]  

7. R. Noll, C. F. Begemann, S. Connemann, C. Meinhardt, and V. Sturm, “LIBS analyses for industrial applications – an overview of developments from 2014 to 2018,” J. Anal. At. Spectrom. 33(6), 945–956 (2018). [CrossRef]  

8. D. W. Hahn and N. Omenetto, “Laser-Induced Breakdown Spectroscopy (LIBS), Part II: Review of Instrumental and Methodological Approaches to Material Analysis and Applications to Different Fields,” Appl. Spectrosc. 66(4), 347–419 (2012). [CrossRef]  

9. Q. D. Zeng, F. Deng, Z. H. Zhu, Y. Tang, B. Y. Wang, Y. J. Xiao, L. B. Xiong, H. Q. Yu, L. B. Guo, and X. Y. Li, “Portable fiber-optic laser-induced breakdown spectroscopy system for the quantitative analysis of minor elements in steel,” Plasma Sci. Technol. 21(3), 034006 (2019). [CrossRef]  

10. Q. D. Zeng, G. H. Chen, X. G. Chen, B. Y. Wang, B. Y. Wan, M. T. Yuan, Y. Liu, H. Q. Yu, L. B. Guo, and X. Y. Li, “Rapid online analysis of trace elements in steel using a mobile fiber-optic laser-induced breakdown spectroscopy system,” Plasma Sci. Technol. 22(7), 074013 (2020). [CrossRef]  

11. S. Moncayo, J. D. Rosales, R. Izquierdo-Hornillos, J. Anzano, and J. O. Caceres, “Classification of red wine based on its protected designation of origin (PDO) using Laser-induced Breakdown Spectroscopy (LIBS),” Talanta 158, 185–191 (2016). [CrossRef]  

12. T. L. Zhang, D. H. Xia, H. S. Tang, X. F. Yang, and H. Li, “Classification of steel samples by laser-induced breakdown spectroscopy and random forest,” Chemom. Intell. Lab. Syst. 157, 196–201 (2016). [CrossRef]  

13. J. J. Lin, X. M. Lin, L. B. Guo, Y. M. Guo, Y. Tang, Y. W. Chu, S. S. Tang, and C. J. Che, “Identification accuracy improvement for steel species using a least squares support vector machine and laser-induced breakdown spectroscopy,” J. Anal. At. Spectrom. 33(9), 1545–1551 (2018). [CrossRef]  

14. Y. Tang, Y. M. Guo, Q. Q. Sun, S. S. Tang, J. M. Li, L. B. Guo, and J. Duan, “Industrial polymers classification using laser-induced breakdown spectroscopy combined with self-organizing maps and K-means algorithm,” Optik 165, 179–185 (2018). [CrossRef]  

15. H. Kim, J. Lee, E. Srivastava, S. Shin, S. Jeong, and E. Hwang, “Front-end signal processing for metal scrap classification using online measurements based on laser-induced breakdown spectroscopy,” Spectrochim. Acta, Part B 184(7), 106282 (2021). [CrossRef]  

16. E. Kim, Y. Kim, E. Srivastava, S. Shin, S. Jeong, and E. Hwang, “Soft classification scheme with pre-cluster-based regression for identification of same-base alloys using laser-induced breakdown spectroscopy,” Chemom. Intell. Lab. Syst. 203(1), 104072 (2020). [CrossRef]  

17. B. Campanella, E. Grifoni, S. Legnaioli, G. Lorenzetti, S. Pagnotta, F. Sorrentino, and V. Palleschi, “Classification of wrought aluminum alloys by Artificial Neural Networks evaluation of Laser Induced Breakdown Spectroscopy spectra from aluminum scrap samples,” Spectrochim. Acta, Part B 134, 52–57 (2017). [CrossRef]  

18. E. Vors, K. Tchepidjian, and J. B. Sirven, “Evaluation and optimization of the robustness of a multivariate analysis methodology for identification of alloys by laser induced breakdown spectroscopy,” Spectrochim. Acta, Part B 117, 16–22 (2016). [CrossRef]  

19. J. P. Castro and E. R. Pereira-Filho, “Twelve different types of data normalization for the proposition of classification, univariate and multivariate regression models for the direct analyses of alloys by laser-induced breakdown spectroscopy (LIBS),” J. Anal. At. Spectrom. 31(10), 2005–2014 (2016). [CrossRef]  

20. R. Hedwig, I. Tanra, I. Karnadi, M. Pardede, A. M. Marpaung, Z. S. Lie, K. H. Kurniawan, M. M. Suliyanti, T. J. Lie, and K. Kagawa, “Suppression of self-absorption effect in laser-induced breakdown spectroscopy by employing a Penning-like energy transfer process in helium ambient gas,” Opt. Express 28(7), 9259–9268 (2020). [CrossRef]  

21. Y. Tang, S. X. Ma, R. Yuan, Y. Y. Ma, W. Sheng, S. P. Zhan, J. N. Wang, and L. B. Guo, “Spectral interference elimination and self-absorption reduction in laser-induced breakdown spectroscopy assisted with laser-stimulated absorption,” Opt. Lasers Eng. 134, 106254 (2020). [CrossRef]  

22. Z. Wang, M. S. Afgan, W. Gu, Y. Song, and Z. Li, “Recent Advances in Laser-induced Breakdown Spectroscopy Quantification: from Fundamental Understanding to Data Processing,” TrAC, Trends Anal. Chem. 143, 116385 (2021). [CrossRef]  

23. J. L. Gottfried, “Discrimination of biological and chemical threat simulants in residue mixtures on multiple substrates,” Anal. Bioanal. Chem. 400(10), 3289–3301 (2011). [CrossRef]  

24. S. Yoshino, B. Thornton, T. Takahashi, Y. Takaya, and T. Nozaki, “Signal preprocessing of deep-sea laser-induced plasma spectra for identification of pelletized hydrothermal deposits using Artificial Neural Networks,” Spectrochim. Acta, Part B 145, 1–7 (2018). [CrossRef]  

25. Y. M. Guo, Y. Liu, A. Oerlemans, S. Y. Lao, S. Wu, and M. S. Lew, “Deep learning for visual understanding: A review,” Neurocomputing 187, 27–48 (2016). [CrossRef]  

26. S. Pirmoradi, M. Teshnehlab, N. Zarghami, and A. Sharifi, “The Self-Organizing Restricted Boltzmann Machine for Deep Representation with the Application on Classification Problems,” Expert Sys. Applic. 149, 113286 (2020). [CrossRef]  

27. T. Kuremoto, S. Kimura, K. Kobayashi, and M. Obayashi, “Time series forecasting using a deep belief network with restricted Boltzmann machines,” Neurocomputing 137, 47–56 (2014). [CrossRef]  

28. J. Vrábel, P. Pořízka, and J. Kaiser, “Restricted Boltzmann Machine method for dimensionality reduction of large spectroscopic data,” Spectrochim. Acta, Part B 167, 105849 (2020). [CrossRef]  

29. Y. Zhao, M. L. Guindo, X. Xu, M. Sun, J. Y. Peng, F. Liu, and Y. He, “Deep Learning Associated with Laser-Induced Breakdown Spectroscopy (LIBS) for the Prediction of Lead in Soil,” Appl. Spectrosc. 73(5), 565–573 (2019). [CrossRef]  

30. G. E. Hinton and R. R. Salakhutdinov, “Reducing the Dimensionality of Data with Neural Networks,” Science 313(5786), 504–507 (2006). [CrossRef]  

31. G. E. Hinton, S. Osindero, and Y. W. Teh, “A fast learning algorithm for deep belief nets,” Neur. Comput. 18(7), 1527–1554 (2006). [CrossRef]  

32. S. Kamada, T. Ichimura, A. Hara, and K. J. Mackin, “Adaptive structure learning method of deep belief network using neuron generation–annihilation and layer generation,” Neural Comput. & Applic. 31(11), 8035–8049 (2019). [CrossRef]  

33. M. Scarpiniti, F. Colasante, S. D. Tanna, M. Ciancia, Y. C. Lee, and A. Uncini, “Deep Belief Network based audio classification for construction sites monitoring,” Expert Sys. Applic. 177, 114839 (2021). [CrossRef]  

34. S. Shin, Y. Moon, J. Lee, H. Jang, E. Hwang, and S. Jeong, “Signal processing for real-time identification of similar metals by laser-induced breakdown spectroscopy,” Plasma Sci. Technol. 21(3), 034011 (2019). [CrossRef]  

35. Y. P. Huang and M. F. Yen, “A new perspective of performance comparison among machine learning algorithms for financial distress prediction,” Appl. Soft Comp. 83, 105663 (2019). [CrossRef]  

36. F. C. De Lucia Jr and J. L. Gottfried, “Influence of variable selection on partial least squares discriminant analysis models for explosive residue classification,” Spectrochim. Acta, Part B 66(2), 122–128 (2011). [CrossRef]  

37. C. Zhang, T. T. Shen, F. Liu, and Y. He, “Identification of Coffee Varieties Using Laser-Induced Breakdown Spectroscopy and Chemometrics,” Sensors 18(1), 95 (2017). [CrossRef]  



Figures (8)

Fig. 1. LIBS setup: (a) schematic and (b) prototype
Fig. 2. Schematic of (a) DBN and (b) RBM
Fig. 3. LIBS spectra of various brands of special steel
Fig. 4. (a) LIBS spectrum with specified lines; (b) the score of the first two PCs from the training sets
Fig. 5. Optimization of DBN model of different parameters: the number of (a) neurons and (b) hidden layers
Fig. 6. Optimization of DBN model by using characteristic spectral lines as input: the number of (a) neurons and (b) hidden layers
Fig. 7. Best prediction of DBN model for the test set
Fig. 8. Prediction of other ML methods: (a) SOM, (b) LDA, (c) KNN, and (d) BP

Tables (3)

Table 1. Concentration information (wt.%) of certified elements in 13 brands of steel tube
Table 2. Main spectral lines of special steels
Table 3. Performance of different methods in two inputs

Equations (2)


$$E(\nu, h) = -\sum_{i=1}^{n} a_i \nu_i - \sum_{j=1}^{m} b_j h_j - \sum_{i=1}^{n}\sum_{j=1}^{m} W_{ij}\,\nu_i h_j$$
$$Y = (X - X_{\min}) / (X_{\max} - X_{\min})$$
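The min-max normalization above maps each spectral variable into [0, 1] before training. A minimal sketch, assuming per-channel (column-wise) scaling of a spectra matrix; the array here is a placeholder, not the paper's data:

```python
# Min-max normalization Y = (X - Xmin) / (Xmax - Xmin), applied per column,
# so every spectral channel spans [0, 1] regardless of its raw intensity scale.
import numpy as np

def min_max_normalize(X: np.ndarray) -> np.ndarray:
    """Scale each column of X into [0, 1]."""
    x_min = X.min(axis=0)
    x_max = X.max(axis=0)
    return (X - x_min) / (x_max - x_min)

# Two channels with very different intensity ranges
spectra = np.array([[1.0, 200.0],
                    [3.0, 400.0],
                    [2.0, 300.0]])
print(min_max_normalize(spectra))
# each column now spans exactly [0, 1]
```

Column-wise scaling keeps strong and weak emission lines on a comparable footing, which also satisfies the [0, 1] input range expected by Bernoulli RBM units.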