Abstract
Phosphoric acid-hydrogen peroxide (PHP) pretreatment is an effective method to obtain a cellulose-enriched fraction from biomass. In this study, artificial neural network (ANN) was used to predict PHP pretreatment efficiency of cellulose content (C-C), cellulose recovery (C-Ry), hemicellulose removal (H-Rl), and lignin removal (L-Rl) under various conditions of pretreatment time (t), temperature (T), H3PO4 concentration (Cp), and H2O2 concentration (Ch). The final optimized topology structure of the ANN models had 1 hidden layers with 9 neurons for C-C and 10 neurons for C-Ry, 10 neurons for H-Rl, and 12 neurons for L-Rl. The actual testing data fit the predicted data with R2 values ranging from 0.8070 to 0.9989. The relative importance (RI) revealed that Cp and Ch were significant factors influencing the efficiency of PHP pretreatment with total RI values ranging from 12% to 62.6%. However, their weights for the three components of biomass were different. The value of T dominated hemicellulose removal effectiveness with an RI value of 78.6%, while t did not seem to be a main factor dominating PHP pretreatment efficiency. The results of this study provide insights into the convenient development and optimization of biomass pretreatment from ANN modeling perspectives.
Download PDF
Full Article
Artificial Neural Network Modeling to Predict the Efficiency of Phosphoric Acid-Hydrogen Peroxide Pretreatment of Wheat Straw
Qing Wang,a,b,c Jinxiang Hua,b Jinguang Hu,c Li Zhao,b Mei Huang,b Dong Tian,b Yongmei Zeng,b Shihuai Deng,b Fei Shen,b,* and Xinquan Zhang a,*
Phosphoric acid-hydrogen peroxide (PHP) pretreatment is an effective method to obtain a cellulose-enriched fraction from biomass. In this study, artificial neural network (ANN) was used to predict PHP pretreatment efficiency of cellulose content (C-C), cellulose recovery (C-Ry), hemicellulose removal (H-Rl), and lignin removal (L-Rl) under various conditions of pretreatment time (t), temperature (T), H3PO4 concentration (Cp), and H2O2 concentration (Ch). The final optimized topology structure of the ANN models had 1 hidden layers with 9 neurons for C-C and 10 neurons for C-Ry, 10 neurons for H-Rl, and 12 neurons for L-Rl. The actual testing data fit the predicted data with R2 values ranging from 0.8070 to 0.9989. The relative importance (RI) revealed that Cp and Ch were significant factors influencing the efficiency of PHP pretreatment with total RI values ranging from 12% to 62.6%. However, their weights for the three components of biomass were different. The value of T dominated hemicellulose removal effectiveness with an RI value of 78.6%, while t did not seem to be a main factor dominating PHP pretreatment efficiency. The results of this study provide insights into the convenient development and optimization of biomass pretreatment from ANN modeling perspectives.
DOI: 10.15376/biores.19.1.288-305
Keywords: Lignocellulosic biomass; ANN model; Pretreatment efficiency; Prediction; Relative importance
Contact information: a: College of Grassland Science and Technology, Sichuan Agricultural University, Chengdu, Sichuan 611130, P. R. China; b: Institute of Ecological and Environmental Sciences, Sichuan Agricultural University, Chengdu, Sichuan 611130, P. R. China; c: Department of Chemical and Petroleum Engineering, Schulich School of Engineering, the University of Calgary, Calgary T2N 4H9, Canada; *Corresponding authors: fishen@sicau.edu.cn; zhangxq@sicau.edu.cn
INTRODUCTION
The rapid expansion of human society relies on the consumption of fossil resources, such as petroleum, coal, and natural gas, which cause severe problems, including environmental contamination and climate change (Rashid et al. 2021). This situation has led to an increased interest in alternative and renewable energy sources (Hosseini et al. 2019). Lignocellulosic biomass is the world’s most abundant renewable organic carbon-based resource, and it is considered the most promising alternative to fossil resources (Luterbacher et al. 2014). Lignocellulosic biomass consists mainly of cellulose, hemicellulose, and lignin. These three components are closely coupled with physical forces and chemical bonds to form a 3D cross-linked structure, making it challenging to separate the three components effectively (Tocco et al. 2021).
The authors’ previous study proposed an efficient biomass pretreatment method named phosphoric acid–hydrogen peroxide (PHP) pretreatment (Wang et al. 2014). The systematic research established that PHP pretreatment could treat various softwoods, hardwoods, straws, and herbs with mild conditions and low grinding requirements to achieve a sufficient lignocellulose deconstruction. Under typical pretreatment conditions, the final recovered cellulose-enriched fraction (CEF) achieved a cellulose recovery of 92%, while 83.7% of the lignin and 100% of the hemicellulose were removed. After pretreatment, 29.1 to 32.6 g glucose can be harvested from 100 g wheat straw via enzyme hydrolysis (Wang et al. 2016). Through the simultaneous saccharification and fermentation (SSF) process at 15.3% solid loading for 120 h, it could harvest 15.5 g ethanol from 100 g wheat straw (Qiu et al. 2018). The PHP pretreatment can also separate the hemicellulose and lignin from biomass to prepare high-value products, such as oligosaccharides and supercapacitors (Wan et al. 2019; Liu et al. 2021). The oxidative tail gas from PHP pretreatment could be employed to achieve 68% to 98.3% methyl blue degradation (Lei et al. 2022). A recycle experiment confirmed that 86.0% phosphoric acid could be recovered after ≥ 11 runs of pretreatment (Yao et al. 2019).
Previous studies clarified the transformation mechanism of principal components and the formation of multiple oxidation systems within PHP pretreatment, which is of great significance for the in-depth development of the PHP method (Wang et al. 2018; Tian et al. 2021). However, component separation is vital in most application scenarios, which demonstrates the urgency to carry out optimization experiments to increase the component separation efficiency, especially the separation of cellulose. Response surface methodology (RSM) is a commonly used approach for response surface mapping to the region of interest, response optimization, and operation conditions selection (Pereira et al. 2021). RSM with Box-Behnken design was employed to select PHP pretreatment conditions, and significant improvement was achieved. For example, cellulose yield maximally achieved 946.2 mg/g after 72 h enzymatic hydrolysis, under the optimized conditions of 40 °C, 2.0 h, and 70.2% H3PO4 (Qiu et al. 2017). However, both lignocellulose deconstruction and lignin/hemicellulose degradation processes are complex and non-linear. They pose a challenge for prediction with RSM, especially with limited experimental groups. RSM can only achieve good prediction accuracy within a limited range of pretreatment conditions. When the range of pretreatment conditions is large, the prediction results are often unsatisfactory.
Nowadays, a powerful predictive tool named Artificial Neural Network (ANN) is employed in various research areas because of its modeling ability, even if limited experimental data are provided (Rashid et al. 2021). The ANN technology is enlightened by the operational mode of the human brain and nervous system, which contains numerous neurons (also known as processing elements or perceptron) in multiple layers. A neuron may connect to all or a subset of the neurons in the subsequent layer, with these connections simulating the brain’s synaptic connections (Walczak and Cerpa 2001). For this reason, ANN can learn from complex, linear, and non-linear systems without any prior fitting function specified (Rashid et al. 2021). A recent study collected a total of 482 samples, and it evaluated the primary pyrolysis products of lignocellulosic biomass via ANN modeling; this approach successfully achieved the best possible results over different reactor systems, conditions, and biomass for the solid, liquid, and gaseous pyrolysis product yields (Tsekos et al. 2021). A prediction of phenolic compounds/glucose content from dilute inorganic acid pretreatment of biomass was proposed, which suggests that the ANN model could predict the pretreatment efficiency with limited conditions and groups of pretreatments (Luo et al. 2021). Moreover, the applications of the ANN model in biomass components estimation (Kartal and Özveren 2021), kinetic parameters prediction of biomass oxidation (Sunphorka et al. 2017), and pretreatment for lignocellulose degradation (Bhange et al. 2017) were studied. And these applications fully demonstrate the potential of the ANN model in biomass valorization. This study aimed to develop an ANN model to estimate the efficiency of PHP pretreatment. Four significant factors, time, temperature, H3PO4 concentration, and H2O2 concentration, were used as input variables. The output variables included cellulose content and recovery, hemicellulose removal, and lignin removal. The relative importance of these pretreatment conditions was evaluated by analyzing the neural net weights in the developed ANN model.
EXPERIMENTAL
Materials and Methods
Wheat straw (WS) was collected from a farm at Sichuan Agricultural University (Chengdu, China). The WS was air-dried and milled to pass through a 40-mesh sieve (≤ 0.45 mm) before undergoing PHP pretreatment. All reagents used were of analytical grade and were obtained from Sigma-Aldrich (Shanghai, China) unless stated otherwise.
Methods
PHP pretreatment
To carry out the PHP pretreatment, a PHP solution was prepared by diluting 85% H3PO4 with 30% H2O2. Wheat straw was then added to the solution at a solid/liquid ratio of 1:10 (w/w) in a 250 mL screw-cap bottle and mixed thoroughly. The mixture was shaken at the designed pretreatment temperature and reaction time under a rotation speed of 160 r/min. After reaching the set time, 1.0 L of deionized water was added to stop the pretreatment. The treated WS was then filtered and washed to a neutral pH to obtain the cellulose-enriched fraction (CEF), which was frozen at −20°C until further use. It should be noted that the data for modeling (training and validation of the ANN model) were derived from previous experimental results, while the data for model testing were newly collected in this study under different pretreatment conditions from the previous ones. Furthermore, this study will keep collecting more data on PHP pretreatment and enhancing the ANN model proposed herein to obtain better prediction outcomes. The PHP pretreatment conditions is listed in Table 1.
Table 1. PHP Pretreatment Conditions
Note: The raw data obtained for ANN training, validation, and testing are listed in Tables S1 and S2, respectively.
Analytical methods
The main components of WS material and CEF, including cellulose, hemicellulose, and lignin were analyzed according to the NREL method (National Renewable Energy Laboratory of the US) (Sluiter et al. 2010). The hydrolysate sugars were separated using a Shodex SH1011 column at 60 °C with a 0.05 mol/L H2SO4 mobile phase at a flow rate of 0.8 mL/min. The separated sugars were then quantified using an Agilent 1260 Infinity Ⅱ HPLC system with a G7162A differential refractive index detector (Agilent Technologies, Santa Clara, CA, USA). The solid recovery (SR) of WS after pretreatment was calculated using the Eq. 1,
SR (%) = m1/m0 × 100% (1)
where m0 represents the dry weight of WS (in this case, m0 = 5.00 g), and m1 represents the dry weight of CEF (g).
The cellulose recovery (Ry) and removal of hemicellulose or lignin (Rl) after PHP pretreatment were calculated using the following equations:
Ry (%) = SR × (C1 / C0) × 100% | (2) | |
Rl (%) = 100% − Ry | (3) |
where C0 and C1 represent the contents of the related components in WS and CEF, respectively.
ANN modeling
Figure 1a shows the flow diagram of PHP pretreatment. The primary parameters that affect the efficiency of PHP pretreatment are time (t), temperature (T), concentration of phosphoric acid (Cp), and concentration of hydrogen peroxide (Ch). After pretreatment, the main components of the obtained CEF, including solid recovery, the content of cellulose/hemicellulose/lignin as well as their corresponding recovery and removal percentages, were analyzed simultaneously (see Table S1 for the results).
Fig. 1. The flowchart of PHP pretreatment and the development of the ANN model: (a) Experimental work illustrates the procedure and data acquisition for PHP pretreatment; (b) The topology of a three-layer ANN model for predicting cellulose content and recovery of CEF. b1, j represents the bias of inputs; b2, k represents the bias of outputs; IWj, i represents weights from inputs to hidden layers; LWk, j represents weights from neurons in hidden layers to output layers.
The ANN model was developed using the Neural Network Fitting app (version 1.33) based on the neural net library in R software and implemented in OriginPro 2022 (OriginLab Corp., Northampton, MA, USA).
As shown in Fig. 1b, the ANN model proposed in this study is a multilayer perceptron (MLP) network. The MLP is a class of feedforward artificial neural network, that consists of an input layer with four input variables, a hidden layer, and an output layer with one output variable. The neurons in each layer are fully connected to the neurons in the following layer.
The number of neurons (n) in the hidden layer was determined using an empirical equation (Yang et al. 2020) as follows,
(4)
where n represents the number of neurons in the hidden layer, i represents the number of input variables, k represents the number of output variables, and α is a constant ranging from 1 to 10. The accuracy of the ANN modeling and predictions was assessed using the Root Means Square Error (RMSE), which was calculated using the following equation,
(5)
where represents the predicted value obtained from the ANN model and represents the experimental value; m is the number of samples used for ANN modeling.
The relative importance of the four input variables on each output variable was evaluated using the Garson’s equation,
(6)
where Ii represents the relative importance of the ith input variable on the output variable; IWj,i represents the net weight from ith input variable to jth neuron in the hidden layer; and LWk, j represents the net weight from the jth neuron in the hidden layer to the kth output variable.
Table 2. Selection of Modeling Parameters
Table 2 lists the key parameters of an ANN model. The activation functions, number of iterations, and number of neurons in hidden layers still need to be determined within a given range. As Fig. 2 illustrates, the entire dataset derived from previous experimental results (see Table S1) was split into a 7:3 ratio for ANN training and validation. The ANN model was constructed by selecting an activation function, choosing the number of iterations, and selecting the number of neurons in the hidden layer. The RMSE served as the main indicator for comparing NP and TP. The trained ANN model was then tested with a new dataset (see Table S2) that had different pretreatment conditions from the previous ones to assess its accuracy in real applications. The RI values were also calculated using the final proposed ANN model to better understand each parameter’s role in PHP pretreatment.
Fig. 2. Flowchart of the ANN modeling; NP, network performance; TP, target network performance; C, component content; Ry, component recovery rate; Rl, component removal rate
Statistical analysis
The raw data was digitized and preprocessed in Excel (Microsoft Corp., Redmond, WA, USA), where formula calculations, such as component recovery/removal and RMSE values, were performed. One-way analysis of variance (ANOVA) was carried out using OriginPro (OriginLab Corp., Northampton, MA, USA) software. Data are displayed as means ± standard deviation (SD) and differences among means were compared using Fisher’s Least Significant Difference (LSD) method at the significance level of “ns” P > 0.05, * P < 0.05, ** P < 0.01, and *** P < 0.001.
RESULTS AND DISCUSSION
Pretreatment of WS under Various Conditions
PHP pretreatment was performed for WS based on a single-factor experiment design within a predefined parameter range. The contents of cellulose, hemicellulose, and lignin were analyzed in the recovered CEF according to well-established procedures (Wang et al. 2016). The removal of lignin and hemicellulose is a critical evaluation index of pretreatment efficiency because they form a protective physical barrier to general valorization applications (Ohgren et al. 2007). Figure 3a shows how L-Rl varied with the changes of T and Cp (Ch). The authors observed that high T and Ch led to strong delignification.
Fig. 3. Effect of PHP pretreatment conditions on CEF components: (a) L-Rl, lignin removal; (b) H-Rl, hemicellulose removal; (c) C-C, cellulose content; (d) C-Ry, cellulose recovery. The parameters of each PHP condition are normalized in [0, 1] to draw a ternary contour plot. The color mapping refers to an actual percentage of component content, recovery, or removal. Each evaluation index is exhibited by two ternary contour plots, with the abscissa being either H3PO4 concentration or H2O2 concentration.
However, when Ch was close to 100%, lignin removal decreased significantly. Moreover, L-Rl did not change much as t increased. These results indicate that T, Cp, and Ch may dominate lignin removal in PHP pretreatment. Figure 3b shows the distribution of H-Rl under various PHP conditions. Hemicellulose removal sharply increased to 100% when t and T increased, suggesting its sensitivity to t and T in PHP pretreatment.
The purpose of pretreatment is to separate cellulose for more straightforward utilization. Thus, cellulose purity and yield are the primary evaluation indicators that should be considered carefully (Tang et al. 2019). In this study, the authors determined two related indicators: C-C and C-Ry, as shown in Figs. 3c and 3d, respectively. As depicted in Fig. 3c, C-C was relatively low under low pretreatment intensity (short t or low T) or excessive Cp/Ch. From the comprehensive analysis of the L-Rl and H-Rl, the relative content of cellulose was low because of the presence of lignin and hemicellulose under low pretreatment intensity. The C-C increases when pretreated with severe conditions (high T or long t), and C-Ry decreased significantly because of oxidative degradation or acid hydrolysis (Fig. 3d). Therefore, estimating PHP pretreatment efficiency is a complicated process that involves balancing barrier component removal and cellulose degradation. This process makes all essential PHP conditions vital to C-C and C-Ry.
Training/validation for the ANN model
Pretreatment methods that target specific components can help reduce energy and chemical consumption and achieve multi-stage utilization of biomass (Wagle et al. 2022). However, finding the optimal pretreatment conditions for different types of biomasses is a complex task that requires accurate and reliable evaluation models. In this study, the authors proposed a novel model based on ANN technology to predict pretreatment efficiency for WS. The authors used four pretreatment conditions (t, T, Cp, and Ch) as input variables and four obtained results (C-C, C-Ry, H-Rl, and L-Rl) as output variables for our ANN model.
The ANN model consists of three parts: interconnections, activation functions, and learning rules (Sadiq et al. 2019). The authors used a multilayer feed-forward network with resilient backpropagation with backtracking as the learning rule (see Fig. 1 and Table 2). The main challenge is to select a suitable activation function. Activation functions enable deep neural networks to learn complex mappings between input and output. Without them, a neural network can only learn a linear relation (Goyal et al. 2020).
The authors compared different activation functions for PHP pretreatment: rectified linear units (ReLU) and sigmoid functions such as logistic and tangent hyperbolic (tanh). Figure 4a shows the RMSE value for ANN training with each activation function. Tanh performed significantly better than logistic for C-C, C-Ry, H-Rl, and L-Rl output variables (P < 0.01 or P < 0.001). Tanh also outperformed ReLU for C-C (P < 0.05) and L-Rl (P < 0.001) output variables, but not for C-Rl and H-Rl output variables (P > 0.05). Therefore, the authors chose tanh as the ideal activation function for ANN modeling of PHP pretreatment.
Figure 4b shows the RMSE values under the different number of iterations. As the number of iterations increases (≥ 550 iterations), the RMSE decreases to a relatively stable state. The coefficient of variation (CV) is employed to estimate the differences between various output variables (refer to the subplot). The number of iterations needed to find an optimal solution for a given accuracy largely determines the overall computational efforts and the performance of an algorithm.
Fig. 4. RMSE values of C-C, C-Ry, H-Rl, and L-Rl for the ANN training and validation. The RMSE values of three activation functions (a), different numbers of iterations (b), and various numbers of neurons in the hidden layer (c)
A better ANN model should use less computation and fewer iterations (She 2014). Therefore, in this study, 900 iterations were found to be suitable for the ANN model considering its accuracy and computational amount.
Studies have shown that too many neurons in the hidden layer increase the training time and make it difficult to achieve the desired effect, even if the training data contains enough information (Panchal et al. 2011). Therefore, choosing a suitable number of neurons in the hidden layer that minimizes the error rate and maximizes the generalization ability is crucial. Figure 4c exhibits the RMSE of training/validation for four ANN models with different numbers of neurons under 900 iterations. The range of neuron numbers (1 to 13) was confirmed by an empirical equation (Eq. 4). As the iteration times increased, the RMSE of ANN training gradually decreased until reaching a stable state (the demarcation is 8th, 8th, 6th, and 11th for C-C, C-Ry, H-Rl, and L-Rl, respectively). However, its RMSE for ANN validation changes with different neuron numbers. According the previous study, the general idea is that fewer neurons will underfit, whereas too many neurons will overfit an ANN model; this is true when the other factors such as the complexity of the problem, the number of layers, the activation functions, and the number of neurons in each layer are fixed (Adil et al. 2022). To avoid these circumstances, the authors employed a facility principle of “the minimum RMSE value with minimum neurons” to determine the number of neurons for each output variable. The final option is 9 neurons for C-C, 10 neurons for C-Ry and H-Rl, and 12 neurons for L-Rl. Figure 5 compares experimental results with predicted results obtained from the optimized parameters: all data are well fitted with an R2 value ranging from 0.9648 to 0.9957, which indicates excellent prediction accuracy.
Fig. 5. Comparison of the experimental results and predicted results derived from the ANN model for (a) C-C, (b) C-Ry, (c) H-Rl, and (d) L-Rl in the training periods
Prediction and Relative Importance of Input Variables
The optimal ANN structure was determined through numerical experiments using training and validation datasets (see Table S1 for complete datasets). A separate group of the experiment (see Table 1 for conditions) was conducted to verify the effectiveness of the trained ANN model. For the results of testing, see Table S2. Figure 6a displays the fitting relationships between predicted and experimental values for C-C, C-Ry, H-Rl, and L-Rl. The slope (S) and coefficient of determination (R2) of the fit curve were used to evaluate the accuracy of the trained ANN model (Luo et al. 2021). Values near 1.00 for both S and R2 indicate high accuracy in predicting a specific variable. As shown in Fig. 6a, C-C, H-Rl, and L-Rl achieved excellent modeling precision with S values ranging from 0.96 to 1.07 and R2 values from 0.9917 to 0.9989. C-C had an S value of 0.63 and an R2 value of 0.8070, which is still valid for practical application. Based on these results, it can be concluded that the trained ANN model is an effective tool for predicting PHP pretreatment efficiency (see Table S3 for optimized key parameters).
Table 3. Weights and Biases of the Hidden and Output Layers