Skip to main content

Main menu

  • Home
  • Content
    • Current Issue
    • Early Online
    • Archive
    • Subject Collections
  • Info For
    • Authors
    • Reviewers
    • Subscribers
    • Advertisers
  • About
    • About JSWC
    • Editorial Board
    • Permissions
    • Alerts
    • RSS Feeds
    • Contact Us

User menu

  • Register
  • Subscribe
  • My alerts
  • Log in
  • My Cart

Search

  • Advanced search
Journal of Soil and Water Conservation

  • Register
  • Subscribe
  • My alerts
  • Log in
  • My Cart
Journal of Soil and Water Conservation

Advanced Search

  • Home
  • Content
    • Current Issue
    • Early Online
    • Archive
    • Subject Collections
  • Info For
    • Authors
    • Reviewers
    • Subscribers
    • Advertisers
  • About
    • About JSWC
    • Editorial Board
    • Permissions
    • Alerts
    • RSS Feeds
    • Contact Us
  • Follow SWCS on Twitter
  • Visit SWCS on Facebook
Research ArticleResearch Section

Gully erosion susceptibility prediction in Mollisols using machine learning models

Y. Wang, Y. Zhang and H. Chen
Journal of Soil and Water Conservation September 2023, 78 (5) 385-396; DOI: https://doi.org/10.2489/jswc.2023.00019
Y. Wang
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Y. Zhang
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
H. Chen
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • Article
  • Figures & Data
  • References
  • Info & Metrics
  • PDF
Loading

Abstract

In recent years, gully erosion has caused soil loss, land degradation, and a large sediment yield in the Mollisols in northeastern China, threatening agricultural development and national food security. Moreover, the prediction of gully erosion remains a great challenge owing to the difficulty of determining suitable environmental indicators and identifying the best models for predicting gully erosion prone areas. Therefore, the objective of this study was to quantify the contributions of the main factors controlling gully erosion and to identify the best model for predicting areas susceptible to gully erosion in Hailun City, northeastern China. Initially, the spatial distribution of the gully erosion was investigated through visual interpretation of GaoFen-1 satellite images. The analyzed gullies were evenly distributed in the study region, and we selected 70% of the gullies as the training data set and the remaining 30% as the validation data set. Subsequently, 12 variables, including the elevation, slope, aspect, plan curvature, profile curvature, topographic wetness index (TWI), soil type, land use, normalized difference vegetation index (NDVI), precipitation, distance from rivers, and distance from existing gullies, were selected as the indicators of gully erosion. Then, multicollinearity analysis was conducted to determine the main indicators without linearity. Finally, the contributions of the indicators and the areas susceptible to gully erosion were determined using machine learning models, including support vector machine (SVM), multilayer perceptron neural network (MLPNN), random forest (RF), and extreme gradient boosting (XGBoost) models. The results revealed that there was no multicollinearity among the 12 indicators, so they were all employed in the machine learning models for the gully erosion susceptibility prediction. The XGBoost model had the highest R2 and lowest root mean square error (RMSE) values in the model validation stage (0.81 and 0.60, respectively), followed by the RF (0.78 and 0.61, respectively), MLPNN (0.65 and 0.70, respectively), and SVM (0.62 and 0.70, respectively). The gully distance had the largest relative importance score (>35%) for gully erosion, followed by the profile curvature, plan curvature, land use, elevation, and soil type, which had relative importance scores of 10% to 15%. The gully erosion susceptibility map revealed that the central part of the study area was more susceptible to gully erosion than the other regions. These results can help managers to identify the regions that are prone to gully erosion and to design soil conservation practices to slow down the soil erosion process.

Key words:
  • controlling factors
  • gully erosion susceptibility
  • machine learning
  • Mollisols
  • relative importance

Introduction

Soil erosion, an important cause of soil loss, land degradation, and sediment yield, threatens agricultural development and the ecological environment (Zhao et al. 2016; Poesen 2018). The Mollisol region in northeastern China is an important agricultural production base and is subject to severe soil erosion. The average erosion rate in this region was approximately 15 Mg ha−1 y−1 or more during the last century (Nearing et al. 2017). Gully erosion is a common type of soil erosion in the Mollisol region in northeastern China. Gully erosion is defined as linear soil erosion over a short time period, even after an intense rainfall event. At present, there are nearly 300,000 gullies in the Mollisols region in northeastern China, and 61% of them are distributed in cultivated land. Moreover, the area of cultivated land has decreased by 0.5% to 3% (Ministry of Water Resources 2010). Therefore, to prevent gully erosion, it is necessary to determine the main factors controlling gully erosion and to identify the spatial pattern of the areas prone to gully erosion.

Based on previous studies, gully erosion is influenced by the climate, topography, hydrology, geology, and environmental conditions (Pourghasemi et al. 2017a; Gayen et al. 2019; Mokarram and Zarei 2021). In recent years, a variety of studies have been conducted to investigate the controlling factors in different areas. For example, Mokarram and Zarei (2021) determined the areas prone to gully erosion in the southern region of Fars Province and found that the normalized different vegetation index (NDVI), slope, topographic wetness index (TWI), altitude, terrain ruggedness index (TRI), lithology, and land use were the most important controlling factors. Chen (2021) reported that the altitude, land use, distance from road, and soil characteristics were the key factors controlling gully erosion in the Meimand watershed. Amiri et al. (2019) revealed that three factors, including the land use, distance from river, and clay content, had the most important impacts on gully erosion, and they quantified the contribution of each factor. However, many factors influencing the distribution of gully erosion prone areas have been identified, and it is too costly and time-consuming to employ them all in the determination of gully erosion prone areas. In addition, because the Mollisol region in northeastern China is a rolling and hilly area, with concentrated rainfall in summer, alternating freezing and thawing in winter and spring, ridged tillage, and long-term high-intensity utilization, the distribution of the erosion gullies is complicated, and the controlling factors have large uncertainties (Zhang et al. 2018; Wen et al. 2021). At present, the main factors influencing the gully erosion in the Mollisol region in northeastern China and their contributions remain unclear.

Owing to the limited data available, the prediction of gully erosion remains a great challenge worldwide, especially at the regional scale and larger scales (e.g., basin scale, province scale, and country scale). Gully susceptibility prediction models can be categorized into four types, including primal inventory-based models, heuristic models, data-driven statistical models, and physically-based deterministic models (Soleimanpour et al. 2021). Among these categories, data-driven statistical models are considered to be more reliable than the other types of models that rely on subjective and heuristic methods (Chen et al. 2021). As data-driven statistical models, machine learning methods have been applied in prediction systems, especially for the determination of highly complex relationships (Kubat 2017). Machine learning models such as the random forest (RF) (Roy et al. 2020), boosted regression tree (BRT) (Buston and Elith 2011), extreme gradient boosting (XGBoost) (Yang et al. 2021), support vector machine (SVM) (Roy et al. 2020), and artificial neural network (ANN) (Pourghasemi et al. 2017b) models have been applied for gully erosion susceptibility (GES) prediction. Previous research has focused on the development of ANN models to predict areas susceptible to gully erosion; however, the multilayer perceptron neural network (MLPNN) also has a great potential for computing nonlinear relationships between inputs and outputs, and it has been seldom used in gully prediction studies (Pourghasemi et al. 2017b; Oludolapo et al. 2012; de Oliveira et al. 2021). In previous studies, various machine learning models, such as supervised learning, neural network, and tree-based models, have been reported to be helpful in GES mapping. In addition, the contributions of the key controlling factors could also be obtained using machine learning models in this region. However, there is still debate as to the most effective and accurate model for gully erosion prediction because of the models’ complex structures.

Therefore, the objectives of this study were (1) to select suitable indicators of gully erosion and assess the contributions of the key factors controlling gully erosion; (2) to compare the performances of different machine learning models (SVM, MLPNN, RF, and XGBoost) and identify the best model for the prediction of GES; and (3) to obtain a GES map of the Mollisol region in northeastern China. Although the four models used in this study have been employed individually in previous studies, their application and comparison in a new, unstudied region contributes to our knowledge of the reliability of machine learning methods in predicting areas prone to gully erosion in different regions.

Materials and Methods

Study Area. Hailun (46°58′ to 47°52′ N, 126°14′ to 127°45′ E) is located in the central Mollisol region in Heilongjiang Province, China, with elevations ranging from 144 to 484 m (figure 1). It is an important national commodity grain base in China, and the large difference in elevation and long-term extensive usage of the land have led to serious soil erosion in Hailun City. This area is plain land, with an area of 4,667 km2. Hailun is located in the north temperate continental monsoon climate zone, with a maximum temperature of 37°C and a minimum temperature of −39.5°C during the year. The main soil types include black soil, meadow soil, swamp soil, and small amounts of paddy soil, dark-brown soil, and albic soil (Tang et al. 2021).

Figure 1
  • Download figure
  • Open in new tab
  • Download powerpoint
Figure 1

Location of Hailun and the spatial distribution of the gullies.

Mapping of Gully Erosion Situation. Since gully erosion prone areas are closely linked to the distribution pattern of existing gullies, gully erosion investigation is essential for GES prediction. As figure 2 shows, the gullies photographed using an unmanned aerial vehicle were captured in the central part of Hailun, with a viewing azimuth of ~45°. There were 4,662 existing gullies distributed in Hailun based on visual interpretation of GaoFen-1 (GF-1) images with a 2 m resolution. GaoFen is the first civilian optical remote sensing satellite that was independently developed by China. This GF series satellite is a high-definition Earth observation system (HDEOS) developed by the China Academy of Space Technology (CAST) (Sun et al. 2018). Of the 4,662 gullies identified, 70% (3,263 gullies, evenly distributed throughout the entire study area) were selected and used to calibrate the prediction model; the other 30% (1,399 gullies) were used for the model validation (figure 1).

Figure 2
  • Download figure
  • Open in new tab
  • Download powerpoint
Figure 2

Unmanned aerial vehicle photographs of several gullies in the study area.

Selection of Indicators. The indicators were selected based on the following criteria: data availability, previous experiences and reports in the literature, data connection and heterogeneity, and local geo-environmental characteristics (Arabameri et al. 2018; Teimouri and Kornejady 2019; Zabihi et al. 2019; Gayen et al. 2019; Chen et al. 2021). The data connection was taken into account because the factors should be connected to each other and work together to influence the gully erosion process. In addition, the data heterogeneity was considered to ensure that the factors with a similar pattern across the study region were excluded while more factors with different properties were involved in the modeling process (Soleimanpour et al. 2021). It is worth noting that some of the environmental indicators (e.g., soil erodibility and runoff) with high-resolution data were not available for the study area, so such indicators were not used in this study. In addition, Hailun is mainly a water erosion area, so wind erosion related factors such as the wind direction and velocity were not considered in this study. Finally, 12 indicators were selected: elevation, slope, aspect, plan curvature, profile curvature, TWI, soil type, land use, NDVI, precipitation, distance from rivers, and distance from existing gullies (table 1). We found that half of the indicators were topography-related variables. This occurred because Hailun is a rolling and hilly area, so the topographic features have more impact on gully erosion in this region. Moreover, the topographic factors were associated with other environmental variables (e.g., precipitation, NDVI, soil type, and land use) that affect the gully erosion process together.

View this table:
  • View inline
  • View popup
  • Download powerpoint
Table 1

Details of the controlling factors used in gully erosion prediction.

Topographic attributes play an important role in the expansion and control of gully erosion by indirectly affecting the vegetation and precipitation characteristics (Nhu et al. 2020). In this study, the digital elevation model (DEM) data were obtained from the Advanced Land Observing Satellite (ALOS) phased array type L-band synthetic aperture radar (PALSAR) sensor, with a resolution of 12.5 m. The slope and aspect can indirectly affect the erosion process by controlling the sun exposure, vegetation type, and soil moisture (Zabihi et al. 2018). The plan and profile curvature affect the divergence or convergence of water during its descent (Avand et al. 2019). The TWI also has an impact on the morphometric and hydrological characteristics. Thus, slope, aspect, plan curvature, profile curvature, and TWI maps were derived from the DEM data using the ArcGIS 10.6 software. The TWI is expressed as equation 1 (Moore et al. 1991):

Embedded Image 1

where As is the cumulative catchment area (m2 m−1) of a point, and θ is the slope at this point. The soil properties affect the flow of the current subsurface and the occurrence of piping erosion, which can cause gully erosion (Rahmati et al. 2016). In this study, a soil type map of the study area was obtained based on the Second National Soil Survey of China. Land use is an important factor in the occurrence of gully erosion due to its impact on the vegetation conditions. In this study, a land use map was interpreted from GF-1 images acquired in 2021 (figure 1) using the object-based classification method in the eCognition 9.0 software. There were 14 land use types in the study area: traffic land, residential land, dry land, reservoirs, rice paddies, rivers, lakes, grassland, wetlands, meadow, coniferous forest, broad leaved forest, canals, and mixed broadleaf-conifer forest. Finally, an accuracy of 90.62% was obtained for the classification. The NDVI can reflect the vegetation status, and the vegetation status is of great importance in hydrological and soil studies. In this study, the NDVI data were derived from moderate resolution imaging spectroradiometer (MODIS) products (MOD13) with a spatial resolution of 250 m. Precipitation is also an important factor affecting runoff and gully erosion process. A precipitation map was prepared from the mean annual precipitation (MAP) data obtained from the China Meteorological Data Service Centre over a statistical period of 30 years (1992 to 2021). There are 17 meteorological stations, which are nearly evenly distributed, in Hailun City, and the precipitation data were interpolated using the kriging method. Because the areas near the rivers and existing gullies were more susceptible to erosion, these two factors were considered to be factors causing gully erosion. A distance from rivers and gullies map was also obtained using the Euclidean distance tool in the Spatial Analyst module of the ArcGIS software. This tool can calculate the Euclidean distance to the closest river or gully for each pixel.

Except for the topographic attributes and soil type, the other indicators all exhibited spatiotemporal variations. The topographic attributes and soil type provided a total geographic base for gully erosion. For example, the study area is a rolling and hilly area, which is determined by the topographic characteristics, and the soil factor determines the soil erodibility. The land use type, NDVI, precipitation, and distance from rivers and existing gullies exhibited spatiotemporal variations, and they had variable influences on the gully erosion process. All of the above factors jointly affected the gully erosion process.

Multicollinearity Test of the Indicators. Because there may be correlations between pairs of indicators, it is essential to test the multicollinearity before conducting gully erosion prediction (Saha 2017; Gayen et al. 2019). In this study, after the various indicators were selected, multicollinearity analysis was conducted to remove the variables with collinearity. Two parameters were selected to analyze the multicollinearity between the factors: the tolerance and the variance inflation factor (VIF). These parameters are defined as equations 2 and 3:

Embedded Image 2

Embedded Image 3

where R2J is the regression coefficient of the determination of factor J on all of the other indicators. Tolerance > 0.10 and VIF > 5 indicate a multicollinearity problem (Pourghasemi et al. 2017b).

Machine Learning Models for Gully Erosion Susceptibility Prediction. The structures of machine learning models are different and somewhat complex, and in general, these models have various accuracies and transparencies. For example, the SVM has the potential to solve the overfitting problem when using high-dimensional data (Chlingaryan et al. 2018). Neural networks have a good ability to model complex nonlinear relationships between dependent and independent variables (Cui et al. 2018). The RF and XGBoost models are both tree-based models, and they can obtain the relative importance of the predictors (Zhang et al. 2021). The goal of this study was to identify a more widely applicable type of model for GES prediction. Thus, we considered and compared the currently widely used types of machine learning models, including supervised learning, multilayer perceptrons, and tree-based methods, for predicting areas susceptible to gully erosion. The most well-known models of these types are the SVM, MLPNN, RF, and XGBoost algorithms.

Support Vector Machine. SVM regression is a supervised, nonparametric statistical learning method, and it uses structural risk minimization (SRM) to obtain an optimal overall response (Vapnik 1995). The SVM model can handle a high-dimensional multivariate space (Karatzoglou et al. 2006). It includes four types of classification functions (i.e., linear, polynomial, radial, and sigmoid), and each kernel has its own optimization parameters (Joachims 1997). There are two main features that need to be optimized: one is the selection of the kernel function, and the other one is the noise tolerance of each kernel. The information regarding the specific SVM parameters used for the GES prediction has not been explored as deeply.

Multilayer Perceptron Neural Network. An ANN can build the complex and functional relationships between different variables without setting any null hypothesis (Yang et al. 2017). The MLPNN is one of the most commonly used neural models for prediction, and it uses the back-propagation technique for learning. It is interconnected by connecting forces represented by the synaptic weights (de Oliveira et al. 2021). The MLPNN contains three layers: a first input layer, a middle hidden layer, and a last output layer. After the data set is input into the model, the training process is repeated until the error is reduced to an acceptable value.

Random Forest. RF regression is a tree modeling decision-making method, which combines the anticipation of several single algorithm using rules (Breiman 2001). Three user-defined parameters, including the number of trees (ntree), the number of variables used as predictors for each tree (mtry), and the minimum size in each terminal mode (node size), are contained in the RF model (Friedman and Meulman 2003). In this study, the ntree was set as 500 to 2,000, and the optimal ntree was selected to stabilize the error. In general, the mtry is set to one-third of the number of selected indicators (Zhang et al. 2019).

Extreme Gradient Boosting. The XGBoost model is also a decision tree-based model, and it is a modification of the gradient boosting model. Compared with the RF model, the XGBoost model grows each tree based on the residuals of the previous tree, and some studies have found that it outperformed RF in regression models (Stojic et al. 2019; Huang et al. 2020). This algorithm is generally applied in classification, credit rating, and prediction studies (Cherif and Kortebi 2019). Among the existing machine learning methods, this algorithm has gained popularity because of its generalizability, low risk of overfitting, and high interpretability (Liu et al. 2021). The XGBoost model outperforms other machine learning models in terms of its predictive capability, and it has also been used to accurately estimate the aboveground biomass of maize (Zea mays L.) (Zhang et al. 2021).

The contributions of the indicators of gully erosion can also be calculated quantitatively using the RF and XGBoost models. In this study, the importance of the factors in the gully erosion prediction was determined according to the relative importance, which was calculated through comprehensive evaluation (equation 4) (Niu et al. 2021).

Embedded Image 4

where wi is the relative importance of factor i; Ai and Bi are the %IncMSE and IncNodePurity values of factor i, respectively; and n is the total number of factors. %IncMSE is the increase in the percentage of the mean squared error, i.e., the value by which the accuracy of the prediction decreases after the variable is removed. IncNodePurity is the increase in the node purity. A larger node purity indicates a more important variable (Angermueller et al. 2016).

In general, machine learning models depend on databases composed of point samples, which represent the occurrence of gullies and the nonoccurrence of gullies. The sample points containing gullies were derived from the interpreted GF-1 imagery. The sample points without gullies were obtained using the random sampling tool in ArcGIS 10.6.

Model Validation and Accuracy Assessment. To compare the reliabilities of the predictions of the different models, the coefficient of determination (R2), root mean square error (RMSE), and mean absolute error (MAE) were selected to compare the predictions of the different models (Paul et al. 2019; Pourghasemi et al. 2020). The R2 value is usually used as an index of the goodness-of-fit and is computed to ensure the fit of the predicted outcomes (Harel 2009). The RMSE can be used as a benchmark for crosschecking the inaccuracies of models (Tien Bui et al. 2016). However, when the model contains large values and outliers, the RMSE becomes susceptible to imprecision (Chai and Draxler 2014). Thus, the MAE was also employed in this study. In this study, these three parameters were used to identify the best models for gully erosion prediction. They were calculated as equations 5 through 7:

Embedded Image 5

Embedded Image 6

Embedded Image 7

where zi and Embedded Image are the observed values and predicted values at site i, z is the average of the observed values, and n is the number of samples. Then, the best model was selected to conduct the GES prediction in the study area. In addition, the performances of the four models were assessed based on the receiver operating characteristic (ROC) curve, which is a typical measure for assessing the outcomes of the application of machine learning models. The area under the curve (AUC) represents the model’s performance. Values close to 1 indicate a perfect model, while values close to 0.5 indicate an inaccurate model (Chung and Fabbri 2003).

Results and Discussion

Extraction of Indicators. To prepare the GES map for Hailun, we employed 12 indicators. Based on the above methods, these indicators were extracted. The results are presented in figure 3. In figure 3, most of the subfigures exhibit different spatial patterns, providing environmental indicators for gully erosion prediction from multiple aspects.

Figure 3
  • Download figure
  • Open in new tab
  • Download powerpoint
Figure 3

Indicators extracted in this study: (a) elevation, (b) slope, (c) aspect, (d) plan curvature, (e) profile curvature, (f) topographic wetness index, (g) soil type, (h) land use, (i) normalized difference vegetation index, (j) precipitation, (k) distance from rivers, and (l) distance from gullies.

Multicollinearity Test. The multicollinearity assessment was conducted to exclude biased variables and select appropriate indicators for the modeling. The multicollinearity analysis results for the 12 indicators are presented in table 2. The multicollinearity of the selected variables was analyzed based on the VIF and tolerance limit. The lowest VIF represents the highest tolerance levels. The lowest and highest VIF were related to the distance from existing gullies (1.046) and slope (2.020). The 12 indicators all had VIFs of less than 5 (the VIF limit). Therefore, there was no multicollinearity among the 12 factors.

View this table:
  • View inline
  • View popup
  • Download powerpoint
Table 2

Multicollinearity test results.

Comparison of the Prediction Models. By comparing the four models (SVM, MLPNN, RF, and XGBoost) for GES prediction, it was found that the XGBoost model was the best model for predicting GES because it had the lowest RMSE (0.60) and MAE (0.50) values and the highest R2 value (0.81) (table 3). In addition, the RF model also had a good performance, with R2 = 0.78, RMSE = 0.61, and MAE = 0.50. The SVM model had the poorest performance, with R2 = 0.62, RMSE = 0.70, and MAE = 0.50. The performances of the models were also evaluated based on their ROC curves (figure 4). Figure 4 clearly shows that the SVM, MLPNN, RF, and XGBoost models had AUC values of 0.832, 0.867, 0.966, and 0.985, respectively. The XGBoost and RF models outperformed the other two models, achieving the best results in terms of their abilities to predict the occurrence and absence of gullies. Therefore, we selected the XGBoost and RF models as the final prediction models for the GES mapping.

View this table:
  • View inline
  • View popup
  • Download powerpoint
Table 3

Validation results of the four models.

Figure 4
  • Download figure
  • Open in new tab
  • Download powerpoint
Figure 4

Receiver operator curve (ROC) and area under the curve (AUC) values for the four models ([a] support vector machine [SVM], [b] multilayer perceptron neural network [MLPNN], [c] random forest [RF], and [d] extreme gradient boosting [XGBoost]).

Relative Importance of Indicators. The relative importance can reflect the influence of each controlling factor on the GES prediction. In this study, both the RF and XGBoost models had good performances, so only these two models were used to calculate the contribution of each factor. Thus, the relative importance of the indicators of gully erosion were obtained based on the combination of the importance scores calculated using the RF and XGBoost models (figure 5).

Figure 5
  • Download figure
  • Open in new tab
  • Download powerpoint
Figure 5

Relative importance of the indicators for gully erosion prediction.

As figure 5 shows, the distance from existing gullies was the most important of the 12 indicators, and its relative importance was greater than 35%. The profile curvature, plan curvature, land use, elevation, and soil type were the second most important factors controlling gully erosion, with contributions of 10% to 15%. The other six indicators, including slope, river distance, precipitation, aspect, NDVI, and TWI, had the smallest impacts on gully erosion, with contributions of less than 5%.

Mapping of Areas Susceptible to Gully Erosion. The areas susceptible to gully erosion were predicted using the four different machine learning methods, but only the two prediction results obtained from the models with good performances are presented. Figure 6 presents the GES maps for Hailun City obtained using the XGBoost and RF models. The GES in the study area was classified into four levels: very high, high, medium, and low level. The spatial heterogeneities of the GES presented by the two GES maps obtained using the XGBoost and RF models are consistent. These GES maps indicate that the central part of Hailun was more susceptible to gully erosion than the northeastern and southwestern parts. In addition, the areas and percentages of the four levels for GES are shown in table 4. Most of the study area was predicted to have a medium susceptibility to gully erosion (53.82% to 61.64%), while the areas with low (8.12% to 11.59%) and very high (5.32% to 5.78%) susceptibility levels were small. The area with a high level of susceptibility was fairly large (24.92% to 28.81%).

Figure 6
  • Download figure
  • Open in new tab
  • Download powerpoint
Figure 6

Gully erosion susceptibility (GES) maps obtained using the (a) extreme gradient boosting (XG-Boost) and (b) random forest (RF) models.

View this table:
  • View inline
  • View popup
  • Download powerpoint
Table 4

Area of each gully susceptibility levels in extreme gradient boosting (XGBoost) and random forest (RF) model.

Discussion. The intensive gully erosion in the Mollisol region in northeastern China, which is an important region of crop production in China, has caused serious soil loss and cropland degradation (Gao et al. 2015). Thus, it is essential to identify the areas susceptible to gully erosion and to slow down the erosion process in this region. In general, the selection of effective indicators of gully erosion is a crucial step in identifying the areas susceptible to gully erosion. However, there are no fixed rules for choosing suitable indicators for GES prediction (Gutiérrez et al. 2009; Chen et al. 2021). Based on the above-mentioned criteria and multicollinearity test results (table 2), all 12 indicators were retained and employed in the model for predicting areas susceptible to gully erosion.

In recent years, different researchers have tried to predict areas susceptible to gully erosion by incorporating different types of algorithms, generally including traditional statistical and novel machine learning methods (Garosi et al. 2018; Soleimanpour et al. 2021; Chen et al. 2021; Mokarram and Zarei 2021). However, the use of machine learning methods has been reported to be a reliable approach for GES prediction (Bunker and Thabtah 2019; Chen et al. 2021). For example, Soleimanpour et al. (2021) revealed that the quick, unbiased, efficient statistical tree (QUEST) model outperformed other prediction models, such as the frequency ratio (FR) model and the evidential belief function (EBF) model, and this model achieved the highest area under the receiver operating characteristic (AUROC) curve and true skill statistic (TSS) values of 83.2% and 0.63, respectively. Chen et al. (2021) compared different machine learning models for predicting areas susceptible to gully erosion and found that the deep boost (DB) model had a significantly higher accuracy than other models, including the boosted tree (BT) model, boosted generalized linear model (BGLM), and BRT model.

As the results in this study show, the XGBoost model outperformed the other three algorithms (SVM, MLPNN, and RF), and its ability to predict areas susceptible to gully erosion in the study was demonstrated (table 3 and figure 4). In addition, based on the comparison of the four gully erosion prediction models, the tree-based machine learning models (such as the RF and XGBoost) had better GES prediction performances than the other models. The SVM model only has three hyperparameters for tuning, which leads to its weakness in fitting the complex relationships between predictors and indicators. Thus, preanalysis has been highly recommended, which can be regarded as a baseline for comparison with other models (Tang and Na 2021). The MLPNN has a higher model generalization ability due to its multilayered architecture, but as the number of hidden layers increases, the estimation behind the network becomes more complex and opaque (Mittendorf et al. 2022). The nonparametric tree-based models have the advantages of resistance to overfitting, insensitivity to noise, and unbiased error rate measurement compared to the other estimation models (Zhang et al. 2017, 2021). The tree-based models also have a clear and transparent structure and provide not only a decision path for every prediction but also a direct estimate of each indicator’s predictive power. The conclusion reached in this study is consistent with the results of other studies, which also reported that the RF and XGBoost algorithms have high performances in the prediction of land surface parameters (Garosi et al. 2019; Pourghasemi et al. 2020; Zhang et al. 2021).

In addition, the contributions of the indicators were also calculated using machine learning methods in such studies. For example, Rouhani et al. (2021) concluded that the height, distance from a fault, slope, and index of connectivity were the most important factors in gully erosion prediction. Pourghasemi et al. (2020) showed that the distance from rivers was one of the most effective factors in determining areas prone to gully erosion. Chen et al. (2021) reported that elevation was the most important factor in GES modeling. There are some consistencies between the results of this study and those of other studies on GES prediction. For example, in this study, the topography-related factors were also identified as important factors for GES prediction. Nevertheless, the distance to existing gullies has seldom been selected as an indicator in GES prediction studies. However, in this study, it was found that this indicator played the most significant role (relative importance of >35%) in the GES prediction. Existing gullies play an important role in changing the surface hydrological processes and also provide a platform for accumulated precipitation to be transformed into runoff. Therefore, it was concluded that obtaining the spatial distribution of the existing gullies is essential for GES prediction. The distribution of the existing gullies provided a base map for the gully erosion conditions in the study area, and the areas susceptible to gully erosion were identified based on the existing gullies. However, this indicator is not suitable for modeling gully erosion prediction in other areas without data for existing gullies, and thus, it would not necessarily be the most essential indicator in gully prediction in other study areas.

Indeed, the gully erosion was also affected by other environmental factors. For example, the spatial distribution of the accumulated precipitation is modified by the existing gullies, rivers, and road network in a specific region. In this study, the land use type also had a significant impact on the gully erosion prediction, which is consistent with the results of similar studies. Mokarram and Zarei (2021) reported that land use intensified the gully erosion process. Chen et al. (2021) showed that land use had the greatest impact on the GES. In particular, densely vegetated regions are less prone to soil erosion, while barren and sparsely vegetated regions are more prone to soil erosion. The soil properties play a significant role in gully erosion prediction because different soil types have various soil and water retention abilities.

Summary and Conclusions

In this study, four different machine learning algorithms were applied to investigate the factors controlling gully erosion and to predict the areas susceptible to gully erosion in Hailun in the Mollisol region in northeastern China. Accordingly, 12 factors, including topography, soil, land use, vegetation, and climate-related variables, were selected to investigate the main factors controlling gully erosion. The contributions of the controlling factors and the areas susceptible to gully erosion in Hailun City were determined using four machine learning algorithms, including the SVM, MLPNN, RF, and XGBoost models. In this study, the areas susceptible to gully erosion were predicted with an adequate accuracy using the different machine learning models. Through comparison of the four prediction models, it was found that the XGBoost and RF models both achieved good performances in predicting the areas susceptible to gully erosion. In addition, the selection of the most relevant environmental indicators for predicting areas susceptible to gully erosion is challenging due to the anthropogenic alterations of the land resources. In this study, it was found that the most important factors controlling gully erosion were the gully distance, profile curvature, plan curvature, land use, elevation, and soil type. The central part of the study area was more susceptible to gully erosion than the other areas, which is consistent with the visual interpretation results.

In future studies, the indicators (e.g., DEM and NDVI data) with a high spatial resolution will be used to improve the accuracy of the prediction of the areas susceptible to gully erosion. In addition to the 12 indicators selected in this study, other variables such as the soil texture, electrical conductivity, and run-off speed will be employed to identify the areas susceptible to gully erosion. Owing to the complex relationships between gully erosion and the controlling factors, other types of machine learning algorithms, especially deep learning models (e.g., convolutional neural networks), should be applied in GES prediction.

Acknowledgements

The work was supported by the National Key R&D Program of China (2021YFD1500800), the National Natural Science Foundation of China (U19A2061), the Science and Technology Project for Black Soil Granary (XDA28080500), and the National Earth System Science Data Center. We thank LetPub (www.letpub.com) for its linguistic assistance during the preparation of this manuscript.

  • Received January 29, 2022.
  • Revision received February 15, 2023.
  • Accepted March 22, 2023.
  • © 2023 by the Soil and Water Conservation Society

References

  1. ↵
    1. Amiri, M.,
    2. H.R. Pourghasemi,
    3. G.A. Ghanbarian, and
    4. S.F. Afzali
    . 2019. Assessment of the importance of gully erosion effective factors using Boruta algorithm and its spatial modeling and mapping using three machine learning algorithms. Geoderma 340:55-69. https://doi.org/10.1016/j.geoderma.2018.12.042.
    OpenUrl
  2. ↵
    1. Angermueller, C.,
    2. T. Pärnamaa,
    3. L. Parts, and
    4. O. Stegle
    . 2016. Deep learning for computational biology. Molecular Systems Biology 12(7):878. https://doi.org/10.15252/msb:20156651.
    OpenUrlAbstract/FREE Full Text
  3. ↵
    1. Arabameri, A.,
    2. B. Pradhan,
    3. K. Rezaei,
    4. M. Yamani,
    5. H.R. Pourghasemi, and
    6. L. Lombardo
    . 2018. Spatial modelling of gully erosion using evidential belief function, logistic regression and new ensemble EBF-LRalgorithm. Land Degradation & Development 29:4035-4049. https://doi.org/10.1002/ldr.3151.
    OpenUrl
  4. ↵
    1. Avand, M.,
    2. S. Janizadeh,
    3. S.A. Naghibi,
    4. H.R. Pourghasemi,
    5. S. Khosrobeigi Bozchaloei, and
    6. T. Blaschke
    . 2019. A comparative assessment of random forest and k-nearest neighbor classifiers for gully erosion susceptibility mapping. Water 11:2076. https://doi.org/10.3390/w11102076.
    OpenUrl
  5. ↵
    1. Breiman, L.
    2001. Random forests. Machine Learning 45(l):5-32.
    OpenUrl
  6. ↵
    1. Bunker, R.P., and
    2. F. Thabtah
    . 2019. A machine learning framework for sport result prediction. Applied Computing Informatics 15:27-33. https://doi.org/10.1016/j.aci.2017.09.005.
    OpenUrl
  7. ↵
    1. Buston, P.M., and
    2. J. Elith
    . 2011. Determinants of reproductive success in dominant pairs of clownfish: A boosted regression tree analysis. Journal of Animal Ecology 80:528-538. https://doi.org/10.1111/j.1365-2656.2011.01803.x.
    OpenUrlCrossRefPubMed
  8. ↵
    1. Chai, T., and
    2. R.R. Draxler
    . 2014. Root mean square error (RMSE) or mean absolute error (MAE)? Arguments against avoiding RMSE in the literature. Geoscience Model Development (GMD) 7(3):1247-1250. https://doi.org/10.5194/gmd-7-1247-2014.
    OpenUrl
  9. ↵
    1. Chen, W.,
    2. X. Lei,
    3. R. Chakrabortty,
    4. S.C. Pal,
    5. M. Sahana, and
    6. S. Janizadeh
    . 2021. Evaluation of different boosting ensemble machine learning models and novel deep learning and boosting framework for head-cut gully erosion susceptibility. Journal of Environmental Management 284:112015. https://doi.org/10.1016/j.jenvman.2021.112015.
    OpenUrl
  10. ↵
    1. Cherif, I.L., and
    2. A. Kortebi
    . 2019. On Using eXtreme Gradient Boosting (XGBoost) Machine Learning Algorithm for Home Network Traffic Classification. 2019 Wireless Days (WD), Manchester, UK, 2019, pp. 1-6. https://doi.org/10.1109/WD.2019.8734193.
  11. ↵
    1. Chlingaryan, A.,
    2. S. Sukkarieh, and
    3. B. Whelan
    . 2018. Machine learning approaches for crop yield prediction and nitrogen status estimation in precision agriculture: A review. Computer and Electronics in Agriculture 151:61-69. https://doi.org/10.1016/j.compag.2018.05.012.
    OpenUrl
  12. ↵
    1. Chung, C.-J.F., and
    2. A.G. Fabbri
    . 2003. Validation of spatial prediction models for landslide hazard mapping. Natural Hazards 30(3):451-472. https://doi.org/10.1023/B:NHAZ.0000007172.62651.2b.
    OpenUrlCrossRef
  13. ↵
    1. Cui, J.,
    2. Y. Li,
    3. S. Wang,
    4. Y. Chi,
    5. H. Hwang, and
    6. P. Wang
    . 2018. Directional preparation of anticoagulant-active sulfated polysaccharides from Enteromorpha prolifera using artificial neural networks. Scientific Reports 8(1):1-9. https://doi.org/10.1038/s41598-018-21556-x.
    OpenUrlCrossRef
  14. ↵
    1. De Oliveira, M.F.,
    2. A.F. dos Santos,
    3. E.H. Kazama,
    4. G. de Souza Rolim, and
    5. R.P. da Silva
    . 2021. Determination of application volume for coffee plantations using artificial neural networks and remote sensing. Computers and Electronics in Agriculture 184:106096. https://doi.org/10.1016/j.compag.2021.106096.
    OpenUrl
  15. ↵
    1. Friedman, J.H., and
    2. J.J. Meulman
    . 2003. Multiple additive regression trees with application in epidemiology. Statistics in Medicine 22(9):1365-1381. https://doi.org/10.1002/sim.1501.
    OpenUrlCrossRefPubMed
  16. ↵
    1. Gao, X.,
    2. Y. Xie,
    3. G. Liu,
    4. B. Liu, and
    5. X. Duan
    . 2015. Effects of soil erosion on soybean yield as estimated by simulating gradually eroded soil profiles. Soil & Tillage Research 145:126-134. https://doi.org/10.1016/j.still.2014.09.004.
    OpenUrl
  17. ↵
    1. Garosi, Y.,
    2. M. Sheklabadi,
    3. C. Conoscenti,
    4. H.R. Pourghasemi, and
    5. K. Van Oost
    . 2019. Assessing the performance of GIS-based machine learning models with different accuracy measures for determining susceptibility to gully erosion. Science of the Total Environment 664:1117-1132. https://doi.org/10.1016/j.scitotenv.2019.02.093.
    OpenUrl
  18. ↵
    1. Garosi, Y.,
    2. M. Sheklabadi,
    3. H.R. Pourghasemi,
    4. A.A. Besalatpour,
    5. C. Conoscenti, and
    6. K. VanOost
    . 2018. Comparison of differences in resolution and sources of controlling factors for gully erosion susceptibility mapping. Geoderma 330:65-78. https://doi.org/10.1016/j.geoderma.2018.05.027.
    OpenUrl
  19. ↵
    1. Gayen, A.,
    2. H.R. Pourghasemi,
    3. S. Saha,
    4. S. Keesstra, and
    5. S. Bai
    . 2019. Gully erosion susceptibility assessment and management of hazard-prone areas in India using different machine learning algorithms. Science of the Total Environment 668:124-138. https://doi.org/10.1016/j.scitotenv.2019.02.436.
    OpenUrl
  20. ↵
    1. Gutiérrez, Á.G.,
    2. S. Schnabel, and
    3. J.F.L. Contador
    . 2009. Using and comparing two nonparametric methods (CART and MARS) to model the potential distribution of gullies. Ecological Modelling 220:3630-3637. https://doi.org/10.1016/j.ecolmodel.2009.06.020.
    OpenUrl
  21. ↵
    1. Harel, O.
    2009. The estimation of R2 and adjusted R2 in incomplete data sets using multiple imputation. Journal of Applied Statistics 36(10):1109-1118. https://doi.org/10.1080/02664760802553000.
    OpenUrlCrossRef
  22. ↵
    1. Huang, J.C.,
    2. Y.C. Tsai,
    3. P.Y. Wu,
    4. Y.H. Lien,
    5. C.Y. Chien,
    6. C.F. Kuo,
    7. J.F. Hung,
    8. S.C. Chen, and
    9. C.H. Kuo
    . 2020. Predictive modeling of blood pressure during hemodialysis: A comparison of linear model, random forest, support vector regression, XGBoost, LASSO regression and ensemble method. Computer Methods and Programs in Biomedicine 195:105536. https://doi.org/10.1016/j.cmpb.2020.105536.
    OpenUrl
  23. ↵
    1. Joachims, T.
    1997. A probabilistic analysis of the Rocchio algorithm with TFIDF for text categorization. In Proceedings of the Fourteenth International Conference on Machine Learning (ICML), 143-151. San Francisco, CA: Morgan Kaufmann Publishers Inc.
  24. ↵
    1. Karatzoglou, A.,
    2. D. Meyer, and
    3. K. Hornik
    . 2006. Support vector machines in R. Journal of Statistical Software 15:1-28. https://doi.org/10.18637/jss.v015.i09.
    OpenUrl
  25. ↵
    1. Kubat, M.
    2017. An Introduction to Machine Learning. Cham, Switzerland: Springer Cham. https://doi.org/10.1007/978-3-319-63913-0.
  26. ↵
    1. Liu, P.,
    2. B. Fu,
    3. S.X. Yang,
    4. L. Deng,
    5. X. Zhong, and
    6. H. Zheng
    . 2021. Optimizing survival analysis of XGBoost for ties to predict disease progression of breast cancer. IEEE Transactions on Biomedical Engineering 68(1):148-60. https://doi.org/10.1109/TBME.2020.2993278.
    OpenUrl
  27. ↵
    1. Ministry of Water Resources (Chinese Academy of Sciences, Chinese Academy of Engineering)
    . 2010. Soil Erosion Prevention and Ecological Security in China: The Black Soil Region in Northeast China, 254-269. Beijing: Science Press.
  28. ↵
    1. Mittendorf, M.,
    2. U.D. Nielsen, and
    3. H.B. Bingham
    . 2022. Data-driven prediction of added-wave resistance on ships in oblique waves—A comparison between tree-based ensemble methods and artificial neural networks. Applied Ocean Research 118:102964. https://doi.org/10.1016/j.apor.2021.102964.
    OpenUrl
  29. ↵
    1. Mokarram, M., and
    2. A.R. Zarei
    . 2021. Determining prone areas to gully erosion and the impact of land use change on it by using multiple-criteria decision-making algorithm in arid and semi-arid regions. Geoderma 403:115379. https://doi.org/10.1016/j.geoderma.2021.115379.
    OpenUrl
  30. ↵
    1. Moore, I.D.,
    2. R.B. Grayson, and
    3. A.R. Ladson
    . 1991. Digital terrain modeling: A review of hydrological, geomorphological, and biological applications. Hydrological Processes 5:3-30. https://doi.org/10.1002/hyp.3360050103.
    OpenUrlCrossRef
  31. ↵
    1. Nearing, M.A.,
    2. Y. Xie,
    3. B. Liu, and
    4. Y. Ye
    . 2017. Natural and anthropogenic rates of soil erosion. International Soil and Water Conservation Research 5:77-84. https://doi.org/10.1016/j.iswcr.2017.04.001.
    OpenUrl
  32. ↵
    1. Nhu, V.-H.,
    2. S. Janizadeh,
    3. M. Avand,
    4. W. Chen,
    5. M. Farzin,
    6. E. Omidvar,
    7. A. Shirzadi,
    8. H. Shahabi,
    9. J.J. Clague,
    10. A. Jaafari,
    11. F. Mansoorypoor,
    12. B.T. Pham,
    13. B.B. Ahmad, and
    14. S. Lee
    . 2020. GIS-based gully erosion susceptibility mapping: A comparison of computational ensemble data mining models. Applied Science 10:2039. https://doi.org/10.3390/app10062039.
    OpenUrl
  33. ↵
    1. Niu, L.,
    2. Y. Guo,
    3. Y. Li,
    4. C. Wang,
    5. Q. Hu,
    6. L. Fan,
    7. L. Wang, and
    8. N. Yang
    . 2021. Degradation of river ecological quality in Tibet Plateau with overgrazing: A quantitative assessment using biotic integrity index improved by random forest. Ecological Indicators 120:106948. https://doi.org/10.1016/j.ecolind.2020.106948.
    OpenUrl
  34. ↵
    1. Oludolapo, O.A.,
    2. A.A. Jimoh, and
    3. P.A. Kholopane
    . 2012. Comparing performance of MLPNN and RBFNN neural network models for predicting South Africa’s energy consumption. Journal of Energy in Southern Africa 23:40-46. https://doi.org/10.17159/2413-3051/2012/V23I3A3171.
    OpenUrl
  35. ↵
    1. Paul, G.C.,
    2. S. Saha, and
    3. T.K. Hembram
    . 2019. Application of the GIS-based probabilistic models for mapping the flood susceptibility in Bansloi sub-basin of Ganga-Bhagirathi River and their comparison. Remote Sensing in Earth System Science 2(2-3):120-146. https://doi.org/10.1007/s41976-019-00018-6.
    OpenUrl
  36. ↵
    1. Poesen, J.
    2018. Soil erosion in the anthropocene: Research needs. Earth Surface Processes and Landforms 43:64-84. https://doi.org/10.1002/esp.4250.
    OpenUrl
  37. ↵
    1. Pourghasemi, H.R., and
    2. M. Rossi
    . 2017a. Landslide susceptibility modeling in a landslide prone area in Mazandarn Province, north of Iran: A comparison between GLM, GAM, MARS, and M-AHP methods. Theoretical and Applied Climatology 130(1-2):609-633. https://doi.org/10.1007/s00704-016-1919-2.
    OpenUrl
  38. ↵
    1. Pourghasemi, H.R.,
    2. N. Sadhasivam,
    3. N. Kariminejad, and
    4. A.L. Collins
    . 2020. Gully erosion spatial modeling: Role of machine learning algorithms in selection of the best controlling factors and modeling process. Geoscience Frontiers 11:2207-2219. https://doi.org/10.1016/j.gsf.2020.03.005.
    OpenUrl
  39. ↵
    1. Pourghasemi, H.R.,
    2. S. Yousefi,
    3. A. Kornejady, and
    4. A. Cerd
    . 2017b. Performance assessment of individual and ensemble data-mining techniques for gully erosion modeling. Science of the Total Environment 609:764-775. https://doi.org/10.1016/j.scitotenv.2017.07.198.
    OpenUrl
  40. ↵
    1. Rahmati, O.,
    2. A. Haghizadeh,
    3. H.R. Pourghasemi, and
    4. F. Noormohamadi
    . 2016. Gully erosion susceptibility mapping: The role of GIS-based bivariate statistical models and their comparison. Natural Hazards 82:1231-1258. https://doi.org/10.1007/s11069-016-2239-7.
    OpenUrl
  41. ↵
    1. Rouhani, H.,
    2. A. Fathabadi, and
    3. J. Baartman
    . 2021. A wrapper feature selection approach for efficient modelling of gully erosion susceptibility mapping. Progress in Physical Geography: Earth and Environment 45(4). https://doi.org/10.1177/0309133320979897.
  42. ↵
    1. J.K. Rout,
    2. M. Rout, and
    3. H. Das
    1. Roy, P.,
    2. R. Chakrabortty,
    3. I. Chowdhuri,
    4. S. Malik,
    5. B. Das, and
    6. S.C. Pal
    . 2020. Development of different machine learning ensemble classifier for gully erosion susceptibility in Gandheswari watershed of West Bengal, India. In Machine Learning for Intelligent Decision Science, Algorithms for Intelligent Systems, ed. J.K. Rout, M. Rout, and H. Das, 1-26. Singapore: Springer Singapore. https://doi.org/10.1007/978-981-15-3689-2_1.
  43. ↵
    1. Saha, S.
    2017. Groundwater potential mapping using analytical hierarchical process: A study on Md. Bazar Block of Birbhum District, West Bengal. Spatial Information Research 25(4):615-626. https://doi.org/10.1007/s41324-017-0127-1.
    OpenUrl
  44. ↵
    1. Soleimanpour, S.M.,
    2. H.R. Pourghasemi, and
    3. M. Zare
    . 2021. A comparative assessment of gully erosion spatial predictive modeling using statistical and machine learning models. Catena 207:105679. https://doi.org/10.1016/j.catena.2021.105679.
    OpenUrl
  45. ↵
    1. Stojić, A.,
    2. N. Stanić,
    3. G. Vuković,
    4. S. Stanišić,
    5. M. Perišić,
    6. A. Šoštarić, and
    7. L. Lazić
    . 2019. Explainable extreme gradient boosting tree-based prediction of toluene, ethylbenzene and xylene wet deposition. Science of the Total Environment 653:140-147. https://doi.org/10.1016/j.scitotenv.2018.10.368.
    OpenUrl
  46. ↵
    1. Sun, N.,
    2. W. Zhu, and
    3. Q. Cheng
    . 2018. GF-1 and Landsat observed a 40-year wetland spatiotemporal variation and its coupled environmental factors in Yangtze River estuary. Estuarine Coastal and Shelf Science 207:30-39. https://doi.org/10.1016/j.ecss.2018.03.022.
    OpenUrl
  47. ↵
    1. Tang, H.,
    2. X. Meng,
    3. X. Su,
    4. T. Ma,
    5. H. Liu,
    6. Y. Bao,
    7. M. Zhang,
    8. X. Zhang, and
    9. H. Huo
    . 2021. Hyperspectral prediction on soil organic matter of different types using CARS algorithm. Transactions of the Chinese Society of Agricultural Engineering (CSAE) 37(2):105-113 (in Chinese with English abstract). https://doi.org/10.11975/j.issn.1002-6819.2021.2.013.
    OpenUrl
    1. Tang, L., and
    2. S. Na
    . 2021. Comparison of machine learning methods for ground settlement prediction with different tunneling datasets. Journal of Rock Mechanics and Geotechnical Engineering 13:1274-1289. https://doi.org/10.1016/j.jrmge.2021.08.006.
    OpenUrl
  48. ↵
    1. Teimouri, M., and
    2. A. Kornejady
    . 2019. The dilemma of determining the superiority of data mining models: Optimal sampling balance and end users’ perspectives matter. Bulletin of Engineering Geology and the Environment 79():1707-1720. https://doi.org/10.1007/s10064-019-01687-9.
    OpenUrl
  49. ↵
    1. Tien Bui, D.,
    2. B. Pradhan,
    3. H. Nampak,
    4. Q.T. Bui,
    5. Q.A. Tran, and
    6. Q.P. Nguyen
    . 2016. Hybrid artificial intelligence approach based on neural fuzzy inference model and metaheuristic optimization for flood susceptibility modeling in a high-frequency tropical cyclone area using GIS. Journal of Hydrology 540:317-330. https://doi.org/10.1016/j.jhydrol.2016.06.027.
    OpenUrl
  50. ↵
    1. Vapnik, V.
    1995. The Nature of Statistical Learning Theory. New York: Springer Science & Business Media.
  51. ↵
    1. Wen, Y.,
    2. T. Kasielke,
    3. H. Li,
    4. H. Zepp, and
    5. B. Zhang
    . 2021. A case study on history and rates of gully erosion in northeast China. Land degradation and Development 32:4254-4266. https://doi.org/10.1002/ldr.4031.
    OpenUrl
  52. ↵
    1. Yang, T.,
    2. A.A. Asanjan,
    3. M. Faridzad,
    4. N. Hayatbini,
    5. X. Gao, and
    6. S. Sorooshian
    . 2017. An enhanced artificial neural network with a shuffled complex evolutionary global optimization with principal component analysis. Information Sciences 418-419:302-316. https://doi.org/10.1016/j.ins.2017.08.003.
    OpenUrl
  53. ↵
    1. Yang, A.,
    2. C. Wang,
    3. G. Pang,
    4. Y. Long,
    5. L. Wang,
    6. R.M. Cruse, and
    7. Q. Yang
    . 2021. Gully erosion susceptibility mapping in highly complex terrain using machine learning models. ISPRS International Journal of Geo-Information 10:680. https://doi.org/10.3390/ijgi10100680.
    OpenUrl
  54. ↵
    1. Zabihi, M.,
    2. F. Mirchooli,
    3. A. Motevalli,
    4. A.K. Darvishan,
    5. H.R. Pourghasemi,
    6. M.A. Zakeri, and
    7. F. Sadighi
    . 2018. Spatial modelling of gully erosion in Mazandaran Province, northern Iran. Catena 161:1-13. https://doi.org/10.1016/j.catena.2017.10.010.
    OpenUrl
  55. ↵
    1. H.R. Pourghasemi and
    2. M. Rossi
    1. Zabihi, M.,
    2. H.R. Pourghasemi,
    3. A. Motevalli, and
    4. M.A. Zakeri
    . 2019. Gully erosion modeling using GIS-based data mining techniques in northern Iran: A comparison between boosted regression tree and multivariate adaptive regression spline. In Natural Hazards GIS-Based Spatial Modeling Using Data Mining Techniques, ed. H.R. Pourghasemi and M. Rossi, 1-26. Cham: Springer. https://doi.org/10.1007/978-3-319-73383-8_1.
  56. ↵
    1. Zhang, K.L., and
    2. H.Y. Liu
    . 2018. Research progresses and prospects on freeze-thaw erosion in the black soil region of northeast China. Science of Soil and Water Conservation 16(1):17-24. https://doi.org/10.13254/j.jare.2017.0328.
    OpenUrl
  57. ↵
    1. Zhang, Y.,
    2. B. Sui,
    3. H. Shen, and
    4. L. Ouyang
    . 2019. Mapping stocks of soil total nitrogen using remote sensing data: A comparison of random forest models with different predictors. Computers and Electronics in Agriculture 160:23-30. https://doi.org/10.1016/j.compag.2019.03.015.
    OpenUrl
    1. Zhang, H.,
    2. P. Wu,
    3. A. Yin,
    4. X. Yang,
    5. M. Zhang, and
    6. C. Gao
    . 2017. Prediction of soil organic carbon in an intensively managed reclamation zone of eastern China: A comparison of multiple linear regressions and the random forest model. Science of the Total Environment 592:704-713. https://doi.org/10.1016/j.scitotenv.2017.02.146.
    OpenUrl
  58. ↵
    1. Zhang, Y.,
    2. C. Xia,
    3. X. Zhang,
    4. X. Cheng,
    5. G. Feng,
    6. Y. Wang, and
    7. Q. Gao
    . 2021. Estimating the maize biomass by crop height and narrowband vegetation indices derived from UAV-based hyperspectral images. Ecological Indicators 129:107985. https://doi.org/10.1016/j.ecolind.2021.107985.
    OpenUrl
  59. ↵
    1. Zhao, J.,
    2. M. Vanmaercke,
    3. L. Chen, and
    4. G. Govers
    . 2016. Vegetation cover and topography rather than human disturbance control gully density and sediment production on the Chinese Loess Plateau. Geomorphology 274:92-105. https://doi.org/10.1016/j.geomorph.2016.09.022.
    OpenUrl
PreviousNext
Back to top

In this issue

Journal of Soil and Water Conservation: 78 (5)
Journal of Soil and Water Conservation
Vol. 78, Issue 5
September/October 2023
  • Table of Contents
  • Table of Contents (PDF)
  • About the Cover
  • Index by author
  • Front Matter (PDF)
Print
Download PDF
Article Alerts
Sign In to Email Alerts with your Email Address
Email Article

Thank you for your interest in spreading the word on Journal of Soil and Water Conservation.

NOTE: We only request your email address so that the person you are recommending the page to knows that you wanted them to see it, and that it is not junk mail. We do not capture any email address.

Enter multiple addresses on separate lines or separate them with commas.
Gully erosion susceptibility prediction in Mollisols using machine learning models
(Your Name) has sent you a message from Journal of Soil and Water Conservation
(Your Name) thought you would like to see the Journal of Soil and Water Conservation web site.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
5 + 2 =
Solve this simple math problem and enter the result. E.g. for 1+3, enter 4.
Citation Tools
Gully erosion susceptibility prediction in Mollisols using machine learning models
Y. Wang, Y. Zhang, H. Chen
Journal of Soil and Water Conservation Sep 2023, 78 (5) 385-396; DOI: 10.2489/jswc.2023.00019

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
Request Permissions
Share
Gully erosion susceptibility prediction in Mollisols using machine learning models
Y. Wang, Y. Zhang, H. Chen
Journal of Soil and Water Conservation Sep 2023, 78 (5) 385-396; DOI: 10.2489/jswc.2023.00019
Reddit logo Twitter logo Facebook logo Mendeley logo
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Jump to section

  • Article
    • Abstract
    • Introduction
    • Materials and Methods
    • Results and Discussion
    • Summary and Conclusions
    • Acknowledgements
    • References
  • Figures & Data
  • Info & Metrics
  • References
  • PDF

Related Articles

  • No related articles found.
  • Google Scholar

Cited By...

  • No citing articles found.
  • Google Scholar

More in this TOC Section

  • Effects of economic and climatic factors on agricultural water use in arid regions
  • Sediment delivery ratios from forest road networks and harvesting operational features by physiographic region in the southeastern United States
Show more Research Section

Similar Articles

Keywords

  • controlling factors
  • gully erosion susceptibility
  • machine learning
  • Mollisols
  • relative importance

Content

  • Current Issue
  • Early Online
  • Archive
  • Subject Collections

Info For

  • Authors
  • Reviewers
  • Subscribers
  • Advertisers

Customer Service

  • Subscriptions
  • Permissions and Reprints
  • Terms of Use
  • Privacy

SWCS

  • Membership
  • Publications
  • Meetings and Events
  • Conservation Career Center

© 2023 Soil and Water Conservation Society