Abstract

Soil water content is a critical environmental parameter in research and practice, though various technological and contextual constraints limit its estimation in arid areas with vegetation cover. This study combined the multitemporal remote sensing data of Sentinel-1 and Landsat 8 to conduct an inversion study on surface soil water content under low vegetation cover in Nagqu, central Tibetan Plateau. Four vegetation indices (NDVI, ARVI, EVI, and RVI) were extracted from optical remote sensing data. A water cloud model was used to eliminate the influence of the vegetation layer on the backscattering coefficient associated with vegetation cover, and a predictive model suitable for the Nagqu area was constructed. The water cloud model effectively incorporated a vegetation index instead of vegetation water content. We found that VV polarization was more suitable for soil water content inversion than VH polarization. Among the four vegetation indices, the soil water content inversion model constructed with RVI under VV polarization had the best fit (R2 = 0.8212; RMSE = 6.30). The second-best fit was observed for vegetation water content-NDVI (R2 = 0.8201). The soil water content inversion models all had an R2 > 0.6, regardless of the vegetation index used, though the RVI had the best fitting effect, indicating that this vegetation index is highly applicable in the water cloud model, as a substitute for vegetation water content, and is expected to perform well in similar study sites.

1. Introduction

Soil water content (soil moisture) is not only a basic condition for plant growth but is also an important environmental parameter in the fields of ecology, hydrology, agriculture, and climate change and a useful indicator for drought monitoring and crop yield estimation, among others. It is one of the most important parameters for characterizing ground surfaces [1, 2]. Therefore, large-scale monitoring of soil water content is not only an important part of agricultural research and the evaluation of environmental factors but also of great importance for developing regional and global climate mitigation strategies and predicting regional dry and wet conditions.

Traditional soil water content monitoring is mainly conducted using established specialized monitoring or meteorological stations. Although this method has high accuracy, it is limited by a small detection range, low turnover, and data representativeness and requires considerable resources [35]. With the maturation of satellite remote sensing technology, multisource remote sensing can generate large-scale soil water content estimates with high spatial and temporal resolution, facilitating the acquisition of dynamic real-time information and compensating for the shortcomings of traditional monitoring methods [6].

Techniques for soil water content inversion using remote sensing are mainly categorized into optical remote sensing inversion, microwave remote sensing inversion, and collaborative inversion [79]. Optical remote sensing inversion is performed by extracting sensitive information related to soil water content from optical remote sensing images. According to the different extraction approaches, it can be divided into the reflectivity, index, and thermal inertia methods [10]. Optical remote sensing cannot directly provide spectral information for soil in densely vegetated areas and is mainly applicable to the inversion of bare surfaces. Compared with optical remote sensing, microwave remote sensing, categorized as active or passive according to the working method of the sensor, has the advantages of not being affected by weather, strong penetration, and all-weather capabilities [1113]. Passive microwave remote sensing inverts the soil water content using a prediction model established with soil radiation temperature (obtained using the microwave radiometer) and measured soil data [1416]. Active microwave remote sensing, which is less affected by interference, is based on the mathematical relationship between the backscattering coefficient and soil water content, among which the linear regression model is the most commonly used [1719]. The radar backscattering coefficient of active microwave remote sensing is mainly affected by the soil dielectric constant, surface roughness, and vegetation cover. In the vegetation-covered area, the scattering and attenuation effect of the vegetation layer on the radar backscatter echo will reduce the sensitivity of the backscatter coefficients to the change in soil water content, thus affecting the accuracy of the inversion. Therefore, the effect of the vegetation cover on the backscatter coefficients needs to be eliminated.

Vegetation index can provide information about vegetation growth status and cover, so vegetation index can be used instead of vegetation cover. At the same time, the relationship between different types of vegetation index and soil water content may be different. Therefore, understanding the applicability of vegetation index is helpful to better select the soil water content inversion model suitable for specific regions and environments, so as to improve the accuracy and reliability of soil water content inversion. Alternatively, the sensitivity of optical remote sensing to vegetation information can be used to extract relevant vegetation canopy information, and the influence of the vegetation layer can be eliminated by water cloud [20] or MIMICS models [21], which enable collaborative inversion with microwave remote sensing and improve the inversion accuracy of soil moisture in vegetation-covered areas. For example, Liu et al. [22] combined Sentinel-1A and -2A images, used the water cloud model to remove the influence of the vegetation layer, and then estimated soil moisture in the farmland area using machine learning. Rawat et al. [23] estimated soil moisture by combining the soil’s dielectric constant properties with the radar backscattering coefficient obtained using the SA schemes of RISAT-1 data. Su and Cao [24] used full-polarization data to invert soil moisture using AIEM, water cloud, and MIMICS models according to land use types and vegetation in different rocky desert areas.

Our study locality, Nagqu City, Tibet, is located at high altitudes with a cold and dry climate, making drought an important factor affecting agricultural production. An effective inversion method can improve the monitoring of soil water content and enable the dissemination of this crucial information to producers at all levels within this region. In this study, multitemporal remote sensing images with a large time span were used for collaborative inversion of soil water content. Given the unique context of this study locality, we explored the applicability of the vegetation index instead of vegetation water content to further improve drought detection capabilities in this region. .

2. Materials and Methods

2.1. Study Area

The Nagqu soil temperature and humidity observation network in the central Qinghai-Tibet Plateau is located within a 100 × 100 km area (91.5°–92.5°E, 31°-32°N, Figure 1) at an average altitude of 4650 m above sea level. Most of the selected sites were located in the central and northern parts of Nagqu County, and a few were located southeast of Amdo County and southwest of Nyainrong County. It belongs to the high-altitude subcold climate zone, where the total annual hours of sunshine are 2852.6–2881.7, annual average temperature is −0.9 to −3.3°C, annual average relative humidity is 48–51%, and the annual average precipitation is 380 mm. Most of the study area is located in the pure summer and autumn pastoral area of Nagqu. The vegetation type is a typical alpine meadow belt, mainly composed of wormwood meadows. The grass layer has a height of 1–5 cm and total annual growth days of 150–170 d, indicating that the withering period is longer than the growing period and vegetation cover is generally low.

2.2. Data Sources
2.2.1. Remote Sensing Data

Sentinel-1, a follow-up satellite of ERS-2 and Envisat, is an important component of the European Space Agency’s Copernicus Program (previously known as GMES) and integral to the Earth observation program. The constellation comprises Sentinel-1A and Sentinel-1B satellites equipped with a C-band synthetic aperture radar (SAR) at an orbital altitude of 693 km. The observation period of a single satellite is 12 d, and the combined observation of two satellites is 6 d. Sentinel-1 has four imaging modes, SMStripmap (SM), interferometric wide swath (IW), extrawide swath (EW), and wave mode (WV). The GRD data in the IW mode were selected for this study. The obtained SAR image comprised L1 data with a spatial resolution of 5 × 20 m, with VV and VH dual polarization (Table 1). The downloaded image data were subjected to orbit correction, thermal noise removal, radiometric calibration, multiview, filtering, terrain correction, and decibel processing in the ESA SNAP. We used Refined Lee filtering to obtain the total backscattering coefficient of the radar.

Landsat 8 is an Earth observation satellite jointly launched by the National Aeronautics and Space Administration (NASA) and the United States Geological Survey (USGS) on February 11, 2013, carrying two sensors, a land imager (OLI) and a thermal infrared sensor (TIRS). The OLI includes nine bands and the TIRS includes two thermal infrared bands. Optical images from the same day as the SAR data were downloaded from the USGS website (https://earthexplorer.usgs.gov). Landsat 8 L2 SP T1-level OLI data were used to calculate the normalized difference vegetation index (NDVI), enhanced vegetation index (EVI), ratio vegetation index (RVI), and atmospheric impedance index (ARVI) in ENVI 5.6 to obtain the corresponding grid image (Table 2).

To promote the accuracy of the model, we selected optical and microwave remote sensing images for the same date; the cloud content of the optical remote sensing images was 5%.

2.2.2. Soil Water Content Data

The soil water content data were obtained from the multiscale observation network of soil temperature and moisture on the central Tibetan Plateau provided by the National Tibetan Plateau Data Center [25, 26], which mainly considers soil water content in the surface layer (0–5 cm) at 57 monitoring stations. By screening and eliminating abnormal data from six image datasets, our final dataset considers data for 178 monitoring points for the construction and verification of the remote sensing image inversion model.

2.3. Analytical Approaches
2.3.1. Quantification of Soil Water Content

Soil water content is usually expressed as soil weight water content, soil volumetric water content, and relative soil water content. Soil weight water content refers to the weight ratio of soil water to dried soil. The soil volumetric water content refers to the volumetric ratio of soil water per unit of soil. Relative soil water content refers to the percentage of soil water content relative to field capacity.

The soil water content data obtained in this study reflected soil weight water content; therefore, we converted the data to soil volumetric water content using the following formula:where is the soil volumetric water content (%), is the soil weight water content (%), and ρ is the soil bulk weight (g·cm−3).

2.3.2. Optical and Microwave Remote Sensing Collaborative Inversion Method

We observed a linear correlation between the soil backscattering coefficient and soil water content, according to which the soil water content inversion model can be established as follows:where is the soil volumetric water content (%), is the soil backscattering coefficient (dB), and a and b are the empirical parameters.

2.3.3. Water Cloud Model

The scattering and attenuation effects of the vegetation layer cause partial reflection of the signal received by the sensor. To eliminate the influence of the vegetation layer on the inversion of soil water content in areas with vegetation cover, we applied the water cloud model proposed by Baghdadi et al. [18], which is often used to invert soil water content in areas with low vegetation cover [27, 28]. The water cloud model divides the total received radar backscatter coefficient into two parts, the soil scattering coefficient through the vegetation and the vegetation scattering coefficient , and is modeled as

The collation can be deduced as

In equations (3)–(6), is the total radar backward scattering coefficient (dB), is the vegetation layer backward scattering coefficient (dB), is the soil backward scattering coefficient (dB), is the two-time attenuation factor of the vegetation scattering coefficient (%), is the vegetation water content (g·cm−3), is the radar incidence angle (°), and A and B are the empirical vegetation parameters.

As the study area is dominated by alpine meadows, the empirical parameters A and B were set to 0.0014 and 0.084, respectively, according to Bindish and Barros [29] (Table 3).

2.3.4. Estimation of Vegetation Water Content

Vegetation water content is an important parameter in water cloud models and can be inverted using optical remote sensing data based on three methods: spectral reflectivity, spectral index, and radiative transfer model methods. The spectral index method has a higher accuracy than the reflectivity method and is simpler than the radiative transfer method; therefore, it is more widely used [30].

The NDVI and normalized difference water index (NDWI) are commonly used to estimate vegetation water content using water cloud models. To explore the feasibility of different vegetation indices for characterizing the VWC in the water cloud model, we first calculated a reference vegetation water content estimate using the empirical relationship between the NDVI and VWC established by Jackson and Gao et al. [31, 32]. With the formula,

Second, the EVI, RVI, and atmospherically resistant vegetation index (ARVI) were directly substituted into the vegetation water content parameter in the water cloud model to invert the soil water content. Compared to the inversion, these vegetation indices were calculated as follows:

2.3.5. Accuracy Evaluation of Model

We evaluated the accuracy of the inversion model of soil water content using goodness of fit (R2) and root mean square error (RMSE), which can be calculated as follows:where is the inversion value of soil water content (%), is the mean soil water content (%), is the measured value of soil water content (%), and n is the number of samples used in verification.

R2 represents the fitting relationship between the inversion and measured values; the closer it is to 1, the better the fit of the model. RMSE represents the degree of deviation between the inversion and measured values; the smaller the value, the better the fit of the model.

In this study, the vegetation index and backscattering coefficient in satellite images of six time-phases are extracted, respectively, and the soil backscattering coefficient is solved by using the water cloud model to eliminate the influence of vegetation cover. There are a total of 178 available monitoring point data, each time-phase of about 30 data. Then, the empirical parameters a and b are solved by linear fitting the measured data of some monitoring points with the soil backscattering coefficient. Finally, the remaining measured data are brought into the constructed inversion model to detect its accuracy.

The technical workflow for inverting soil water content is shown in Figure 2.

3. Results and Discussion

3.1. Analysis of Soil Water Content Collaborative Inversion Results

Previous studies reported a correlation between the radar backscattering coefficient and soil water content [33]. Simultaneously, different polarization modes are affected by vegetation cover to varying degrees, resulting in different polarization modes and soil water content correlations [34, 35].

According to the above principles and processes, data from 100 monitoring points in the reference dataset were selected to construct the inversion model, and the remaining 78 data points were used to verify the accuracy of the model. The ratio of validation set to test set allocation is approximately 3 : 4. Firstly, the vegetation index and backscatter coefficient of 178 monitoring points were calculated by optical image and SAR, respectively, and the soil backscattering coefficient is extracted by using the water cloud model to eliminate the influence of vegetation cover. Then, the solved soil backscattering coefficient is linearly fitted with 100 measured data. The fitting results of 100 measured soil water content and soil backscattering coefficient under VV/VH dual polarization are shown in Figure 3. The fitting degree of vegetation water content and vegetation index on the soil backscattering coefficient and measured soil water content under VH polarization was lower than that under VV polarization (Figure 3). This indicates that the ability of microwaves to penetrate the vegetation layer under VV polarization is stronger than that under VH polarization, which is consistent with previous results. Meanwhile, the accuracy of the model constructed varied with different parameters for vegetation water content. Under VV polarization, the soil backscattering coefficients based on VWC-NDVI and RVI best fit with the measured soil water content data (R2 = 0.6727 and 0.6744, respectively). EVI corresponded with the second-best fit (R2 = 0.6323), while ARVI corresponded with the worst fit (R2 = 0.6066). Under VH polarization, the soil backscattering coefficient and soil water content based on VWC-NDVI had the best fitting effect (R2 = 0.2401), while ARVI had the worst fitting effect (R2 = 0.1624). The analysis showed that the fitting effect was better with the RVI than with the vegetation water content estimated by VWC-NDVI, indicating that RVI can directly replace the vegetation water content parameter in the model. The goodness of fit associated with EVI and ARVI was also above R2 > 0.6, indicating good model fit and applicability.

3.2. Verification of Soil Water Content Inversion Accuracy

The accuracy of the inversion model was verified using the dataset composed of the remaining 78 monitoring points based on goodness of fit (R2) and root mean square error (RMSE). The soil backscattering coefficients of 78 monitoring points were substituted into the above four linear models to invert the soil water content. Then, the inversion value of soil water content is linearly fitted with the measured value. We observed a good fit between the backscattering coefficients and soil water content after removing the influence of vegetation using VV and VH dual polarization (Figure 4), which is consistent with previous results [36]. Consistent with our findings in Section 3.1, we observed a better fit under VV polarization than under VH polarization. Under VV polarization, the RMSE did not vary much between models with different vegetation indices (the maximum was 6.84 for EVI and the minimum was 6.30 for RVI); however, we observed a large difference in R2 (the maximum value was 0.61 for RVI and the minimum value was 0.55 for VWC-NDVI and ARVI). Considering the degree of fitting and the need for accurate prediction of the inversion value, the linear regression model y = −1.0924x – 17.611 based on RVI in Figure 3(d) has the best predictive performance for soil water content estimation and is most suitable for the study area.

4. Conclusions

In this study, the NDVI, ARVI, EVI, and RVI were calculated using multitemporal data from Sentinel-1 GRD and Landsat 8. The influence of the vegetation layer was eliminated by using a water cloud model. Based on the model fit associated with the different vegetation indices, we constructed a predictive model for soil water content inversion in vegetation-covered areas and made the following conclusions:(1)Using Sentinel-1 and Landsat 8 remote sensing data for collaborative inversion, we found that the model fit and accuracy were better under VV polarization than those under VH polarization, proving that VV polarization was more suitable for soil moisture inversion in this area.(2)ARVI, EVI, and RVI were selected to replace the vegetation water content parameter in the inversion model, and all had R2 values >0.6, indicating a good fitting effect. RVI performed better than VWC-NDVI (R2 = 0.6744), suggesting that it can effectively replace vegetation water content in the water cloud model.(3)By calculating NDVI, ARVI, EVI, and RVI and using the corresponding empirical relationship formula to convert NDVI to VWC-NDVI as a reference, we found that using different vegetation parameters to invert or replace vegetation water content can improve the water cloud model performance and enhance the accuracy of soil water content inversion to different extents.

We effectively applied NDVI, ARVI, EVI, and RVI to the collaborative inversion of soil water content. RVI had the best fitting effect, which may be related to the hilly and mountainous surface characteristics of the study area in Nagqu. We therefore propose that RVI is especially applicable in mountainous study sites, where it can eliminate various terrain effects to improve inversion accuracy.

Note that there are some limitations to our study. The data provided by Sentinel-1 are in the VH dual polarization mode, while research on the HH polarization mode is lacking. Therefore, future studies should analyze the applicability of the remaining two polarization modes to obtain better inversion results. Although the vegetation layer and soil roughness greatly affect the radar backscattering coefficient, we only considered the effect of scattering from the vegetation layer on the inversion of soil water content. Therefore, subsequent studies could benefit from applying soil roughness to the inversion model, which may further improve model accuracy at our and comparable study sites.

Data Availability

The data that support the findings of this study are openly available in the National Tibetan Plateau Data Center at (https://doi.org/10.11888/Terre.tpdc.271918) and (https://doi.org/10.11888/Terre.tpdc.272408).

Conflicts of Interest

The authors declare that there are no conflicts of interest regarding the publication of this article.

Acknowledgments

This research was funded by the National Natural Science Foundation of China (Grant no. 42161067). The National Natural Science Foundation of China is from the Kunming University of Science and Technology, so the authors are very grateful to the school for the cultivation of our students. At the same time, the authors also appreciate the open source data provided by the National Tibetan Plateau Data Center and ASF so that the authors could carry out this study smoothly.