چكيده لاتين
Precipitation, as one of the primary components of the hydrological and environmental cycles, plays a crucial role in providing freshwater resources, preserving ecosystems, and ensuring food security. However, accurately estimating the spatial and temporal distribution of precipitation, particularly in arid and mountainous regions, remains a significant challenge in meteorological and hydrological studies due to the limitations of ground observational data. While ground station data are considered the most reliable source of precipitation information, their limited spatial coverage, temporal discontinuity, and insufficient distribution in many regions fail to meet the extensive requirements of water resource management. On the other hand, satellite data, especially precipitation products such as IMERG, CHIRPS, PERSIANN_CDR, and ERA5, have gained prominence as valuable alternatives due to their wide coverage and provision of continuous data at regular intervals. Nevertheless, the coarse spatial resolution and systematic errors of these products underscore the need to integrate multiple precipitation datasets rather than relying on a single product, as well as to perform downscaling while accounting for environmental parameters.
This study, focusing on precipitation data for Iran in 2021, proposes a hybrid framework based on machine learning to enhance the accuracy and spatial resolution of precipitation data. In the first phase, after evaluating and correcting biases in satellite precipitation products, all satellite datasets were resampled using three different methods to standardize their spatial resolution and prepare them for integration. Subsequently, the integrated precipitation product was generated using the Random Forest model, which effectively combined satellite and ground station data into a unified dataset. This integration successfully merged the advantages of both data sources while mitigating their respective limitations.
In the second phase, the generated product was downscaled using a Convolutional Neural Network (CNN). This network incorporated not only the integrated precipitation product but also environmental parameters such as temperature, water vapor, and elevation as inputs. The inclusion of these variables allowed for a more precise consideration of topographic effects, atmospheric conditions, and other environmental influences on precipitation in the final product.
Finally, the generated products were validated using ground synoptic station data, with evaluation metrics such as correlation, bias, and RMSE. The best-performing precipitation product exhibited an RMSE of 16.51 mm and a correlation coefficient of 0.89, confirming the improved accuracy of the model and its applicability.
The results of this research demonstrate that the proposed framework effectively reduces the limitations of both ground-based and satellite data, producing precipitation datasets with higher accuracy and spatial resolution. These products have broad potential applications in fields such as flood forecasting, water resource management, agricultural planning, and climate studies.