روش‌هاي جنگل تصادفي براي تحليل داده‌هاي وابسته فضايي

شماره ركورد
25215
شماره راهنما
STA2 298
نويسنده
علي بابايي، مبينا
عنوان
روش‌هاي جنگل تصادفي براي تحليل داده‌هاي وابسته فضايي
مقطع تحصيلي
كارشناسي ارشد
رشته تحصيلي
آمار اقتصادي
دانشكده
رياضي و آمار
تاريخ دفاع
1404/07/30
صفحه شمار
167 ص .
استاد راهنما
نصراله ايران پناه
استاد مشاور
هوشنگ طالبي حبيب آبادي
كليدواژه فارسي
پيشگويي قضايي كريگينگ , رگرسيون وزني جغرافيايي , جنگل تصادفي , درون يابي فضايي جنگل تصادفي , جنگل تصادفي جغرافيايي , جنگل تصادفي وزني جغرافيايي
چكيده فارسي
در مطالعات مختلف، داده‌هايي كه مورد استفاده قرار مي‌گيرند، معمولاً به‌صورت مستقل از يكديگر نيستند و وابستگي فضايي ميان داده‌ها وجود دارد. اين وابستگي به دليل موقعيت جغرافيايي و مكان قرار گرفتن داده‌ها در فضاي مورد مطالعه شكل مي‌گيرد. چنين داده‌هايي كه در آن‌ها روابط مكاني و فضايي اهميت دارد، تحت عنوان داده‌هاي فضايي شناخته مي‌شوند. تحليل اين نوع داده‌ها به دليل وجود ناهمگوني و همبستگي فضايي، چالش‌هاي خاصي را به همراه دارد كه نيازمند استفاده از روش‌هاي پيشرفته تحليل داده است. با افزايش كاربرد داده‌هاي فضايي و سنجش از دور ماهواره‌اي، محدوديت‌هاي روش‌هاي سنتي در مقابل با وابستگي فضايي آشكار شده است. اخيراً الگوريتم‌هاي يادگيري ماشين، به ويژه جنگل تصادفي به‌عنوان جايگزين‌ براي پيش‌بيني فضايي مطرح شده‌اند. با اين حال، روش‌هاي استاندارد آن‌ها معمولاً فاقد بعد فضايي بوده و اغلب همبستگي فضايي موجود در داده‌هاي جغرافيايي را ناديده مي‌گيرند. در اين پايان‌نامه براي تحليل وابستگي ميان داده‌هاي فضايي روش‌هاي مختلفي، ارائه شده است. هدف پژوهش حاضر توسعه و ارزيابي روش‌هاي مبتني بر جنگل تصادفي براي تحليل داده‌هاي وابسته فضايي است. در اين راستا، چندين رويكرد جديد مورد بررسي قرار مي‌گيرند. در ابتدا روش درون‌يابي فضايي جنگل تصادفي كه مبتني بر جنگل تصادفي است، معرفي و عملكرد اين روش در تحليل داده‌هاي شبيه‌سازي و همچنين داده‌هاي بارش و دما بررسي و با ساير روش‌هاي درون‌يابي مانند نزديك‌ترين همسايه، نقشه سطح روند، وزن‌دهي معكوس فاصله، كريگينگ معمولي، جنگل تصادفي استاندارد و پيش‌بيني‌ فضايي جنگل تصادفي مقايسه مي‌شود. در ادامه براي تحليل وابستگي فضايي يك مدل از جنگل تصادفي با عنوان جنگل تصادفي جغرافيايي ارائه مي‌شود كه هم به‌عنوان ابزار پيش‌بيني و هم به‌عنوان ابزار اكتشافي براي مدل‌سازي جمعيت بر اساس متغيرهاي سنجش از دور عمل مي‌كند. در نهايت توسيع مدل جنگل تصادفي جغرافيايي با نام جنگل تصادفي وزني جغرافيايي معرفي مي‌شود كه با اعمال وزن‌دهي فضايي به داده‌ها دقت پيش‌بيني را افزايش مي‌دهد. چالش‌هاي مرتبط با انتخاب پهناي باند و بهينه‌سازي آن با معيارهاي اندازه دقت نيز مطرح مي‌شوند. همچنين، عملكرد جنگل تصادفي وزني جغرافيايي با ساير روش‌هاي ارائه شده مانند رگرسيون وزني جغرافيايي، جنگل تصادفي جغرافيايي، جنگل تصادفي و حداقل توان دوم معمولي مقايسه خواهد شد.
كليدواژه لاتين
Kriging Spatial Predictor , Geographically Weighted Regression , Random Forest , Random Forest Spatial Interpolation , Geographical Random Forest , Geographically Weighted Random Forest
عنوان لاتين
Random Forest Methods for the Analysis of Spatially Dependent Data
گروه آموزشي
آمار
چكيده لاتين
In many studies, the data under investigation are not mutually independent but instead exhibit spatial dependence, arising from the geographic locations of the observations. Such data, where spatial relationships are of central importance, are referred to as spatial data. The analysis of spatial data poses unique challenges due to spatial heterogeneity an‎d autocorrelation, which necessitate the use of advanced analytical methods. With the growing availability of spatial an‎d satellite remote sensing data, the limitations of traditional methods in han‎dling spatial dependence have become increasingly evident. Recently, machine learning algorithms particularly ran‎dom forest have been proposed as powerful alternatives for spatial prediction. However, stan‎dard implementations often lack a spatial component an‎d tend to ignore the inherent spatial correlation present in geographic data. This dissertation introduces an‎d eva‎luates a series of ran‎dom forest approaches for analyzing spatially dependent data. At first, the ran‎dom forest spatial interpolation method is introduced, an‎d its performance is eva‎luated using both simulated datasets an‎d real-world precipitation an‎d temperature data, an‎d is compared with alternative interpolation methods such as nearest neighbor, trend surface analysis, inverse distance weighting, ordinary kriging, stan‎dard ran‎dom forest, an‎d spatial ran‎dom forest. Subsequently, a model of the ran‎dom forest, termed the geographical ran‎dom forest, is proposed for spatial dependence analysis. This method serves both as a predictive tool an‎d as an exploratory approach for modeling population distribution based on remote sensing variables. Finally, the extension of the geographic ran‎dom forest model is presented as geographic weighted ran‎dom forest, which increases the prediction accuracy by applying spatial weighting to the data. Also, the importance of spatial weighting of observations in local models is examined an‎d demonstrated. The challenges related to ban‎dwidth selec‎tion an‎d its optimization with accuracy size criteria are also raised. The performance of the geographic weighted ran‎dom forest will also be compared with other presented methods such as geographic weighted regression, geographical ran‎dom forest, ran‎dom forest an‎d ordinary least square.
تعداد فصل ها
4
فهرست مطالب pdf
148754
لينک به اين مدرک :
https://lib.ui.ac.ir/dL/search/default.aspx?Term=25215&Field=0&DTC=3

کلیه حقوق این اثر برای شرکت مهندسی ارتباطات پيام مشرق محفوظ می باشد

علي بابايي، مبينا

روش‌هاي جنگل تصادفي براي تحليل داده‌هاي وابسته فضايي