بهبود بهره‌وري الگوريتم‌هاي زمان‌بندي وظايف در محيط رايانش ابري با استفاده از الگوريتم‌ يادگيري تقويتي‌PPO

شماره ركورد
24113
شماره راهنما
COM2 664
نويسنده
غلامحسيني، رضا
عنوان
بهبود بهره‌وري الگوريتم‌هاي زمان‌بندي وظايف در محيط رايانش ابري با استفاده از الگوريتم‌ يادگيري تقويتي‌PPO
مقطع تحصيلي
كارشناسي ارشد
رشته تحصيلي
مهندسي كامپيوتر- معماري سيستم هاي كامپيوتري
دانشكده
مهندسي كامپيوتر
تاريخ دفاع
1403/06/19
صفحه شمار
107 ص.
استاد راهنما
محمدرضا خيام‌باشي
كليدواژه فارسي
زمان‌بندي وظايف , رايانش ابري , يادگيري تقويتي عميق , بهينه‌سازي سياست مجاورت , ترافيك متغير , زمان‌بندي تطبيقي
چكيده فارسي
در عصر حاضر، رايانش ابري به عنوان يك رويكرد جديد، با ارائه خدمات محاسباتي، ذخيره‌سازي و شبكه بر اساس تقاضا، مقياس‌پذيري و كارايي را براي كسب‌وكارها و سازمان‌ها به ارمغان آورده است. با اين حال، مديريت بهينه منابع محاسباتي در محيط‌هاي ابري، با توجه به پيچيدگي و حجم كاري روزافزون، به يك چالش اساسي تبديل شده است. مسئله زمان‌بندي وظايف، به عنوان يكي از مهم‌ترين جنبه‌هاي مديريت منابع، نقش كليدي در تضمين كارايي، كاهش هزينه‌ها و افزايش رضايت كاربران را دارد. در اين مسئله، وظايف مختلفي با تقاضاهاي محاسباتي و زمان‌هاي سررسيد متفاوت به سيستم وارد مي‌شوند و بايد به گونه‌اي زمان‌بندي شوند كه هزينه‌هاي ناشي از مصرف انرژي و تأخير در پردازش وظايف به حداقل برسد. اين چالش، به دليل ماهيت پويا و تصادفي ورود وظايف و همچنين محدوديت منابع محاسباتي، نيازمند توسعه الگوريتم‌هاي زمان‌بندي هوشمند و كارآمد است. در اين پژوهش، يك الگوريتم زمان‌بندي تطبيقي تحت عنوان AEDLDF-PPO براي حل مسئله زمان‌بندي وظايف در رايانش ابري پيشنهاد شده است. اين الگوريتم، با بهره‌گيري از نقاط قوت يادگيري تقويتي عميق و قوانين اولويت‌بندي، قادر است در شرايط مختلف ترافيكي، سياست‌هاي زمان‌بندي بهينه را بياموزد و هزينه‌هاي سيستم را به حداقل برساند. در اين راستا، ابتدا مسئله زمان‌بندي به عنوان يك برنامه پويا با احتمالات انتقال نامشخص فرموله شده و يك مدل رياضي جامع براي سيستم رايانش ابري ارائه شده است. سپس، با توجه به محدوديت‌هاي روش‌هاي سنتي زمان‌بندي و الگوريتم‌هاي يادگيري تقويتي عميق موجود، الگوريتم پيشنهادي AEDLDF-PPO معرفي شده است.اين الگوريتم شامل دو مرحله اصلي است: ماژول طبقه‌بندي ترافيك و ماژول زمان‌بندي وظايف تطبيقي. ماژول طبقه‌بندي ترافيك با استفاده از يك شبكه، سطح ترافيك فعلي سيستم را به عنوان سبك يا سنگين طبقه‌بندي مي‌كند و سپس ماژول زمان‌بندي وظايف تطبيقي كه شامل دو سياست زمان‌بندي EDLDF-PPO براي ترافيك سنگين، و يك سياست EDF براي ترافيك سبك است كه در هر مرحله، بر اساس خروجي ماژول طبقه‌بندي ترافيك، سياست مناسب براي زمان‌بندي وظايف را انتخاب مي‌كند. عملكرد الگوريتم پيشنهادي AEDLDF-PPO با استفاده از شبيه‌سازي‌هاي گسترده در محيط‌هاي رايانش ابري با شرايط ترافيكي گوناگون ارزيابي شده است. نتايج شبيه‌سازي‌ها نشان مي‌دهند كه اين الگوريتم در تمام سناريوهاي مورد بررسي، عملكرد بهتري نسبت به الگوريتم‌هاي زمان‌بندي LLF-PPO ، EDF-PPO و RR-PPO دارد. اين الگوريتم، با تطبيق هوشمندانه بين سياست‌هاي EDLDF-PPO و EDF براساس شرايط ترافيك، قادر است هزينه‌هاي كلي سيستم را به حداقل برساند. به طور كلي، نتايج اين پژوهش نشان مي‌دهد كه الگوريتم پيشنهادي AEDLDF-PPO يك رويكرد كارآمد و موثر براي زمان‌بندي وظايف در محيط‌هاي رايانش ابري است. اين امر مي‌تواند به بهبود كارايي، كاهش هزينه‌ها تا 14درصد نسبت به الگوريتم EDLDF-PPO در سيستم‌هاي رايانش ابري كمك كند و در نتيجه، به توسعه و گسترش اين فناوري‌ها كمك شاياني نمايد.
كليدواژه لاتين
Task scheduling , cloud computing , deep reinforcement learning , proximal policy optimization , variable traffic , adaptive scheduling
عنوان لاتين
Improving the Efficiency of Task Scheduling Algorithms in Cloud Computing Environments using PPO Reinforcement Learning Algorithm
گروه آموزشي
مهندسي معماري كامپيوتر
چكيده لاتين
In todayʹs era, cloud computing has emerged as a new approach, offering on-demand computing, storage, and networking services, bringing flexibility, scalability, and efficiency to businesses and organizations. However, the optimal management of computing resources in cloud environments, given the increasing complexity and workload, has become a major challenge. Task scheduling, as one of the most important aspects of resource management, plays a key role in ensuring efficiency, reducing costs, and increasing user satisfaction. In this problem, various tasks with different computational demands and deadlines enter the system and must be scheduled in a way that minimizes the costs associated with energy consumption and delays in task processing. This challenge, due to the dynamic and random nature of task arrivals and the limitations of computing resources, requires the development of intelligent and efficient scheduling algorithms. In this research, an adaptive scheduling algorithm called AEDLDF-PPO is proposed to solve the task scheduling problem in cloud computing. By leveraging the strengths of deep reinforcement learning and prioritization rules, this algorithm can learn optimal scheduling policies under various traffic conditions and minimize system costs. To this end, the scheduling problem is first formulated as a dynamic program with uncertain transition probabilities, and a comprehensive mathematical model for the cloud computing system is presented. Then, considering the limitations of traditional scheduling methods and existing deep reinforcement learning algorithms, the proposed AEDLDF-PPO algorithm is introduced. This algorithm consists of two main stages: the traffic classification module and the adaptive task scheduling module. The traffic classification module uses a network to classify the current system traffic level as light or heavy, and then the adaptive task scheduling module, which includes two EDLDF-PPO scheduling policies for heavy traffic and one EDF policy for light traffic, selects the appropriate policy for task scheduling at each stage based on the output of the traffic classification module. The performance of the proposed AEDLDF-PPO algorithm has been eva‎luated using extensive simulations in cloud computing environments with various traffic conditions. The simulation results show that this algorithm outperforms the LLF-PPO, EDF-PPO, and RR-PPO scheduling algorithms in all the scenarios considered. By intelligently adapting between the EDLDF-PPO and EDF policies based on traffic conditions, this algorithm can minimize overall system costs. In general, the results of this research show that the proposed AEDLDF-PPO algorithm is an efficient and effective approach for task scheduling in cloud computing environments. This can help improve efficiency and reduce costs by up to 14% compared to the baseline algorithm in cloud computing systems, and consequently, contribute significantly to the development and expansion of these technologies.
تعداد فصل ها
6
لينک به اين مدرک :
https://lib.ui.ac.ir/dL/search/default.aspx?Term=24113&Field=0&DTC=3

کلیه حقوق این اثر برای شرکت مهندسی ارتباطات پيام مشرق محفوظ می باشد

غلامحسيني، رضا

بهبود بهره‌وري الگوريتم‌هاي زمان‌بندي وظايف در محيط رايانش ابري با استفاده از الگوريتم‌ يادگيري تقويتي‌PPO