كنترل هيبريد نيرو/موقعيت مبتني بر يادگيري تقويتي سيستم رباتيكي

مقطع تحصيلي

دكتري

رشته تحصيلي

مهندسي مكانيك- طراحي كاربردي شاخه تخصصي ديناميك، كنترل و ارتعاشات

دانشكده

فني و مهندسي

تاريخ دفاع

1404.10.22

صفحه شمار

174 ص.

استاد راهنما

دكتر مريم ملك¬زاده , دكتر عليرضا آريائي

كليدواژه فارسي

كنترل هيبريد نيرو/موقعيت , يادگيري تقويتي , كنترل مود لغزشي زمان-ثابت , الگوريتم گراديان سياست قطعي عميق , ربات دلتا , جبران تأخير زماني , الگوريتم فراپيچش , افزونگي، , گشتاور فضاي پوچ , بات OpenManipulator-X

چكيده فارسي

اين رساله به طراحي و توسعه يك چارچوب كنترلي نوين و هوشمند براي كنترل هيبريد نيرو/موقعيت سيستم‌هاي رباتيكي با هدف غلبه بر چالش‌هاي اساسي كنترل¬كننده¬هاي كلاسيك، از جمله وابستگي به مدل دقيق، حساسيت به عدم قطعيت‌ها، اغتشاشات و تأخير زماني مي‌پردازد. چارچوب اصلي اين پژوهش، تركيب توانايي‌هاي يادگيري تقويتي به عنوان يك چارچوب سازگار و يادگيرنده با استحكام و مقاومت كنترل مود لغزشي پيشرفته است. در اين راستا، دو كنترل‌كننده هوشمند مجزا طراحي و ارائه شده است: 1. يك كنترل‌كننده هوشمند هيبريد نيرو/موقعيت: در اين كنترل‌كننده، از الگوريتم يادگيري تقويتيDDPG براي بهينه‌سازي برخط پارامترهاي يك كنترل‌كننده مود لغزشي زمان-ثابت استفاده شده است. هدف نهايي اين استراتژي، كاهش همزمان خطاي رديابي نيرو و موقعيت، حذف ريشه‌اي پديده چترينگ، و كاهش چشمگير تلاش كنترلي است، به گونه‌اي كه پايداري مقاوم سيستم در برابر عدم قطعيت‌هاي پارامتري و اغتشاشات خارجي به طور همزمان تضمين گردد. 2.يك كنترل‌كننده هوشمند نيمه مدل-آزاد براي ربات سري داراي افزونگي با تأخير زماني: اين كنترل‌كننده نوين از تركيب كنترل مود لغزشي فراپيچش و يادگيري تقويتي براي يادگيري بهينه گشتاور فضاي پوچ بهره مي‌برد. مهم‌ترين ويژگي اين كنترل‌كننده، عدم نياز به جداسازي صريح و آسيب‌پذير فضاي كنترل نيرو و موقعيت است. اين كنترل‌كننده علاوه بر دستيابي به اهدافي چون رديابي دقيق، حذف چترينگ و كاهش تلاش كنترلي، به طور خاص براي مقابله با تأخيرهاي زماني در سيستم‌هاي عملگري و ارتباطي طراحي شده و پايداري سيستم را در حضور اين تأخيرها تضمين مي‌كند. بهره‌گيري از فضاي پوچ، امكان بهينه‌سازي اهداف ثانويه‌اي مانند كاهش مصرف انرژي (تلاش كنترلي) را نيز فراهم مي‌آورد. اعتبارسنجي عملكرد برتر اين كنترل‌كننده¬ها شبيه‌سازي‌هاي گسترده تحت حالات مختلف و مقايسه با روش‌هاي كلاسيك به طور كامل انجام شده است. در كنترل¬كننده پيشنهادي اول ربات دلتا به عنوان نمونه مطالعاتي انتخاب گرديد و در كنترل¬كننده پيشنهادي دوم ربات سري داراي افزونگي OpenManipulator-X به عنوان نمونه مطالعاتي بررسي گرديد. نتايج به وضوح نشان مي‌دهند كه چارچوب پيشنهادي، گامي مؤثر در جهت ايجاد سيستم‌هاي رباتيكي خود-سازگار، مقاوم و كارآمد براي انجام وظايف تعاملي پيچيده در محيط‌هاي واقعي است. همچنين مشابه كنترل¬كننده پيشنهادي دوم يك كنترل¬كننده نيمه مدل-آزاد براي كنترل وضعيت شبيه¬ساز ماهواره پياده¬سازي صورت گرفت و نتايج كارآمدي و مقاومت اين كنترل¬كننده را نشان داد.

كليدواژه لاتين

Keywords: Hybrid force/position control , reinforcement learning , fixed-time sliding mode control , Deep Deterministic Policy Gradient (DDPG , Delta robot , time-delay compensation, , super-twisting algorithm , redundancy, null-space torque , OpenManipulator-X.

عنوان لاتين

Reinforcement Learning-based Hybrid force/position Control of Robotic System

گروه آموزشي

مهندسي مكانيك

چكيده لاتين

This dissertation presents the design an‎d development of a novel intelligent control framework for hybrid force/position control of robotic systems, aiming to overcome the fundamental limitations of classical controllers, including reliance on precise models, sensitivity to uncertainties an‎d disturbances, an‎d the undesirable phenomenon of chattering. The central paradigm of this research lies in integrating the adaptability an‎d learning capabilities of reinforcement learning with the robustness of advanced sliding mode control. In this context, two distinct intelligent controllers are designed an‎d introduced: 1. A hybrid controller for a Delta parallel robot: This controller employs the Deep Deterministic Policy Gradient (DDPG) reinforcement learning algorithm to online optimize the parameters of a fixed-time sliding mode controller. The ultimate goal of this strategy is the simultaneous reduction of force an‎d position tracking errors, complete elimination of chattering, an‎d significant reduction of control effort, while guaranteeing robust system stability against parametric uncertainties an‎d external disturbances. 2.A semi model-free controller for a redundant serial robot (OpenManipulator-X): This novel controller combines super-twisting sliding mode control with reinforcement learning to optimally learn the null-space torque. Its most notable feature is the elimination of the explicit an‎d fragile separation of force an‎d position control subspaces. In addition to achieving accurate tracking, chattering elimination, an‎d reduced control effort, this controller is specifically designed to address time delays in actuation an‎d communication, thereby ensuring system stability under such conditions. Furthermore, by exploiting the null space, it enables optimization of secondary objectives such as energy efficiency (control effort minimization). The superior performance of the proposed controllers is validated through extensive simulations under various scenarios an‎d comprehensive comparisons with classical methods. The results clearly demonstrate that the proposed framework constitutes a significant step toward the development of self-adaptive, robust, an‎d efficient robotic systems capable of performing complex interactive tasks in real-world environments

تعداد فصل ها

فهرست مطالب pdf

153429

نويسنده

رادان كاشاني، مجتبي

لينک به اين مدرک

https://lib.ui.ac.ir/dl/search/default.aspx?Term=25552&Field=0&DTC=3