برنامه‌ريزي حركت و كنترل بازوي ربات با استفاده از يادگيري تقويتي عميق به‌منظور عدم برخورد با موانع

مقطع تحصيلي

كارشناسي ارشد

رشته تحصيلي

مهندسي برق - كنترل

دانشكده

فني و مهندسي

تاريخ دفاع

1404/06/26

صفحه شمار

116 ص .

استاد راهنما

محمد عطائي , محمد مطهري فر

استاد مشاور

نگين سياف

كليدواژه فارسي

بازوي رباتيك , برنامه‌ريزي مسير , يادگيري تقويتي عميق , ايمني

چكيده فارسي

اين پژوهش با هدف توسعه يك كنترل‌كننده هوشمند و ايمن براي برنامه‌ريزي مسير و اجتناب از موانع پويا در بازوهاي رباتيك طراحي و اجرا مي‌شود. در گام نخست، يك مسير پژوهشي جامع شامل انتخاب و ارزيابي مقايسه‌اي الگوريتم‌هاي يادگيري تقويتي پيگيري مي‌گردد. در اين ارزيابي، الگوريتم‌هاي DDPG،TD3 و SAC مورد مقايسه قرار مي‌گيرند و نتايج نشان مي‌دهد كه عملكرد SAC در وظايف پيچيده همچون كنترل همزمان موقعيت و جهت‌گيري و اجتناب از موانع پويا برتر است. علاوه بر اين، با به‌كارگيري تكنيك بازپخش تجربه با نگاه به گذشته، كارايي و نرخ موفقيت SAC و توانايي آن در سناريوهاي چالش‌برانگيز مورد بررسي قرار مي‌گيرد. با وجود اين، محدوديت‌هاي ذاتي سياست‌هاي يادگيري تقويتي در تعاملات فيزيكي نقد مي‌شود و يك معماري كنترلي هيبريدي بر پايه‌ي لايه‌اي از كنترل امپدانس به‌عنوان راهكاري براي افزايش ايمني و استواري سيستم ارائه مي‌گردد. اعتبارسنجي شبيه‌سازي‌ها نشان مي‌دهد كه اين معماري قادر است واكنش مناسبي در مواجهه با برخوردهاي ناخواسته ايجاد كند و همزمان وظايف اصلي را با دقت بالا به انجام برساند. در ادامه، ربات Franka Emika Panda با 7 درجه آزادي از صفر راه‌اندازي و آماده‌سازي مي‌شود تا بستر لازم براي پياده‌سازي عملي الگوريتم‌ها فراهم گردد. همچنين، به دليل محدوديت‌هاي موجود، الگوريتم اجتناب از مانع مبتني بر برنامه‌ريزي درجه دوم انتخاب مي‌شود و در محيط واقعي مورد ارزيابي قرار مي‌گيرد. نتايج عملي نشان مي‌دهد كه كنترل‌كننده پيشنهادي نه‌تنها در شرايط شبيه‌سازي، بلكه در مواجهه با چالش‌هاي سخت‌افزاري واقعي نيز پايداري، ايمني و كارايي مطلوبي را حفظ مي‌كند.

كليدواژه لاتين

Robotic Arm , Path Planning , Deep Reinforcement Learning , Safety

عنوان لاتين

Motion planning an‎d control of a robot manipulator using deep reinforcement learning to collision avoid obstacles

گروه آموزشي

مهندسي برق

چكيده لاتين

This research is designed an‎d conducted with the aim of developing an intelligent an‎d safe controller for path planning an‎d dynamic obstacle avoidance in robotic manipulators. In the first stage, a comprehensive research path is pursued, including the selec‎tion an‎d comparative eva‎luation of reinforcement learning algorithms. In this eva‎luation, the DDPG, TD3, an‎d SAC algorithms are compared, an‎d the results demonstrate that SAC outperforms the others in complex tasks such as simultaneous control of position an‎d orientation as well as dynamic obstacle avoidance. Furthermore, by employing the Hindsight Experience Replay (HER) technique, the efficiency an‎d success rate of SAC, along with its capability in challenging scenarios, are examined. Nevertheless, the inherent limitations of reinforcement learning policies in physical interactions are criticized, an‎d a hybrid control architecture based on an impedance control layer is proposed as a solution to enhance the safety an‎d robustness of the system. Simulation validation shows that this architecture can generate appropriate responses when facing unintended collisions while simultaneously accomplishing the primary tasks with high accuracy. Subsequently, the Franka Emika Pan‎da robot with 7 degrees of freedom is initialized an‎d set up from scratch to provide the foundation for the practical implementation of the algorithms. Moreover, due to existing constraints, an obstacle avoidance algorithm based on Quadratic Programming (QP) is selec‎ted an‎d eva‎luated in a real environment. Practical results indicate that the proposed controller not only maintains stability, safety, an‎d efficiency in simulation conditions but also preserves these properties when confronted with real-world hardware challenges.

تعداد فصل ها

فهرست مطالب pdf

145011

نويسنده

براتي، امير حسين

لينک به اين مدرک

https://lib.ui.ac.ir/dl/search/default.aspx?Term=24972&Field=0&DTC=3