طراحي ‌كنترل‌كننده مرزي تطبيقي مبتني بر يادگيري تقويتي عميق براي ماهواره‌هاي انعطاف‌پذير

مقطع تحصيلي

دكتري

رشته تحصيلي

مهندسي مكانيك- طراحي كاربردي شاخه تخصصي مكانيك جامدات

دانشكده

فني و مهندسي

تاريخ دفاع

1404/06/05

صفحه شمار

163 ص.

استاد راهنما

مهدي مرتضوي , مريم ملك‌زاده

كليدواژه فارسي

ماهواره‌هاي انعطاف‌پذير , كنترل مرزي تطبيقي , كنترل ارتعاشات , يادگيري تقويتي عميق , خرابي عملگر , محدوديت عملگر , رديابي وضعيت , شبكه عصبي پايه شعاعي , شبيه‌ساز انعطاف‌پذير

چكيده فارسي

در اين رساله، مسئله كنترل همزمان وضعيت و ارتعاشات ماهواره‌هاي انعطاف‌پذير با استفاده از رويكرد كنترل مرزي و يادگيري تقويتي عميق، مورد بررسي قرار مي‌گيرد. ماهواره انعطاف‌پذير داراي پنل‌هاي خورشيدي با ساختار لانه‌زنبوري مي‌باشد. ابتدا، معادلات ديناميكي حاكم با استفاده از اصل هميلتون و حساب تغييرات استخراج مي‌گردد. سپس با طراحي يك تابع لياپانوف نوآورانه و استفاده مستقيم از معادلات ديفرانسيل جزئي حاكم بر سيستم، كنترل‌كننده مرزي طراحي و متعاقباً با استفاده از اصل توسعه‌يافته نامتغير لاسال، پايداري مجانبي اثبات مي‌گردد. پس از آن، عملكرد كنترل‌كننده با گسسته‌سازي معادلات حاكم بر سيستم به روش مود فرضي و انجام شبيه‌سازي عددي مورد بررسي قرار مي‌گيرد. نتايج شبيه‌سازي در سناريوهاي مختلف از جمله كنترل فعال و غيرفعال، همگي بر دستيابي به كنترل همزمان وضعيت و ارتعاشات ماهواره انعطاف‌پذير، دلالت دارند. در ادامه، به‌ منظور غلبه بر محدوديت‌هاي مدل‌سازي و چالش‌هاي عملي نظير اغتشاشات خارجي و نامعيني‌ها، روش يادگيري تقويتي عميق به كار گرفته مي‌شود. تنظيم خودكار بهره‌هاي كنترل‌كننده مرزي با استفاده از يك شبكه عصبي مبتني بر مدل نقاد-بازيگر و تكنيك تفاوت زماني انجام مي‌شود. متعاقباً اثبات پايداري مجانبي سيستم حلقه بسته ناخودگردان، با استفاده از لم باربالت و با لحاظ اشباع عملگر بررسي مي‌شود. نتايج شبيه‌سازي‌ عددي در سناريوهاي مختلف، از جمله رديابي زاويه وضعيت در حضور اغتشاشات و نامعيني‌ها، نشان دادند كه روش پيشنهادي، به‌طور مؤثري مقاومت و عملكرد كنترل‌كننده را در برابر اغتشاشات و نامعيني‌ها بهبود مي‌دهد. علاوه بر آن، آموزش كنترل‌كننده منجر به كاهش زمان نشست و تلاش كنترلي گرديد. در پايان، با طراحي و ساخت فيكسچر و پنل، شبيه‌ساز ماهواره صلب مستقر در دانشگاه به شبيه‌ساز ماهواره انعطاف‌پذير ارتقا مي‌يابد. ابتدا معادلات ديناميكي حاكم بر شبيه‌ساز استخراج مي‌شوند. پس از آن پياده‌سازي كنترل‌كننده مرزي در دو سناريو مختلف، شامل كنترل‌كننده مرزي با بهره‌هاي ثابت و كنترل‌كننده مرزي تطبيقي مبتني بر يادگيري تقويتي عميق با آموزش برخط، انجام مي‌شود. در پايان، پياده‌سازي نقصان عملگر با استفاده از قانون كنترل مرزي تطبيقي مورد بررسي قرار مي‌گيرد. بررسي نتايج پياده‌سازي و مقايسه با نتايج شبيه‌سازي عددي، منجر به تأييد صحت معادلات ديناميكي استخراج شده و اعتبارسنجي بخش تئوري گرديد. ضمناً كسب نتايج دقيق پياده‌سازي، تاكيدي بر كاربردي بودن نوآوري‌ها و اهداف رساله مي‌باشد. اين پژوهش با تركيب كنترل مرزي و يادگيري تقويتي عميق، روش نويني براي كنترل وضعيت و ارتعاشات ماهواره‌هاي انعطاف‌پذير ارائه مي‌دهد و گامي مؤثر در افزايش پايداري و كارايي سيستم‌هاي فضايي برداشته است.

كليدواژه لاتين

Flexible Satellites , Adaptive Boundary Control , Vibrations Control , Deep Reinforcement Learning (DRL) , Actuator Fault , Actuator Saturation , Attitude Tracking , Radial Basis Function Neural Network (RBFNN) , Flexible Satellite Simulator.

عنوان لاتين

Design of Adaptive Boundary Controller Based on Deep Reinforcement Learning for Flexible Satellites

گروه آموزشي

مهندسي مكانيك

چكيده لاتين

In this dissertation, the issue of simultaneous attitude an‎d vibration control of flexible satellites is addressed using a boundary control approach combined with deep reinforcement learning. The flexible satellite under consideration is equipped with honeycomb-structured solar panels. First, the governing dynamic equations are derived using Hamiltonʹs principle an‎d the calculus of variations. Subsequently, an innovative Lyapunov function is designed, an‎d the boundary controller is developed by directly applying the governing partial differential equations of the system. The asymptotic stability of the system is then proven using the extended LaSalle’s invariance principle. Following this, the controllerʹs performance is eva‎luated by discretizing the governing equations of the system using the assumed modes method an‎d performing numerical simulations. Simulation results in various scenarios, including active an‎d passive control, all indicate the achievement of simultaneous attitude an‎d vibration control for the flexible satellite. Furthermore, to overcome modeling limitations an‎d practical challenges such as external disturbances an‎d uncertainties, deep reinforcement learning is utilized. The automatic tuning of the boundary controller’s gains is carried out using an actor-critic neural network combined with temporal-difference learning techniques. The asymptotic stability of nonautonomous system with actuator saturation is then verified using the Barbalatʹs lemma. Numerical simulation results, including attitude angle tracking in the presence of disturbances an‎d uncertainties, demonstrate that the proposed method effectively enhances the robustness an‎d performance of the controller against disturbances an‎d modeling errors. Additionally, the training of the controller results in a reduction of settling time an‎d control effort. Finally, by designing an‎d constructing a fixture an‎d panel, the rigid satellite simulator at the university is upgraded to a flexible satellite simulator. The dynamic equations governing the simulator are first derived, an‎d then the implementation of the boundary controller in two different scenarios is carried out, one with fixed controller gains an‎d the other with an adaptive boundary controller based on deep reinforcement learning with online training. Lastly, the implementation of actuator failure is examined using the adaptive boundary control law. The results of the implementation are compared with numerical simulation results, confirming the accuracy of the extracted dynamic equations an‎d validating the theoretical aspects. Furthermore, obtaining accurate implementation results emphasizes the practical applicability of the innovations an‎d objectives of the dissertation. This research presents a novel approach for the attitude an‎d vibration control of flexible satellites by combining boundary control an‎d deep reinforcement learning, representing a significant step forward in enhancing the stability an‎d performance of space systems.

تعداد فصل ها

فهرست مطالب pdf

144967

نويسنده

عاملي، شكراله

لينک به اين مدرک

https://lib.ui.ac.ir/dl/search/default.aspx?Term=24968&Field=0&DTC=3