بهبود برنامه‌ريزي ماموريت براي وسايل‌نقليه زميني بدون سرنشين در زمين‌هاي ناهموار و ناشناخته

مقطع تحصيلي

كارشناسي ارشد

رشته تحصيلي

مهندسي كامپيوتر- معماري سيستم هاي كامپيوتري

دانشكده

مهندسي كامپيوتر

تاريخ دفاع

1404/10/30

صفحه شمار

81 ص .

استاد راهنما

علي بهلولي , كمال جمشيدي

كليدواژه فارسي

يادگيري تقويتي عميق , وسايل نقليه زميني بدون سرنشين , توجه چندسر , توجه متقابل , محيط ناشناخته و ناهموار

چكيده فارسي

ناوبري خودمختار وسايل نقليه زميني خودران در محيط‌هاي ناهموار و ناشناخته، به‌عنوان يكي از حوزه‌هاي كليدي رباتيك متحرك و رباتيك هوشمند، به‌دليل وجود عدم‌قطعيت‌هاي حسگري، تغييرپذيري شرايط محيطي و فقدان ساختار منظم، يكي از چالش‌هاي بنيادين در سامانه‌هاي خودمختار به‌شمار مي‌رود. در چنين محيط‌هايي، روش‌هاي كلاسيك برنامه‌ريزي مسير مبتني بر نقشه و حتي بسياري از رويكردهاي متداول يادگيري تقويتي عميق، به‌علت ناتواني در ادغام مؤثر داده‌هاي چندوجهي، حساسيت بالا به نويز و ضعف در مدل‌سازي وابستگي‌هاي زماني، از پايداري و كارايي لازم برخوردار نيستند. در اين پژوهش، يك چارچوب يادگيري تقويتي عميق توزيعي مبتني بر Rainbow DQN براي ناوبري محلي ايمن و پايدار وسايل نقليه زميني خودران رباتيك در زمين‌هاي ناهموار پيشنهاد مي‌شود. معماري ارائه‌شده با هدف استخراج نمايش‌هاي غني و معنادار از ورودي‌هاي حسگري ناهمگن، شامل اسكن‌هاي لايدار، اطلاعات اينرسي، زاويه نسبي نسبت به هدف، جابه‌جايي نسبي و نقشه ارتفاعي محلي، طراحي شده است. در اين راستا، از يك مكانيسم توجه چندسر براي مدل‌سازي تعامل ميان توكن‌هاي حسگري مختلف استفاده شده است تا عامل رباتيك قادر باشد به‌صورت تطبيقي بر اطلاعات كليدي مؤثر در تصميم‌گيري تمركز كند. به‌منظور بهبود ادراك فضايي زمين، يك ماژول توجه فضايي بر روي نقشه ارتفاع محيط به‌كار گرفته شده است كه امكان شناسايي نواحي پرخطر و ناهمواري‌هاي بحراني را براي سامانه ناوبري فراهم مي‌سازد. افزون بر اين، با استفاده از توجه غيرفضايي، اهميت نسبي ورودي‌هاي عددي و حسگري به‌صورت پويا تنظيم مي‌شود. براي همجوشاني مؤثر اطلاعات فضايي و غيرفضايي، يك مكانيسم توجه متقابل ميان نمايش نقشه ارتفاع و ساير ورودي‌هاي حسگري طراحي شده است كه موجب افزايش انسجام تصميم‌گيري و كاهش وابستگي سامانه به ورودي‌هاي پرنويز مي‌گردد. به‌منظور مدل‌سازي وابستگي‌هاي زماني و مواجهه با مسئله مشاهده ناقص در ناوبري رباتيك، خروجي ماژول‌هاي توجه به يك شبكه LSTM با پنجره زماني محدود وارد مي‌شود. همچنين، با بهره‌گيري از يادگيري توزيعي مبتني بر C51 و بازپخش تجربه اولويت‌دار، پايداري يادگيري و دقت تخمين بازده مورد انتظار به‌طور قابل‌توجهي بهبود يافته است. ارزيابي روش پيشنهادي در محيط شبيه‌سازي‌شده Gazebo و در چارچوب ROS انجام شده و نتايج حاصل نشان مي‌دهد كه معماري ارائه‌شده در مقايسه با روش پايه و نسخه‌هاي فاقد ماژول‌هاي توجه، عملكرد برتري از نظر پايداري يادگيري، كاهش نرخ برخورد با موانع، بهبود كيفيت مسير و افزايش ايمني حركت وسايل نقليه زميني خودران ارائه مي‌دهد. علاوه بر اين، تحليل كيفي نقشه‌هاي توجه فضايي و غيرفضايي، تفسيرپذيري تصميم‌هاي عامل رباتيك را افزايش داده و نشان مي‌دهد كه مدل قادر است به‌صورت معنادار بر ويژگي‌هاي بحراني زمين تمركز نمايد. نتايج اين پژوهش بيانگر آن است كه ادغام مكانيسم‌هاي توجه چندسر و توجه متقابل در يادگيري تقويتي عميق، مي‌تواند نقش مؤثري در توسعه سامانه‌هاي ناوبري هوشمند براي محيط‌هاي خارج‌ازجاده ايفا كند.

كليدواژه لاتين

Deep Reinforcement Learning , Unmanned Ground Vehicles , Multi-head Attention , Cross-Attention , Rough an‎d Unknown Terrains

عنوان لاتين

Enhancing Mission Planning for Unmanned Ground Vehicles in Rough an‎d unknown Terrains

گروه آموزشي

مهندسي معماري كامپيوتر

چكيده لاتين

Autonomous navigation of autonomous ground vehicles, as a key area of mobile robotics an‎d intelligent robotics, is considered one of the fundamental challenges in autonomous systems due to the complexity of terrain geometry, the presence of sensory uncertainties, the variability of environmental conditions, an‎d the lack of regular structure. In such environments, classical map-based path planning methods, as well as many conventional deep reinforcement learning approaches, suffer from insufficient stability an‎d efficiency owing to their inability to effectively integrate multimodal sensory data, their high sensitivity to noise, an‎d their limited capability in modeling temporal dependencies. In this research, a distributional deep reinforcement learning framework based on Rainbow DQN is proposed for safe an‎d stable local navigation of robotic autonomous ground vehicles in rough terrains. The proposed architecture is designed to extract rich an‎d meaningful representations from heterogeneous sensory inputs, including LiDAR scans, inertial measurements, relative goal angle, relative displacement, an‎d a local elevation map. In this context, a Multi-Head Attention mechanism is employed to model interactions among different sensory tokens, enabling the robotic agent to adaptively focus on the most informative features relevant to decision-making. To enhance spatial perception of the terrain, a spatial attention module is applied to the environmental elevation map, allowing the navigation system to identify hazardous regions an‎d critical terrain irregularities. Furthermore, through the use of non-spatial attention, the relative importance of numerical an‎d sensory inputs is dynamically adjusted. For effective fusion of spatial an‎d non-spatial information, a Cross-Attention mechanism is designed between the elevation map representation an‎d other sensory inputs, which improves decision-making coherence an‎d reduces the system’s reliance on noisy observations. To model temporal dependencies an‎d address the problem of partial observability in robotic navigation, the outputs of the attention modules are fed into an LSTM network with a limited temporal window. Moreover, by leveraging C51-based distributional learning an‎d prioritized experience replay, learning stability an‎d the accuracy of expected return estimation are significantly improved. The proposed method is eva‎luated in the Gazebo simulation environment within the ROS framework, an‎d the results demonstrate that the presented architecture outperforms the baseline method an‎d attention-free variants in terms of learning stability, reduced collision rates, improved path quality, an‎d enhanced navigation safety of autonomous ground vehicles. In addition, qualitative analysis of spatial an‎d non-spatial attention maps increases the interpretability of the robotic agent’s decisions an‎d indicates that the model can meaningfully focus on critical terrain features. The findings of this research suggest that integrating multi-head an‎d cross-attention mechanisms into deep reinforcement learning can play an effective role in the development of intelligent navigation systems for off-road environments.

تعداد فصل ها

فهرست مطالب pdf

159084

نويسنده

دلاوري، فرزانه

لينک به اين مدرک

https://lib.ui.ac.ir/dl/search/default.aspx?Term=25851&Field=0&DTC=3