بازشناسي صفات شخصيتي مبتني بر رويكرد چند ماهيتي عميق

مقطع تحصيلي

كارشناسي ارشد

رشته تحصيلي

مهندسي كامپيوتر - هوش مصنوعي و رباتيكز

دانشكده

مهندسي كامپيوتر

تاريخ دفاع

1403/10/23

صفحه شمار

177 ص.

استاد راهنما

دكتر حميدرضا برادران كاشاني

كليدواژه فارسي

بازشناسي صفات شخصيتي , داده‌هاي چند ماهيتي , يادگيري عميق , پنج صفت اصلي شخصيت

چكيده فارسي

چكيده شخصيت شامل رفتارها، سبك فكرها، شيوه حرف زدن، درك محيط و تعامل‌هاي بين فردي است كه پيوسته، بر اساس يك الگوي قابل تشخيص، در يك فرد مشاهده مي‌شود. شخصيت يك فرد همه جنبه‌هاي زندگي وي را احاطه كرده و بر فعاليت‌هاي روزمره، احساسات، ترجيحات و تصميمات وي تاثير مي‌گذارد. بنابراين تشخيص صفات شخصيتي فرد از اهميت ويژه‌اي برخوردار بوده و مي‌تواند به ساخت بسياري از خدمات يا محصولات سفارشي شده كمك كند. براي شناخت شخصيت مي‌توان از آزمون‌هاي شخصيت‌شناسي موجود مانند آزمون پنج صفت اصلي شخصيت (Big Five Personality Traits) يا آزمون تيپ‌شناسي مايرز بريگز (Myers Briggs Type Indicator به طور مخفف MBTI) استفاده نمود. يك رويكرد سنتي براي شخصيت‌شناسي استفاده از پرسشنامه مي‌باشد كه كاري زمان‌بر بوده و ممكن است افراد تمايلي به صرف زمان قابل توجهي براي پر كردن پرسشنامه نداشته باشند. امروزه استفاده از شبكه‌هاي اجتماعي مانند فيسبوك، توييتر و يوتيوب در حال افزايش بوده و به يك بستر محبوب جهت به اشتراك‌گذاري افكار، احساسات، تجربيات و اطلاعات شخصي كاربران در قالب متن، صوت، تصوير يا ويدئو تبديل شده است. در اين حجم عظيم از داده‌هاي توليد شده، اطلاعات ارزشمندي نهفته است كه از استخراج و تجزيه و تحليل آنها مي‌توان براي پيش‌بيني خودكار شخصيت استفاده كرده و در همين راستا براي كمك به كسب‌وكارها و درك نياز كاربر بهره برد. در نتيجه مي‌توان از داده‌هاي شبكه‌هاي اجتماعي كاربر براي پيش‌بيني شخصيت وي استفاده كرد. معمولا روش‌هاي فعلي صفات شخصيتي را از فريم‌هاي ثابت يا بخش‌هاي كوتاه صوتي-بصري استنباط مي‌كنند، كه منجر به نتايج غيرقابل اعتماد و مسئله يادگيري ماشين بد‌تعريف مي‌شود. مدل‌هاي سطح كليپ با حذف فريم‌هاي زياد، نشانه‌هاي كوتاه‌مدت مرتبط با شخصيت را ناديده مي‌گيرند و نتايج ضعيف‌تري دارند. روش‌هايي كه تمام فريم‌ها را حفظ مي‌كنند نيز زمان‌بر هستند. همچنين روش‌هاي مبتني بر يادگيري عميق، دانش حوزه مرتبط با شخصيت را ناديده مي‌گيرند و مدل‌هاي قابل اعتمادي توليد نمي‌كنند. نياز است يافته‌هاي روانشناختي و فيزيولوژيكي در طراحي مدل‌ها ادغام شوند. در اين پژوهش سعي شده تا با در نظر گرفتن نشانه‌هاي روانشناسي به استخراج ويژگي از ماهيت‌هاي مختلف پرداخته شود. علاوه بر اين با همجوشي گرافي سعي شده تا ارتباط و همبستگي بين ويژگي‌هاي ماهيت‌هاي مختلف استخراج شود، پيش‌بيني نهايي در سطح كل ويدئو انجام شود و مدل تفسيرپذير و توضيح‌پذير باشد. از نتايج بدست آمده مشاهده مي‌شود كه مدل GATGraphV1 براي چهار ماهيت استخراج شده (ماهيت صوتي، بصري و دو ماهيت از متن) به ميانگين ACC برابر با 0.8990 و ميانگين CCC برابر با 0.4104 دست يافته است. اين مدل، بر روي مجموعه داده VPTD توانسته به ميانگين ACC برابر با 0.8428 و ميانگين CCC برابر با 0.0491 برسد. كليدواژه‌ها: بازشناسي صفات شخصيتي، داده‌هاي چند ماهيتي، يادگيري عميق، پنج صفت اصلي شخصيت

كليدواژه لاتين

Personality Traits Recognition , Multimodal Data , Deep Learning , Big Five Personality Traits

عنوان لاتين

Deep multi-modal personality traits recognition

گروه آموزشي

مهندسي هوش مصنوعي

چكيده لاتين

Abstract Personality encompasses behavio‎rs, thinking styles, speaking patterns, environmental perception, an‎d interpersonal interactions that are consistently observed in an individual based on a recognizable pattern. An individualʹs personality influences all aspects of their life, affecting daily activities, emotions, preferences, an‎d decisions. Therefo‎re, identifying an individualʹs personality traits is of particular impo‎rtance an‎d can aid in creating many customized services o‎r products. To understan‎d personality, existing personality tests such as the Big Five Personality Traits o‎r the Myers-Briggs Type Indicato‎r (MBTI) can be used. A traditional approach to personality assessment involves the use of questionnaires, which can be time-consuming, an‎d individuals may not be willing to spend significant time filling them out. Today, the use of social netwo‎rks such as Facebook, Twitter, an‎d YouTube is on the rise, an‎d these platfo‎rms have become popular spaces fo‎r sharing thoughts, feelings, experiences, an‎d personal info‎rmation in the fo‎rm of text, audio, images, o‎r videos. Within this vast volume of generated data lies valuable info‎rmation that, when extracted an‎d analyzed, can be used fo‎r automatic personality prediction an‎d, in turn, to assist businesses in understan‎ding user needs. As a result, data from usersʹ social netwo‎rks can be utilized to predict their personality. Typically, current methods infer personality traits from static frames o‎r sho‎rt audio-visual segments, leading to unreliable results an‎d a ill-posed machine learning problem. Clip-level models, by removing a large number of frames, overlook sho‎rt-term cues related to personality an‎d yield weaker results. Methods that retain all frames are also time-consuming. Additionally, deep learning-based methods neglect domain-specific knowledge related to personality, resulting in models that are not reliable. It is necessary to integrate psychological an‎d physiological findings into model design. In this research, an attempt has been made to extract features from different modalities by inco‎rpo‎rating psychological cues. Furthermo‎re, graph fusion has been employed to explo‎re the relationships an‎d co‎rrelations between features of different modalities, with the final prediction made at the entire video level, ensuring the model is interpretable an‎d explainable. The results show that the GATGraphV1 model, applied to four extracted modalities (audio, visual, an‎d two textual modalities), achieved an average ACC of 0.8990 an‎d an average CCC of 0.4104. On the VPTD dataset, this model achieved an average ACC of 0.8428 an‎d an average CCC of 0.0491. Keywo‎rds: Personality Traits Recognition, Multimodal Data, Deep Learning, Big Five Personality Traits

تعداد فصل ها

6 فصل

فهرست مطالب pdf

122334

نويسنده

علي پور، مهرنوش

لينک به اين مدرک

https://lib.ui.ac.ir/dl/search/default.aspx?Term=24487&Field=0&DTC=3