دانلود ترجمه مقاله روش یادگیری فازی Q برای هدایت کردن ربات خودکار

۱۳۹۷/۱۰/۱۴ بایگانی مطالب ثبت دیدگاه ارزشمند شما

دانلود ترجمه مقاله روش یادگیری فازی Q برای هدایت کردن ربات خودکار

خلاصه- الگوریتم پیشنهاد داده شده مزیت هایی را از روش منطق فازی جفت شده و یادگیری Q برای برآورده ساختن نیازهای مربوط به هدایت کردن خودکار دریافت کرده است. سپس قوانین فازی یک تصمیم گیری و یک چارچوب کاری قابل قبول را برای مدیریت کردن عدم اطمینان ها فراهم می سازد و همچنین اجازه استفاده کردن از دانش سلسله واری را نیز می دهد. ساختار دینامیکی مربوط به یادگیری Q آن را به صورت یک ابزار متعهد در آورده است که تنظیم کننده پارامترهای تداخل فازی در زمانی است که دانش کمی در این زمینه وجود دارد یا اصلا وجود ندارد که در این صورت در سراسر دنیا در دسترس می باشد. ربات در سر تا سر دنیا به صورت مجموعه ای از جفت ها با فعالیت حرکتی مدل شده است. برای هر وضعیت فازی شده، برخی از فعالیت های پیشنهاد داده شده وجود دارد. وضعیت ها مرتبط با فعالیت های منطبق آن ها از طریق قوانین فازی می باشد که خود براساس استدالال انسانی هستند. ربات فعالیت تحریک شده بیشتری را برای هر وضعیت از طریق آزمایش های آنلاین انتخاب می کند. کارایی مربوط به روش پیشنهاد داده شده از طریق آزمایش هایی براساس ربات شبیه سازی شده Khepera موجود می باشد.

کلمات کلیدی: یادگیری Q فازی- هدایت کردن خودکار- ربات Khepera

مقدمه

مستقل و خودکار بودن یکی از نکات ضروری برای ربات ها می باشد که به صورت فزاینده ای جایگزین و یا به عنوان همکاری برای انسان ها در خانه هایشان یا محل کارشان، یا در محیط های خطرناک و غیر قابل دسترس هستند. یک ربات خودکار باید به صورت هدفمند حرکت کند و کارهای خاصی را در محیط های ناشناخته انجام دهد که معمولا همراه با دینامیک های بدون پیش بینی هستند. به عنوان مثال می توان به هدف مطلوبی بدون تصادم با موانع دست یافت. یک سیستم ربات شامل برخی از واحدها با روابط متقابل اصلی است که عبارتند از: سنسورها، واحد پیش پردازش کننده، واحد تصمیم گیری، کنترل کننده ها، موتورها و بکار اندازنده ها. اگر چه همه این واحدها بر روی کارایی ربات تاثیر دارند اما واحد تصمیم گیری به عنوان مغز ربات نقش بسیار تعیین کننده تری را بازی می کند. چنان چه تمرکز این مطالعه بر روی واحد تصمیم گیری یک ربات چهار چرخه کوچک با سنسورهای ورودی می باشد که انتخاب شده است و بنابراین واحدهای دیگر برای تقاضای پردازش کردن سنگین به کار نمی روند. الگوریتم تصمیم گیری مستحکم برای هدایت کردن ربات به سمت هدف از پیش تعیین شده طراحی شده اند در حالی که از موانع دوری می کنند.

مجموعه ای فازی به صورت سلسله وار بر طبق قوانین توسط منطق و استدلال انسانی و بدون هیچ گونه پیش فرض در مورد دنیای کنونی طراحی شده است. مشارکت کردن مربوط به دانش ابتکاری برای طراحی کردن قوانین فازی خود ناشی از تعداد قوانین کمی باشد که در ابتدا منجر به اجرا شدن فرایند برنامه ریزی سریع و آسان می گردد. روش پیشنهاد داده شده مستقل از مدل جهانی یا دانش اولیه در مورد دنیا است. تنها منبع اطلاعاتی مربوط به داده های محلی توسط محدوده محدود شده ای از سنسورهای متداخل می باشد. ورودی های فازی به صورت خروجی های سنسورهای متداخل است در حالی که خروجی های فازی سرعت ربات و زاویه هدایت آن می باشد. فازی شدن مربوط به داده های سنسوری اندازه گیری شده برای مدیریت کردن موثر عدم اطمینان ها، ابهامات یا اطلاعات مختل موثر می باشد. ربات، دنیا را نسبت به تعداد کمی از وضعیت های سنسوری فازی شده مدل می کند. برای هر وضعیت فازی شده، برخی از فعالیت های پیشنهاد داده شده وجود دارد. وضعیت ها مرتبط با فعالیت های منطبق آن ها توسط قوانین فازی و براساس استدلال انسانی است. میزان سازی آنلاین برای استنتاج فازی یک سیستم تصمیم گیری انعطاف پذیر را فراهم می سازد که می تواند خودش را با محیط های ناشناخته سازگار کند. یادگیری Q یک ابزار محتمل برای میزان کردن پارامترهای استنتاج فازی می باشد که ساده، مدلی رایگان با ساختار دینامیکی است. به کار بردن اعضای خروجی فازی و یادگیری Q به صورت آنلاین از طریق روابط متقابل با دنیا میزان می گردد. آزمایش های شبیه سازی شده براساس ربات Khepera خود بیان کننده کارایی مربوط به روش پیشنهاد داده شده می باشد. بقیه مقاله به صورت زیر مورد بررسی قرار می گیرد: بخش دوم با مقالات مرتبط در این زمینه سر و کار دارد، بخش سوم توصیف مختصری را در مورد ربات Khepera فراهم می سازد. روش پیشنهاد داده شده یادگیری فازی Q در بخش چهارم توصیف شده است. نتایج شبیه سازی در بخش پنجم نشان داده شده است و سرانجام نتیجه گیری مقاله در بخش ششم بیان شده است.

A Fuzzy Q-Learning Approach to Navigation
of an Autonomous Robot
Sepideh Valiollahi
Department of Electrical and
Computer Engineering
Babol University of Technology
Babol, Iran
s.valiollahi@stu.nit.ac.ir
Reza Ghaderi
Department of Electrical and
Computer Engineering
Babol University of Technology
Babol, Iran
r_ghaderi@nit.ac.ir
Ataollah Ebrahimzadeh
Department of Electrical and
Computer Engineering
Babol University of Technology
Babol, Iran
e_zadeh@nit.ac.ir
Abstract—The proposed algorithm takes advantage of coupling
fuzzy logic and Q-learning to fulfill requirements of
autonomous navigations. Fuzzy if-then rules provide a reliable
decision making framework to handle uncertainties, and also
allow incorporation of heuristic knowledge. Dynamic structure
of Q-learning makes it a promising tool to adjust fuzzy
inference parameters when little or no prior knowledge is
available about the world. To robot, the world is modeled into
a set of state-action pairs. For each fuzzified state, there are
some suggested actions. States are related to their
corresponding actions via fuzzy if-then rules based on human
reasoning. The robot selects the most encouraged action for
each state through online experiences. Efficiency of the
proposed method is validated through experiments on a
simulated Khepera robot.

Keywords-fuzzy
Khepera robot

Q-learning;

autonomous

navigation;

I. INTRODUCTION
Autonomy is a necessity of the robots that are
increasingly replacing/cooperating human in their homes
and workspaces, or hazardous and unreachable
environments. An autonomous robot should move
purposefully and carry out specific tasks in unknown
environments usually with unpredictable dynamics. An
instance could be reaching a desired goal without colliding
with obstacles. A robot system comprises some main
interacting units: sensors, preprocessing unit, decision
making unit, controllers, motors, and actuators [1, 2].
Although, all these units affect the robot performance,
decision making unit as the robot brain plays the most
significant role. As the focus of this study is on decision
making unit, a small wheeled robot with simple infrared
sensors is selected so that the other units do not demand
heavy processing [3]. A robust decision making algorithm is
designed to navigate the robot toward a predefined goal
while avoiding obstacles.
A set of heuristic fuzzy if-then rules is designed by
human reasoning and without any restrictive presumptions
about the world. Incorporation of heuristic knowledge to
design of fuzzy rules results in few number of rules, which
in turn leads to a fast and easy implementation of planning
process. The proposed method is independent of a global
model or prior knowledge about the world. The only source
of information comes from the locally sensed data by
limited range infrared sensors. Fuzzy inputs are outputs of
infrared sensors while fuzzy outputs are the robot speed and
steering angle. Fuzzification of measured sensory data helps
an efficient handling of uncertain, imprecise, or noisy
information [2]. The robot models the world into few
fuzzified sensory states. For each fuzzified state, there are
some suggested actions. States are related to their
corresponding actions by fuzzy if-then rules based on
human reasoning. Online tuning of the fuzzy inference
provides a flexible decision making system, which could
adapt itself to unknown environments. Q-learning is a
promising tool to tune fuzzy inference parameters due to its
simple, model free, and dynamic structure [4]. Applying
Q-learning, fuzzy output memberships are tuned online and
through interactions with the world. Simulated experiments
on Khepera robot validated efficiency of the proposed
method. The rest of the paper is reviewed as follow: Section
II deals with related literature, Section III provides a brief
description about the Khepera robot. The proposed fuzzy Qlearning method is explained in section IV. Simulation
results are depicted in section V, and finally the paper is
concluded in section VI.
II. RELATED LITERATURE
The earliest methods for autonomous robot
navigation were based on hierarchical architecture usually
composed of four main layers: Sensing, Modeling,
Planning, and Acting [1, 5, 6]. The robot builds a global
model of the world using prior knowledge or sensory
information, tries to plan an optimal path, and passes it to
the execution layer. This routine continues until reaching the
ultimate goal. After sensing stage and during the rest of the
process, there is no feedback from the environment.
Obviously, such methods do not work when prior
knowledge about the world is unavailable, incomplete, or
unreliable. Furthermore, unpredictable dynamic of real
world environments plus their inherent uncertainties put
hierarchal methods in pitfall of out of date models which
lead to inadequate actions [2].

این فایل ورد (word) ترجمه در 14 صفحه و فایل اصلی لاتین pdf مقاله در 6 صفحه به خدمتتون ارائه میشود.

پایگاه علمی سعید سان دانلود رایگان مقالات علمی کنفرانس و ژورنال

دانلود ترجمه مقاله روش یادگیری فازی Q برای هدایت کردن ربات خودکار