Paper: Discrete-Time Deterministic Q-Learning: A Novel Convergence Analysis

Persian title: یادگیری Q زمان تعیین گسسته: تجزیه و تحلیل همگرایی جدید

Original title: Discrete-Time Deterministic Q-Learning: A Novel Convergence Analysis

Authors: Qinglai Wei; Frank L. Lewis; Qiuye Sun; Pengfei Yan; Ruizhuo Song

Pages: 13

Year of publication: 2017

Language: English

Abstract:

In this paper, a novel discrete-time deterministic Q-learning algorithm is developed. In each iteration of the developed Q-learning algorithm, the iterative Q function is updated for all the state and control spaces, instead of updating for a single state and a single control as in traditional Q-learning algorithms. A new convergence criterion is established to guarantee that the iterative Q function converges to the optimum, where the convergence criterion of the learning rates for traditional Q-learning algorithms is simplified. During the convergence analysis, the upper and lower bounds of the iterative Q function are analyzed to obtain the convergence criterion, instead of analyzing the iterative Q function itself. For convenience of analysis, the convergence properties for the undiscounted case of the deterministic Q-learning algorithm are first developed. Then, considering the discount factor, the convergence criterion for the discounted case is established. Neural networks are used to approximate the iterative Q function and compute the iterative control law, respectively, for facilitating the implementation of the deterministic Q-learning algorithm. Finally, simulation results and comparisons are given to illustrate the performance of the developed algorithm.
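To make the "update for all the state and control spaces" idea concrete, here is a minimal tabular sketch in Python. It is not the paper's implementation: the abstract describes neural-network approximation, while this example uses a lookup table on a hypothetical five-state toy system. The update form (a learning-rate-weighted blend of the old value and the one-step target), the function names, the dynamics, the utility, and the values of `gamma` and `alpha` are all assumptions chosen for illustration.

```python
def deterministic_q_learning(states, controls, dynamics, utility,
                             gamma=0.9, alpha=0.5, iterations=200):
    """Sweep-style Q iteration for a deterministic system (illustrative sketch).

    Each iteration updates Q for every (state, control) pair, rather than
    for a single sampled pair as in classical Q-learning.
    """
    # Start from Q_0 = 0 for every state-control pair (an assumed initialization).
    q = {(x, u): 0.0 for x in states for u in controls}
    for _ in range(iterations):
        new_q = {}
        for x in states:
            for u in controls:
                x_next = dynamics(x, u)
                # Costs are minimized here, so the best next value is a minimum.
                best_next = min(q[(x_next, v)] for v in controls)
                target = utility(x, u) + gamma * best_next
                # Blend the old value with the target using learning rate alpha.
                new_q[(x, u)] = (1.0 - alpha) * q[(x, u)] + alpha * target
        q = new_q  # Replace the whole table only after the full sweep.
    return q


def greedy_control(q, controls, x):
    """Return the control with the smallest Q value in state x."""
    return min(controls, key=lambda u: q[(x, u)])


# Hypothetical toy problem: states 0..4 on a line, goal at state 0.
STATES = list(range(5))
CONTROLS = (-1, 0, 1)


def toy_dynamics(x, u):
    # Deterministic transition, clipped to stay inside the state range.
    return max(0, min(4, x + u))


def toy_utility(x, u):
    # Quadratic cost in state and control; zero only at x = 0, u = 0.
    return float(x * x + u * u)


q_table = deterministic_q_learning(STATES, CONTROLS, toy_dynamics, toy_utility)
policy = {x: greedy_control(q_table, CONTROLS, x) for x in STATES}
```

On this toy system the greedy policy stays put at state 0 and steps toward it from every other state. A constant learning rate is used only for simplicity; the paper's actual convergence criterion on the learning rates is not reproduced here.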

Files available for download after purchase

discrete time deterministic q learning a novel convergence analysis_1623576670_49059_4145_1231.zip (2.25 MB)
