Interdisciplinary AI Lecture Series, No. 24: Representation-based Reinforcement Learning

Published: 2024-04-09

Speaker: Bo Dai

Assistant Professor

Georgia Institute of Technology

Host: Assistant Professor Yisen Wang (王奕森)

School of Intelligence Science and Technology, Peking University

Time: Wednesday, April 10, 2024, 14:00 - 15:00

Venue: Lecture Hall 2736, Science Building No. 2, Yanyuan Campus

Room 101, Teaching Building, New Yanyuan Campus

Tencent Meeting: 938-779-290

Title:

Representation-based Reinforcement Learning

Abstract:

Most reinforcement learning (RL) algorithms are categorized as either model-free or model-based, depending on whether the algorithm learns a world model. However, both categories have their own issues, especially when combined with function approximation: exploration with arbitrary function approximation is difficult in model-free RL algorithms, while optimal planning becomes intractable in model-based RL algorithms with nonlinear neural world models.

In this talk, I will present our recent work on exploiting the power of representation in RL to bypass these difficulties while enjoying the best of both worlds. Specifically, we designed practical algorithms for extracting useful representations from the world model, with the goal of improving statistical and computational efficiency in the exploration vs. exploitation tradeoff, as well as empirical performance in RL. We provide a rigorous theoretical analysis of our algorithms and demonstrate superior practical performance over existing state-of-the-art empirical algorithms on several benchmarks.

About the speaker:

Bo Dai is an assistant professor at Georgia Tech and a staff research scientist at Google DeepMind. He obtained his Ph.D. from Georgia Tech. His research interests lie in embodied AI with generative models. He is the recipient of best paper awards at AISTATS and a NeurIPS workshop. He regularly serves as an area chair or senior program committee member at major AI/ML conferences such as ICML, NeurIPS, AISTATS, and ICLR.

