Artificial Intelligence Interdisciplinary Lecture Series, No. 27: From Deep Reinforcement Learning to LLM-based Agents: Perspectives on Current Research

Published: 2024-06-06

Speaker: Stefano V. Albrecht

Associate Professor

University of Edinburgh

Host: Prof. Yang Yaodong (杨耀东)

Time: June 14, 2024, 10:00-11:00

Venue: Multi-function Classroom, No. 79A Jingchunyuan, Peking University

Tencent Meeting: 186-375-430

Title:

From Deep Reinforcement Learning to LLM-based Agents: Perspectives on Current Research

Abstract:

Since the recent successes of large language models (LLMs), we are beginning to see a shift of attention from deep reinforcement learning to LLM-based agents. While deep RL policies are typically learned from scratch to maximise some defined return objective, LLM-agents use an existing LLM at their core and focus on clever prompt engineering and downstream specialisation of the LLM via supervised and reinforcement learning techniques. In this talk, he will first provide a broad overview of his group’s research in deep RL, which focuses, among other topics, on developing sample-efficient and robust RL algorithms for both single- and multi-agent control tasks, including industry applications in autonomous driving and multi-robot warehouses. He will then present the group’s recent research into LLM-agents, which proposes an approach for household robotics that takes user preferences into account to achieve more robust and effective planning. He will conclude with some personal observations about the state of LLM-agent research: (a) many papers in this field follow essentially the same recipe by focussing on prompt engineering and downstream specialisation; (b) this recipe makes their scientific claims brittle, as they depend crucially on the specific LLM engine; and (c) LLMs are not natively designed to maximise objectives for optimal control and decision making. Based on these observations, he will give insight into which fruitful research avenues can be identified.

Speaker biography:

Dr. Stefano V. Albrecht is Associate Professor in Artificial Intelligence in the School of Informatics, University of Edinburgh. He leads the Autonomous Agents Research Group (https://agents.inf.ed.ac.uk), which specialises in developing machine learning algorithms for autonomous systems control and decision making, with a particular focus on reinforcement learning and multi-agent interaction. In his roles as Royal Academy of Engineering and Royal Society Industrial Fellow, he actively develops industry applications in the areas of multi-robot warehouses with Dematic/KION, and autonomous driving with Five AI, which completed one of the most extensive urban road trials of autonomous driving in London before being acquired by Bosch in 2022. Dr. Albrecht is affiliated with the Alan Turing Institute, where he leads the Multi-Agent Systems theme. In 2022, he was nominated for the IJCAI Computers and Thought Award based on his research introducing Stochastic Bayesian Games and optimal solution algorithms, which have since been applied in a range of domains. Previously, Dr. Albrecht was a postdoctoral fellow at the University of Texas at Austin, working with Prof. Peter Stone. He obtained PhD and MSc degrees in Artificial Intelligence from the University of Edinburgh, and a BSc degree in Computer Science from the Technical University of Darmstadt. He is co-author of the new MIT Press textbook "Multi-Agent Reinforcement Learning: Foundations and Modern Approaches", which is freely available at www.marl-book.com.
