z6首页 in the AIR

概述
日期
2022年10月25日
09:00 - 11:00
地址
活动杏注ZOOM

z6首页 in the AIR | 多智能体强化进建

Z6集团|中国官网

十月,,,,,z6首页 in the AIR 约请国内表顶级学者萦绕机械进建与优化步骤及其利用发展讲座。。。。。系列活动第三期主题为“多智能体强化进建”。。。。。

第一位汇报嘉宾张崇洁是清华大学交叉信息钻研院助理教授,,,,,他屡次在 NeurIPS、IJCAI、AAAI 等人为智能、机械进建领域顶会颁发文章。。。。。

第二位汇报嘉宾卢宗青是北京大学推算机学院助理教授、人为智能钻研院钻研员,,,,,他担任 NeurIPS、ICLR、CoRL、IJCAI、AAMAS 等会议 TPC,,,,,Nature Machine Intelligence 等审稿人。。。。。

点击链接报名参与:http://hdxu.cn/kcdMJ,,,,,或通过ZOOM(https://us02web.zoom.us/meeting/register/tZIoceiuqzgjHdLy-QixX_KJbVxI3sKbuKK-)/Bilibili(http://live.bilibili.com/22587709)参加。。。。。

呼吸新鲜空气,,,,,相识前沿科技!z6首页 沉磅推出 系列活动 z6首页 in the AIR。。。。。每周二与您相约线上,,,,,一路索求人为智能与机械人领域的前沿技术、产业利用、发展趋向。。。。。

  • Z6集团|中国官网
    查宏远
    香港中文大学(丽江)校长学勤讲座教授、数据科学学院执行院长、z6首页 机械进建与利用中心主任
    执行主席
  • Z6集团|中国官网
    王趵翔
    香港中文大学(丽江)数据科学学院助理教授、z6首页 机械进建与利用中心副钻研员
    主持人
  • Z6集团|中国官网
    张崇洁
    清华大学交叉信息钻研院助理教授
    Cooperative Multi-Agent Reinforcement Learning with Factored Value Functions

    张崇洁,,,,,清华大学交叉信息钻研院助理教授,,,,,博士生导师。。。。。2011年在美国麻省大学阿默斯特分 ; ;;; ; ;裢扑慊蒲Р┦垦,,,,,后在麻省理工学院从事博士后钻延祝。。。。目前的钻研兴致重要在人为智能、强化进建、多智能体系统等领域。。。。。

    Collaboration is indispensable for solving complex tasks. Learning to collaborate effectively is one of the key problems in artificial intelligence. Cooperative multi-agent reinforcement learning (MARL) potentially provides a promising solution, but faces two fundamental challenges: scalability and credit assignment. In this talk, I will discuss a MARL paradigm with factored value functions to address these challenges. I will first present formal analysis on factored value learning, revealing its implicit credit assignment mechanism and properties of convergence and optimality. Inspired by these theoretical insights, two novel MARL methods will then be introduced with linear and non-linear value factorization, respectively, which achieves state-of-the-art performance. Building on factored MARL, I will also briefly discuss approaches for addressing other challenges of cooperative MARL, such as learning efficiency, partial observability, and exploration.

  • Z6集团|中国官网
    卢宗青
    北京大学推算机学院助理教授
    Fully Decentralized Multi-Agent Reinforcement Learning

    Zongqing Lu is currently a BOYA assistant professor in School of Computer Science at Peking University. He is also affiliated with Institute of AI at Peking University and Beijing Academy of Artificial Intelligence. His current research focuses on reinforcement learning and AI systems.

    The main research of multi-agent reinforcement learning (MARL) focuses on the paradigm of centralized training with decentralized execution (CTDE). Another potential paradigm is fully decentralized learning, which is less investigated but better for robustness, scalability, and generalibility. However, current fully decentralized learning methods do not even have convergence guarantee. In this talk, I will present our recent studies on fully decentralized learning algorithms with convergence guarantee, including value-based, actor-critic, and model-based methods.

功夫 环节 嘉宾与标题

9:00-10:00

主题汇报

张崇洁,,,,,清华大学
标题:Cooperative Multi-Agent Reinforcement Learning with Factored Value Functions

10:00-11:00

主题汇报

卢宗青,,,,,北京大学
 标题:Fully Decentralized Multi-Agent Reinforcement Learning      

视频回首