讨论班 | 学术交流研讨班(2021/6/21-2021/6/27)
本学期统计学习讨论班与因果推断讨论班已圆满结课!学术交流研讨班将会继续伴随大家!
1、讨论班简介
学术交流研讨班
针对博士生开展,主要形式为学术论文讨论交流。
2、时间及地点
3、本期内容概述
学术交流研讨班
内容预告
Reinforcement learning and off-policy evaluation
Reinforcement learning (RL) is one of three basic machine learning paradigms, alongside supervised learning and unsupervised learning. RL concerns how intelligent agents ought to take actions in an environment in order to maximize the notion of cumulative reward. In this talk, we will briefly introduce RL and related foundamental concepts like Markov decision process and Bellman optimality equation. We also discuss classical methods for the evaluation of target policy given the data generated from another policy.
腾讯会议链接
会议主题:学术交流研讨班
会议时间:2021/05/28-2021/08/27 16:00-18:00(GMT+08:00) 中国标准时间 - 北京, 每周 (周五)
点击链接入会,或添加至会议列表:
https://meeting.tencent.com/s/vwoS2FR3Lef0
会议 ID:591 2872 1234
