搜索结果: 1-1 共查到“统计学 policy”相关记录1条 . 查询时间(0.078 秒)
Regret Bounds for Reinforcement Learning with Policy Advice
Regret Bounds Reinforcement LearningPolicy Advice
2013/6/13
In some reinforcement learning problems an agent may be provided with a set of input policies, perhaps learned from prior experience or provided by advisors. We present a reinforcement learning with p...