1. 首页
  2. 大数据
  3. 算法与数据结构
  4. DiCE:TheInfinitelyDifferentiableMonteCarloEstimator

DiCE:TheInfinitelyDifferentiableMonteCarloEstimator

上传者: 2019-03-12 04:22:27上传 PDF文件 418.86KB 热度 66次
The score function estimator is widely used for estimating gradients of stochastic objectives in Stochastic Computation Graphs (SCG), e.g., in reinforcement learning and meta-learning. While deriving the first order gradient estimators by differentiating a surrogate loss (SL) objective is computatio
用户评论