Neural Contextual Bandits with Deep Representation and Shallow Exploration
We study a general class of contextual bandits, where each context-action pair is associated with a raw feature vector, but the reward-generating function is unknown. We propose a novel learning algorithm that transforms the raw feature vector using the last hidden layer of a deep ReLU neural network (deep representation learning), and uses an upper confidence bound (UCB) approach to explore in the last linear layer (shallow exploration). We prove that under standard assumptions, our proposed algorithm achieves $\tilde{O}(\sqrt{T})$ finite-time regret, where $T$ is the learning time horizon. Compared with existing neural contextual bandit algorithms, our approach is computationally much more efficient since it only needs to explore in the last layer of the deep neural network.
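To make the "deep representation, shallow exploration" idea concrete, here is a minimal sketch of one bandit round: a deep ReLU network maps each raw context to its last-hidden-layer features, and a LinUCB-style rule explores only over those features via a ridge estimate of the last linear layer. The class `ReLUNet`, the helper `linucb_select`, the dimensions, and the exploration coefficient `beta` are all illustrative assumptions, not the paper's exact specification, and the sketch omits any periodic training of the representation network.

```python
import numpy as np
import torch
import torch.nn as nn

# Illustrative deep ReLU network; features(x) is the last hidden layer
# (the deep representation), head is the last linear layer.
class ReLUNet(nn.Module):
    def __init__(self, d_in, d_hidden):
        super().__init__()
        self.body = nn.Sequential(
            nn.Linear(d_in, d_hidden), nn.ReLU(),
            nn.Linear(d_hidden, d_hidden), nn.ReLU(),
        )
        self.head = nn.Linear(d_hidden, 1)

    def features(self, x):
        return self.body(x)

    def forward(self, x):
        return self.head(self.features(x))

def linucb_select(net, contexts, A_inv, theta, beta):
    """UCB over the last layer: theta^T phi(x) + beta * ||phi(x)||_{A^{-1}}."""
    with torch.no_grad():
        phi = net.features(torch.as_tensor(contexts, dtype=torch.float32)).numpy()
    mean = phi @ theta
    bonus = beta * np.sqrt(np.einsum("ij,jk,ik->i", phi, A_inv, phi))
    return int(np.argmax(mean + bonus)), phi

# One round with K arms of raw dimension d (placeholder numbers).
d, d_hidden, K, beta = 8, 16, 5, 1.0
net = ReLUNet(d, d_hidden)
A_inv = np.eye(d_hidden)           # (lambda*I + sum_t phi_t phi_t^T)^{-1}
b = np.zeros(d_hidden)             # sum_t r_t * phi_t
theta = np.zeros(d_hidden)         # ridge estimate of the last layer

contexts = np.random.randn(K, d).astype(np.float32)
arm, phi = linucb_select(net, contexts, A_inv, theta, beta)
reward = float(np.random.randn())  # observed reward (placeholder)

# Sherman-Morrison rank-one update of A_inv, then the ridge solution.
x = phi[arm]
A_inv -= np.outer(A_inv @ x, x @ A_inv) / (1.0 + x @ A_inv @ x)
b += reward * x
theta = A_inv @ b
```

Because exploration happens only in the `d_hidden`-dimensional last layer, each round costs a rank-one matrix update rather than confidence computations over all network weights, which is the source of the claimed computational savings.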