无人驾驶DeepReforcementLearning.pdf
PhilosophicalMotivationforReinforcementLearningTakeawayfromSupervisedLearning:Neuralnetworksaregreatatmemorizationandnot(yet)greatatreasoning.HopeforReinforcementLearning:Brute-forcepropagationofoutcomestoknowledgeaboutstatesandactions.Thisisakindofbrute-force“reasoning”.
下载地址
用户评论