直接量化用于训练高精度，低位宽的深度神经网络

上传者：axe12879 2021-01-22 04:10:48上传 .PDF文件 567.95 KB 热度 14次

本文提出了两种新颖的技术来训练具有低位宽权重和激活的深度卷积神经网络。首先，为了获得低的位宽权重，大多数现有方法是通过对全精度网络权重执行量化来获得量化权重的。..

Direct Quantization for Training Highly Accurate Low Bit-width Deep Neural Networks

This paper proposes two novel techniques to train deep convolutional neural networks with low bit-width weights and activations. First, to obtain low bit-width weights, most existing methods obtain the quantized weights by performing quantization on the full-precision network weights.However, this approach would result in some mismatch: the gradient descent updates full-precision weights, but it does not update the quantized weights. To address this issue, we propose a novel method that enables {direct} updating of quantized weights {with learnable quantization levels} to minimize the cost function using gradient descent. Second, to obtain low bit-width activations, existing works consider all channels equally. However, the activation quantizers could be biased toward a few channels with high-variance. To address this issue, we propose a method to take into account the quantization errors of individual channels. With this approach, we can learn activation quantizers that minimize the quantization errors in the majority of channels. Experimental results demonstrate that our proposed method achieves state-of-the-art performance on the image classification task, using AlexNet, ResNet and MobileNetV2 architectures on CIFAR-100 and ImageNet datasets.

下载地址

用户评论

更多下载

下载地址

立即下载

用户评论

直接量化用于训练高精度低位宽的深度神经网络

This paper proposes two novel techniques to train ...

大小：567.95 KB | 2021-01-22 04:10:48
cpp DeepCLOpenCL库用于训练深度卷积神经网络

DeepCL - OpenCL库用于训练深度卷积神经网络

大小：1.12MB | 2020-07-25 09:41:47
用于神经网络量化的镜像下降视图

Quantizing large Neural Networks (NN) while mainta...

大小：1.04 MB | 2021-01-23 05:22:27
使用离散状态转换训练深度神经网络

深度神经网络已经在各种人工智能任务中实现了迅猛的突破,但是,由于消耗了无法忍受的硬件资源,训练时间和...

大小：128KB | 2021-04-16 18:02:37
深度神经网络ssd检测类深度神经网络

大小：0B | 2019-01-07 19:11:45
深度神经网络

基于深度卷积神经网络的超分辨率技术的VDCN，其中包含代码

大小：0B | 2019-09-12 01:25:57
RBF神经网络的训练

大小：0B | 2019-02-25 07:48:41
javaCV神经网络训练

大小：0B | 2019-01-11 01:59:07
BP神经网络训练

大小：0B | 2019-01-21 07:44:15
训练好用于车牌分割的神经网络

训练好用于车牌识别的神经网络，0-9，A-Z（不含I和O），每个字符使用50张图片，训练好用于车牌识...

大小：0B | 2019-09-11 22:05:37
使用多个GPU的深度神经网络快速训练算法

远端深层神经网络(DNN)被成功取代语音识别领域,成为一种很具发展潜力的语音识别模型。然而,由于其训...

大小：296KB | 2021-05-11 10:04:05
FixNorm剖析体重衰减以训练深度神经网络

Weight decay is a widely used technique for traini...

大小：712.25 KB | 2021-01-22 03:37:48
DeepLearningToolbox用于分析深度神经网络的工具源码

深度学习工具箱(开发阶段) 一组用于分析和可视化深度神经网络的工具。该工具箱的最初目标是可视化网络...

大小：8.41MB | 2021-02-24 22:39:46
rbf神经网络的训练代码

训练RBF网络Result=~sum(abs(T1-Y1))%正确分类显示为1Percent1=su...

大小：0B | 2020-01-27 08:33:34
深度神经网络94.5

对模型的参数进一步调整......,有一个奇怪的地方,batch_size居然影响到了泛化能力,不过...

大小：39.63MB | 2020-08-31 00:47:21
深度神经网络调研

大小：0B | 2019-04-05 01:41:09