迈向卷积神经网络统一INT8培训

上传者：chin_53571 2021-01-22 15:44:46上传 .PDF文件 1.02 MB 热度 29次

最近，低位（例如8位）网络量化已得到广泛研究，以加快推理速度。除了推论，具有量化梯度的低位训练还可以带来更大的加速，因为后向过程通常需要大量计算。..

Towards Unified INT8 Training for Convolutional Neural Network

Recently low-bit (e.g., 8-bit) network quantization has been extensively studied to accelerate the inference. Besides inference, low-bit training with quantized gradients can further bring more considerable acceleration, since the backward process is often computation-intensive.Unfortunately, the inappropriate quantization of backward propagation usually makes the training unstable and even crash. There lacks a successful unified low-bit training framework that can support diverse networks on various tasks. In this paper, we give an attempt to build a unified 8-bit (INT8) training framework for common convolutional neural networks from the aspects of both accuracy and speed. First, we empirically find the four distinctive characteristics of gradients, which provide us insightful clues for gradient quantization. Then, we theoretically give an in-depth analysis of the convergence bound and derive two principles for stable INT8 training. Finally, we propose two universal techniques, including Direction Sensitive Gradient Clipping that reduces the direction deviation of gradients and Deviation Counteractive Learning Rate Scaling that avoids illegal gradient update along the wrong direction. The experiments show that our unified solution promises accurate and efficient INT8 training for a variety of networks and tasks, including MobileNetV2, InceptionV3 and object detection that prior studies have never succeeded. Moreover, it enjoys a strong flexibility to run on off-the-shelf hardware, and reduces the training time by 22% on Pascal GPU without too much optimization effort. We believe that this pioneering study will help lead the community towards a fully unified INT8 training for convolutional neural networks.

下载地址

用户评论

更多下载

下载地址

立即下载

用户评论

迈向卷积神经网络统一INT8培训

Recently low-bit (e.g., 8-bit) network quantizatio...

大小：1.02 MB | 2021-01-22 15:44:46
卷积神经网络

大小：0B | 2019-02-28 03:07:41
《卷积神经网络》

大小：0B | 2019-02-22 09:25:52
卷积神经网络之卷积

**各位同学上一节课我们介绍了信号上面的卷积运算,并且从信号上面的一个卷积运算类推到我们这里的二维图...

大小：739KB | 2021-01-16 21:28:39
卷积神经网络详述

大小：0B | 2019-02-15 19:29:56
解析卷积神经网络

大小：0B | 2019-01-16 01:45:48
卷积神经网络.LearningMaterials

卷积神经网络学习资料以及实验

大小：11.72MB | 2023-01-10 23:48:35
反卷积神经网络

反卷积跟1维信号处理的反卷积计算是很不一样的，FCN作者称为backwardsconvolution...

大小：0B | 2019-09-27 14:50:29
cnn卷积神经网络

利用卷积神经网络对mnist数据集进行分类，代码采用python进行编写，并有详细的注释，且文件自带...

大小：0B | 2019-09-12 02:32:34
卷积神经网络代码

卷积神经网络的源代码ConvNet-C++ConvolutionalNeuralNetworkLib...

大小：0B | 2019-08-13 17:51:04
卷积神经网络cnn

用于图像分类的源代码，深度学习的应用，cnn卷积神经网络的使用，可以自己更换数据集或者调优，适合初学...

大小：0B | 2019-08-16 12:41:09
CNN卷积神经网络

ThefirstCNNappearedintheworkofFukushimain1980andwa...

大小：0B | 2020-05-30 10:22:19
卷积神经网络论文

卷积神经网络在目标识别，图像分类，图像切割等方面的应用

大小：0B | 2019-05-22 02:58:18
卷积神经网络PPT

介绍卷积神经网络

大小：0B | 2019-05-14 22:50:35
卷积神经网络python

深度学习卷积神经网络，简单的实现卷积神经网络的架构。简单通俗

大小：0B | 2019-06-01 12:03:31
卷积神经网络ppt

大小：0B | 2019-03-10 10:15:03