
SpineNet: Learning Scale-Permuted Backbone for Recognition and Localization

Uploaded 2021-01-24 09:09:11 · PDF, 2.10 MB · 18 views


Convolutional neural networks typically encode an input image into a series of intermediate features with decreasing resolutions. While this structure is suited to classification tasks, it does not perform well for tasks requiring simultaneous recognition and localization (e.g., object detection). Encoder-decoder architectures are proposed to resolve this by applying a decoder network onto a backbone model designed for classification tasks. In this paper, we argue that the encoder-decoder architecture is ineffective in generating strong multi-scale features because of the scale-decreased backbone. We propose SpineNet, a backbone with scale-permuted intermediate features and cross-scale connections that is learned on an object detection task by Neural Architecture Search. Using similar building blocks, SpineNet models outperform ResNet-FPN models by ~3% AP at various scales while using 10-20% fewer FLOPs. In particular, SpineNet-190 achieves 52.5% AP with a Mask R-CNN detector and 52.1% AP with a RetinaNet detector on COCO for a single model without test-time augmentation, significantly outperforming prior state-of-the-art detectors. SpineNet can also transfer to classification tasks, achieving a 5% top-1 accuracy improvement on the challenging iNaturalist fine-grained dataset. Code is at: https://github.com/tensorflow/tpu/tree/master/models/official/detection.
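To make the core idea concrete, here is a minimal sketch of what "scale-permuted with cross-scale connections" means, independent of the actual searched SpineNet layouts. The block specification and helper names below are hypothetical illustrations, not the paper's implementation: block levels (feature resolutions) may appear in any order rather than monotonically decreasing, and each block merges parent features that are first resampled to its target level.

```python
import numpy as np

def resample(x, src_level, dst_level):
    """Nearest-neighbor resize of an (H, W, C) feature map between pyramid levels.

    Level L corresponds to spatial stride 2**L, so moving to a higher level
    halves the resolution and moving to a lower level doubles it.
    """
    factor = 2 ** (src_level - dst_level)  # > 1 upsamples, < 1 downsamples
    if factor >= 1:
        return x.repeat(int(factor), axis=0).repeat(int(factor), axis=1)
    step = int(1 / factor)
    return x[::step, ::step]

def build_scale_permuted(stem, block_specs):
    """Build features in a permuted order of scales.

    block_specs: list of (level, parent_indices) where parents index
    previously built features; each parent is resampled to the block's
    level before merging (a real block would then apply convolutions).
    """
    feats = [(1, stem)]  # stem output at pyramid level 1
    for level, parents in block_specs:
        merged = sum(resample(feats[p][1], feats[p][0], level) for p in parents)
        feats.append((level, merged))
    return feats

stem = np.ones((64, 64, 1))  # toy stem feature map
# Levels jump up and down (2 -> 4 -> 3 -> 5) instead of decreasing
# monotonically, with cross-scale connections to non-adjacent blocks.
specs = [(2, (0, 0)), (4, (0, 1)), (3, (1, 2)), (5, (2, 3))]
feats = build_scale_permuted(stem, specs)
```

In a scale-decreased backbone, each block could only see its immediate predecessor at an equal or finer scale; here block 3 (level 3) merges a coarser level-4 parent with a finer level-2 parent, which is the kind of connectivity the architecture search in the paper explores.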

