New Ideas and Trends in Deep Multimodal Content Understanding: A Review
This survey focuses on the analysis of two modalities in multimodal deep learning: image and text. Unlike classic reviews of deep learning, in which monomodal image classifiers such as VGG, ResNet, and the Inception module are the central topics, this paper examines recent multimodal deep models and structures, including auto-encoders, generative adversarial networks, and their variants. These models go beyond simple image classifiers in that they can perform both uni-directional multimodal tasks (e.g., image captioning, image generation) and bi-directional ones (e.g., cross-modal retrieval, visual question answering). In addition, we analyze two aspects of the challenge of achieving better content understanding in deep multimodal applications. We then introduce current ideas and trends in deep multimodal feature learning, such as feature embedding approaches and objective function design, which are crucial in overcoming the aforementioned challenges. Finally, we outline several promising directions for future research.