
MSAF: Multimodal Split Attention Fusion


Multimodal learning mimics the reasoning process of the human multi-sensory system, which is used to perceive the surrounding world. While making a prediction, the human brain tends to relate crucial cues from multiple sources of information. In this work, we propose a novel multimodal fusion module that learns to emphasize more contributive features across all modalities. Specifically, the proposed Multimodal Split Attention Fusion (MSAF) module splits each modality into channel-wise equal feature blocks and creates a joint representation that is used to generate soft attention for each channel across the feature blocks. Further, the MSAF module is designed to be compatible with features of various spatial dimensions and sequence lengths, suitable for both CNNs and RNNs. Thus, MSAF can be easily added to fuse features of any unimodal networks and utilize existing pretrained unimodal model weights. To demonstrate the effectiveness of our fusion module, we design three multimodal networks with MSAF for emotion recognition, sentiment analysis, and action recognition tasks. Our approach achieves competitive results in each task and outperforms other application-specific networks and multimodal fusion benchmarks.
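The abstract describes the module only at a high level. Below is a minimal PyTorch sketch of the split-attention idea as described there, assuming pooled (batch, channels) features per modality. The block size, reduction ratio, softmax-over-blocks attention, and the names SplitAttentionFusionSketch and block_channels are illustrative assumptions, not the authors' exact design.

    # Minimal sketch (assumptions noted above): split each modality's features into
    # channel-wise equal blocks, build a joint representation, and produce per-channel
    # soft attention for each block.
    import torch
    import torch.nn as nn

    class SplitAttentionFusionSketch(nn.Module):
        def __init__(self, modality_channels, block_channels, reduction=4):
            super().__init__()
            assert all(c % block_channels == 0 for c in modality_channels)
            self.block_channels = block_channels
            self.blocks_per_modality = [c // block_channels for c in modality_channels]
            total_blocks = sum(self.blocks_per_modality)
            hidden = max(block_channels // reduction, 8)
            # Shared bottleneck over the joint representation.
            self.fc_joint = nn.Sequential(nn.Linear(block_channels, hidden),
                                          nn.ReLU(inplace=True))
            # One head per block produces per-channel attention logits.
            self.fc_blocks = nn.ModuleList(
                nn.Linear(hidden, block_channels) for _ in range(total_blocks)
            )

        def forward(self, feats):
            # feats: list of (batch, channels_i) tensors, one per modality.
            b = feats[0].shape[0]
            blocks = torch.cat(
                [f.view(b, n, self.block_channels)
                 for f, n in zip(feats, self.blocks_per_modality)],
                dim=1,
            )  # (batch, total_blocks, block_channels)
            joint = self.fc_joint(blocks.sum(dim=1))          # joint representation
            logits = torch.stack([fc(joint) for fc in self.fc_blocks], dim=1)
            attn = torch.softmax(logits, dim=1)               # soft attention across blocks
            fused = blocks * attn                             # per-channel reweighting
            # Hand the recalibrated features back to each unimodal branch, one
            # plausible way to keep the module a drop-in addition.
            out, start = [], 0
            for n in self.blocks_per_modality:
                out.append(fused[:, start:start + n].reshape(b, -1))
                start += n
            return out

    # Example: fuse 128-channel audio features with 256-channel visual features.
    audio, visual = torch.randn(4, 128), torch.randn(4, 256)
    msaf = SplitAttentionFusionSketch([128, 256], block_channels=64)
    audio_out, visual_out = msaf([audio, visual])

Because the attention is computed per channel of fixed-size blocks rather than over raw spatial or temporal grids, the same mechanism can sit on top of pooled CNN feature maps or RNN hidden states, which matches the abstract's claim of compatibility with both.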

