Divide and Conquer: An Ensemble Approach for Hostile Post Detection in Hindi

Name: Divide and Conquer: An Ensemble Approach for Hostile Post Detection in Hindi
Rating: 4.5 (44 reviews)
Author: qqchat57211

Uploader: qqchat57211 | Uploaded 2021-01-24 09:18:47 | PDF file | 518.69 KB | Popularity: 44


The NLP community has recently begun to take an interest in the challenging task of hostile post detection. This paper presents our system for the shared task at Constraint2021 on "Hostile Post Detection in Hindi". The data for this shared task, collected from Twitter and Facebook, is provided in Hindi in the Devanagari script. It is a multi-label, multi-class classification problem in which each data instance is annotated with one or more of five classes: fake, hate, offensive, defamation, and non-hostile. We propose a two-level architecture made up of BERT-based classifiers and statistical classifiers to solve this problem. Our team, 'Albatross', achieved a coarse-grained hostility F1 score of 0.9709 on the Hostile Post Detection in Hindi subtask and secured 2nd rank out of 45 teams. Our submissions ranked 2nd and 3rd out of a total of 156 submissions, with coarse-grained hostility F1 scores of 0.9709 and 0.9703 respectively. Our fine-grained scores are also very encouraging and can be improved with further fine-tuning. The code is publicly available.
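The abstract does not spell out how the two levels divide the work, so the following is only a minimal sketch of one plausible reading: a first level that separates hostile from non-hostile posts, and a second level that scores each hostile class independently. The function names, the 0.5 threshold, the fallback rule, and the classifier callables are all assumptions made for illustration; in a real system the callables would wrap trained BERT-based or statistical models. The `coarse_f1` helper computes a plain binary F1 for the hostile class, which may differ from the averaging used by the official shared-task scorer.

```python
from typing import Callable, Dict, List

HOSTILE_LABELS = ["fake", "hate", "offensive", "defamation"]
NON_HOSTILE = "non-hostile"


def predict_labels(
    post: str,
    is_hostile: Callable[[str], bool],
    fine_scorers: Dict[str, Callable[[str], float]],
    threshold: float = 0.5,
) -> List[str]:
    """Two-level multi-label prediction (hypothetical layout, not the authors' code)."""
    # Level 1 (assumed): coarse hostile vs. non-hostile decision.
    if not is_hostile(post):
        return [NON_HOSTILE]
    # Level 2 (assumed): one independent score per hostile class.
    scores = {label: fine_scorers[label](post) for label in HOSTILE_LABELS}
    labels = [label for label in HOSTILE_LABELS if scores[label] >= threshold]
    # Assumed fallback: a post judged hostile gets at least its top-scoring class.
    if not labels:
        labels = [max(HOSTILE_LABELS, key=lambda label: scores[label])]
    return labels


def coarse_f1(gold: List[List[str]], pred: List[List[str]]) -> float:
    """Binary F1 for the 'hostile' class; a post is hostile if it is not non-hostile."""
    tp = fp = fn = 0
    for gold_labels, pred_labels in zip(gold, pred):
        gold_hostile = NON_HOSTILE not in gold_labels
        pred_hostile = NON_HOSTILE not in pred_labels
        if gold_hostile and pred_hostile:
            tp += 1
        elif pred_hostile:
            fp += 1
        elif gold_hostile:
            fn += 1
    denominator = 2 * tp + fp + fn
    return 2 * tp / denominator if denominator else 0.0
```

Keeping the classifiers as plain callables lets the same routing logic be reused whether a given level is backed by a fine-tuned transformer or by a lighter statistical model.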



