Author: JarquÃn-Vásquez, Horacio Jesús; Montes-y-Gómez, Manuel; Villaseñor-Pineda, Luis
Title: Not All Swear Words Are Used Equal: Attention over Word n-grams for Abusive Language Identification Cord-id: mtw7vxyd Document date: 2020_4_29
ID: mtw7vxyd
Snippet: The increasing propagation of abusive language in social media is a major concern for supplier companies and governments because of its negative social impact. A large number of methods have been developed for its automatic identification, ranging from dictionary-based methods to sophisticated deep learning approaches. A common problem in all these methods is to distinguish the offensive use of swear words from their everyday and humorous usage. To tackle this particular issue we propose an atte
Document: The increasing propagation of abusive language in social media is a major concern for supplier companies and governments because of its negative social impact. A large number of methods have been developed for its automatic identification, ranging from dictionary-based methods to sophisticated deep learning approaches. A common problem in all these methods is to distinguish the offensive use of swear words from their everyday and humorous usage. To tackle this particular issue we propose an attention-based neural network architecture that captures the word n-grams importance according to their context. The obtained results in four standard collections from Twitter and Facebook are encouraging, they outperform the [Formula: see text] scores from state-of-the-art methods and allow identifying a set of inherently offensive swear words, and others in which its interpretation depends on its context.
Search related documents:
Co phrase search for related documents, hyperlinks ordered by date