QMUL-NLP at HASOC 2019: Offensive content detection and classification in social media

Jiang, A

Volume

2517

Pagination

254 - 262

ISSN

1613-0073

Metadata

Show full item record

Abstract

With the development of the Internet, the Web has become an information dissemination platform, an information amplifier, and a new social media. The information load and participation of the Internet far exceeds the existing traditional media, and various problems have emerged. There has been significant work in several languages in partic- ular for English. However, there is a lack of research in this recent and relevant topic for most other languages. This track intends to develop data and evaluation resources for several languages. The objectives are to stimulate research for these languages and to find out the quality of hate speech detection technology in other languages. The paper mainly describes the organization of the HASOC 2019 Task, a Shared Task on Hate Speech and Offensive Content Identification in Indo-European Lan- guages. The task is organized in three related classification subtasks: sub- task A is a coarse-grained binary classification to identify hate speech and offensive language, a fine-grained classification subtask B is to further classify the data from the subtask A into three categories, and subtask C will check the type of offense. This paper mainly focuses on English of- fensive language detection and shows the experimental result in subtask A and subtask B.

Authors

Jiang, A

URI

https://qmro.qmul.ac.uk/xmlui/handle/123456789/69058

Collections

Electronic Engineering and Computer Science [3475]

Licence information

Use permitted under Creative Com- mons License Attribution 4.0 International (CC BY 4.0). Attribution 3.0 United States

Copyright statements

The following license files are associated with this item:

Creative Commons

Use permitted under Creative Com- mons License Attribution 4.0 International (CC BY 4.0).

Except where otherwise noted, this item's license is described as Use permitted under Creative Com- mons License Attribution 4.0 International (CC BY 4.0).