Show simple item record

dc.contributor.authorJiang, Aen_US
dc.contributor.authorYang, Xen_US
dc.contributor.authorLiu, Yen_US
dc.contributor.authorZubiaga, Aen_US
dc.date.accessioned2023-07-20T11:12:55Z
dc.date.available2021-11-05en_US
dc.date.issued2021-12-14en_US
dc.identifier.urihttps://qmro.qmul.ac.uk/xmlui/handle/123456789/89668
dc.description.abstractOnline sexism has become an increasing concern in social media platforms as it has affected the healthy development of the Internet and can have negative effects in society. While research in the sexism detection domain is growing, most of this research focuses on English as the language and on Twitter as the platform. Our objective here is to broaden the scope of this research by considering the Chinese language on Sina Weibo. We propose the first Chinese sexism dataset – Sina Weibo Sexism Review (SWSR) dataset –, as well as a large Chinese lexicon SexHateLex made of abusive and gender-related terms. We introduce our data collection and annotation process, and provide an exploratory analysis of the dataset characteristics to validate its quality and to show how sexism is manifested in Chinese. The SWSR dataset provides labels at different levels of granularity including (i) sexism or non-sexism, (ii) sexism category and (iii) target type, which can be exploited, among others, for building computational methods to identify and investigate finer-grained gender-related abusive language. We conduct experiments for the three sexism classification tasks making use of state-of-the-art machine learning models. Our results show competitive performance, providing a benchmark for sexism detection in the Chinese language, as well as an error analysis highlighting open challenges needing more research in Chinese NLP. The SWSR dataset and SexHateLex lexicon are publicly available.1en_US
dc.relation.ispartofOnline Social Networks and Mediaen_US
dc.titleSWSR: A Chinese dataset and lexicon for online sexism detectionen_US
dc.typeArticle
dc.rights.holder© 2021 Elsevier Ltd. All rights reserved.
dc.identifier.doi10.1016/j.osnem.2021.100182en_US
pubs.notesNot knownen_US
pubs.publication-statusAccepteden_US
pubs.publisher-urlhttps://www.sciencedirect.com/science/article/abs/pii/S2468696421000604en_US
pubs.volume27en_US
dcterms.dateAccepted2021-11-05en_US
rioxxterms.funderDefault funderen_US
rioxxterms.identifier.projectDefault projecten_US


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record