Show simple item record

dc.contributor.authorZosa, E
dc.contributor.authorShekhar, R
dc.contributor.authorKaran, M
dc.contributor.authorPurver, M
dc.date.accessioned2021-11-05T09:36:08Z
dc.date.available2021-11-05T09:36:08Z
dc.date.issued2021
dc.identifier.urihttps://qmro.qmul.ac.uk/xmlui/handle/123456789/75039
dc.description.abstractModeration of reader comments is a significant problem for online news platforms. Here, we experiment with models for automatic moderation, using a dataset of comments from a popular Croatian newspaper. Our analysis shows that while comments that violate the moderation rules mostly share common linguistic and thematic features, their content varies across the different sections of the newspaper. We therefore make our models topic-aware, incorporating semantic features from a topic model into the classification decision. Our results show that topic information improves the performance of the model, increases its confidence in correct outputs, and helps us understand the model's outputs.en_US
dc.rightsThis article is distributed under the terms of the Creative Commons Attribution – NonCommercial – NoDerivs (CC BY-NC-ND 4.0) licence. You are permitted to download and share the original work, crediting the original source, without altering or using the material for commercial purposes.
dc.subjectcs.CLen_US
dc.subjectcs.CLen_US
dc.titleNot All Comments are Equal: Insights into Comment Moderation from a Topic-Aware Modelen_US
dc.typeArticleen_US
dc.rights.holder© 2021, The Author(s)
pubs.author-urlhttp://arxiv.org/abs/2109.10033v1en_US
pubs.notesNot knownen_US
rioxxterms.funderDefault funderen_US
rioxxterms.identifier.projectDefault projecten_US
qmul.funderEMBEDDIA: Cross-Lingual Embeddings for Less-Represented Languages in European News Media::European Commissionen_US
qmul.funderEMBEDDIA: Cross-Lingual Embeddings for Less-Represented Languages in European News Media::European Commissionen_US


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record