Computational Pronunciation Analysis in Sung Utterances

dc.contributor.author	Demirel, E
dc.contributor.author	Ahlback, S
dc.contributor.author	Dixon, S
dc.date.accessioned	2024-07-09T10:37:44Z
dc.date.available	2024-07-09T10:37:44Z
dc.date.issued	2021-06-21
dc.identifier.uri	https://qmro.qmul.ac.uk/xmlui/handle/123456789/97929
dc.description.abstract	Recent automatic lyrics transcription (ALT) approaches focus on building stronger acoustic models or indomain language models, while the pronunciation aspect is seldom touched upon. This paper applies a novel computational analysis on the pronunciation variances in sung utterances and further proposes a new pronunciation model adapted for singing. The singing-adapted model is tested on multiple public datasets via word recognition experiments. It performs better than the standard speech dictionary in all settings reporting the best results on ALT in a capella recordings using n-gram language models. For reproducibility, we share the sentencelevel annotations used in testing, providing a new benchmark evaluation set for ALT.	en_US
dc.publisher	arXiv	en_US
dc.relation.ispartof	arXiv
dc.title	Computational Pronunciation Analysis in Sung Utterances	en_US
dc.rights.holder	© 2024 The Author(s).
dc.identifier.doi	10.48550/arxiv.2106.10977
pubs.notes	Not known	en_US
rioxxterms.funder	Default funder	en_US
rioxxterms.identifier.project	Default project	en_US
rioxxterms.funder.project	b215eee3-195d-4c4f-a85d-169a4331c138	en_US