dc.contributor.author | Demirel, E | |
dc.contributor.author | Ahlback, S | |
dc.contributor.author | Dixon, S | |
dc.date.accessioned | 2024-07-09T10:37:44Z | |
dc.date.available | 2024-07-09T10:37:44Z | |
dc.date.issued | 2021-06-21 | |
dc.identifier.uri | https://qmro.qmul.ac.uk/xmlui/handle/123456789/97929 | |
dc.description.abstract | Recent automatic lyrics transcription (ALT) approaches focus on building stronger acoustic models or indomain language models, while the pronunciation aspect is
seldom touched upon. This paper applies a novel computational
analysis on the pronunciation variances in sung utterances
and further proposes a new pronunciation model adapted for
singing. The singing-adapted model is tested on multiple public
datasets via word recognition experiments. It performs better
than the standard speech dictionary in all settings reporting
the best results on ALT in a capella recordings using n-gram
language models. For reproducibility, we share the sentencelevel annotations used in testing, providing a new benchmark
evaluation set for ALT. | en_US |
dc.publisher | arXiv | en_US |
dc.relation.ispartof | arXiv | |
dc.title | Computational Pronunciation Analysis in Sung Utterances | en_US |
dc.rights.holder | © 2024 The Author(s). | |
dc.identifier.doi | 10.48550/arxiv.2106.10977 | |
pubs.notes | Not known | en_US |
rioxxterms.funder | Default funder | en_US |
rioxxterms.identifier.project | Default project | en_US |
rioxxterms.funder.project | b215eee3-195d-4c4f-a85d-169a4331c138 | en_US |