• Login
    JavaScript is disabled for your browser. Some features of this site may not work without it.
    Towards Deep End-of-Turn Detection for Situated Spoken Dialogue Systems 
    •   QMRO Home
    • School of Electronic Engineering and Computer Science
    • Electronic Engineering and Computer Science
    • Towards Deep End-of-Turn Detection for Situated Spoken Dialogue Systems
    •   QMRO Home
    • School of Electronic Engineering and Computer Science
    • Electronic Engineering and Computer Science
    • Towards Deep End-of-Turn Detection for Situated Spoken Dialogue Systems
    ‌
    ‌

    Browse

    All of QMROCommunities & CollectionsBy Issue DateAuthorsTitlesSubjectsThis CollectionBy Issue DateAuthorsTitlesSubjects
    ‌
    ‌

    Administrators only

    Login
    ‌
    ‌

    Statistics

    Most Popular ItemsStatistics by CountryMost Popular Authors

    Towards Deep End-of-Turn Detection for Situated Spoken Dialogue Systems

    View/Open
    Published version (387.2Kb)
    Publisher
    ISCA
    Metadata
    Show full item record
    Abstract
    We address the challenge of improving live end-of-turn detection for situated spoken dialogue systems. While traditionally silence thresholds have been used to detect the user’s end-of-turn, such an approach limits the system’s potential fluidity in interaction, restricting it to a purely reactive paradigm. By contrast, here we present a system which takes a predictive approach. The user’s end-of-turn is predicted live as acoustic features and words are consumed by the system. We compare the benefits of live lexical and acoustic information by feature analysis and testing equivalent models with different feature sets with a common deep learning architecture, a Long Short-Term Memory (LSTM) network. We show the usefulness of incremental enriched language model features in particular. Training and testing onWizard-of-Oz data collected to train an agent in a simple virtual world, we are successful in improving over a reactive baseline in terms of reducing latency whilst minimising the cut-in rate.
    Authors
    Maier, A; HOUGH, J; Schlangen, D; InterSpeech 2017
    URI
    https://qmro.qmul.ac.uk/xmlui/handle/123456789/55075
    Collections
    • Electronic Engineering and Computer Science [309]
    Copyright statements
    2017 ISCA
    Twitter iconFollow QMUL on Twitter
    Twitter iconFollow QM Research
    Online on twitter
    Facebook iconLike us on Facebook
    • Site Map
    • Privacy and cookies
    • Disclaimer
    • Accessibility
    • Contacts
    • Intranet
    • Current students

    Modern Slavery Statement

    Queen Mary University of London
    Mile End Road
    London E1 4NS
    Tel: +44 (0)20 7882 5555

    © Queen Mary University of London.