Show simple item record

dc.contributor.authorBernardy, J-Pen_US
dc.contributor.authorLappin, Sen_US
dc.contributor.editorZaenen, Aen_US
dc.date.accessioned2020-06-12T10:22:35Z
dc.date.available2017-08-10en_US
dc.date.issued2017-11-01en_US
dc.identifier.issn1945-3604en_US
dc.identifier.urihttps://qmro.qmul.ac.uk/xmlui/handle/123456789/64882
dc.description.abstractWe consider the extent to which different deep neural network (DNN) configurations can learn syntactic relations, by taking up Linzen et al.’s (2016) work on subject-verb agreement with LSTM RNNs. We test their methods on a much larger corpus than they used (a ⇠24 million example part of the WaCky corpus, instead of their ⇠1.35 million example corpus, both drawn from Wikipedia). We experiment with several different DNN architectures (LSTM RNNs, GRUs, and CNNs), and alternative parameter settings for these systems (vocabulary size, training to test ratio, number of layers, memory size, drop out rate, and lexical embedding dimension size). We also try out our own unsupervised DNN language model. Our results are broadly compatible with those that Linzen et al. report. However, we discovered some interesting, and in some cases, surprising features of DNNs and language models in their performance of the agreement learning task. In particular, we found that DNNs require large vocabularies to form substantive lexical embeddings in order to learn structural patterns. This finding has interesting consequences for our understanding of the way in which DNNs represent syntactic information. It suggests that DNNs learn syntactic patterns more efficiently through rich lexical embeddings, with semantic as well as syntactic cues, than from training on lexically impoverished strings that highlight structural patterns.en_US
dc.format.extent1 - 15 (15)en_US
dc.languageEnglishen_US
dc.relation.ispartofLinguistic Issues in Language Technologyen_US
dc.subjectDeep Learning; syntactic agreement; computational linguisticsen_US
dc.titleUsing Deep Neural Networks to Learn Syntactic Agreementen_US
dc.typeArticle
dc.rights.holder© 2017 Linguistic Issues in Language Technology
pubs.issue2en_US
pubs.notesNot knownen_US
pubs.publication-statusPublisheden_US
pubs.publisher-urlhttps://web.stanford.edu/group/cslipublications/LiLT/en_US
pubs.volume15en_US
dcterms.dateAccepted2017-08-10en_US
rioxxterms.funderDefault funderen_US
rioxxterms.identifier.projectDefault projecten_US
qmul.funderInternational Recruitment Grant::Swedish Research Councilen_US


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record