Show simple item record

dc.contributor.authorIlhan, Een_US
dc.contributor.authorGow, Jen_US
dc.contributor.authorPerez-Liebana, Den_US
dc.contributor.authorAutonomous Agents and Multiagent Systemsen_US
dc.date.accessioned2022-01-27T14:53:56Z
dc.date.available2020-12-17en_US
dc.date.issued2021-01-01en_US
dc.identifier.isbn9781713832621en_US
dc.identifier.issn1548-8403en_US
dc.identifier.urihttps://qmro.qmul.ac.uk/xmlui/handle/123456789/76457
dc.description.abstractAction advising is a peer-to-peer knowledge exchange technique built on the teacher-student paradigm to alleviate the sample inefficiency problem in deep reinforcement learning. Recently proposed student-initiated approaches have obtained promising results. However, due to being in the early stages of development, these also have some substantial shortcomings. One of the abilities that are absent in the current methods is further utilising advice by reusing, which is especially crucial in the practical settings considering the budget constraints in peer-to-peer interactions. In this study, we present an approach to enable the student agent to imitate previously acquired advice to reuse them directly in its exploration policy, without any interventions in the learning mechanism itself. In particular, we employ a behavioural cloning module to imitate the teacher policy and use dropout regularisation to have a notion of epistemic uncertainty to keep track of which state-advice pairs are actually collected. As the results of experiments we conducted in three Atari games show, advice reusing via imitation is indeed a feasible option in deep RL and our approach can successfully achieve this while significantly improving the learning performance, even when it is paired with a simple early advising heuristic.en_US
dc.format.extent629 - 637en_US
dc.titleAction advising with advice imitation in deep reinforcement learningen_US
dc.typeConference Proceeding
pubs.notesNot knownen_US
pubs.publication-statusPublisheden_US
pubs.volume2en_US
dcterms.dateAccepted2020-12-17en_US


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record