Show simple item record

dc.contributor.authorZhang, T
dc.contributor.authorFang, X
dc.contributor.authorWang, Z
dc.contributor.authorLiu, Y
dc.contributor.authorNallanathan, A
dc.date.accessioned2021-11-05T12:02:09Z
dc.date.available2021-11-05T12:02:09Z
dc.date.issued2021-10-01
dc.identifier.issn0018-9545
dc.identifier.urihttps://qmro.qmul.ac.uk/xmlui/handle/123456789/75055
dc.description.abstractEdge caching has become an effective solution to cope with the challenges brought by the massive content delivery in cellular networks. In device-to-device (D2D) enabled caching cellular networks with time-varying content popularity distribution and user terminal (UT) location, we model these dynamic networks as a stochastic game to design a cooperative cache placement policy. The cache placement reward of each UT is defined as the caching incentive minus the transmission power cost for content caching and sharing. We consider the long-term cache placement reward of all UTs in this stochastic game. In an effort to solve the stochastic game problem, we propose a multi-agent cooperative alternating Q-learning (CAQL) based cache placement algorithm. The caching control unit is defined to execute the proposed CAQL, in which, the cache placement policy of each UT is alternatively updated according to the stable policy of other UTs during the learning process, until the stable cache placement policy of all the UTs in the cell is obtained. We discuss the convergence and complexity of CAQL, which obtains the stable cache placement policy with low space complexity. Simulation results show that the proposed algorithm can effectively reduce the backhaul load and the average content access delay in dynamic networks.en_US
dc.format.extent1 - 1
dc.publisherIEEEen_US
dc.relation.ispartofIEEE Transactions on Vehicular Technology
dc.titleStochastic Game based Cooperative Alternating Q-Learning Caching in Dynamic D2D Networksen_US
dc.typeArticleen_US
dc.rights.holder© 2021 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.
dc.identifier.doi10.1109/tvt.2021.3120292
pubs.issue99en_US
pubs.notesNot knownen_US
pubs.volumePPen_US
rioxxterms.funderDefault funderen_US
rioxxterms.identifier.projectDefault projecten_US


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record