Show simple item record

dc.contributor.author: Wu, G (en_US)
dc.contributor.author: Zhu, X (en_US)
dc.contributor.author: Gong, S (en_US)
dc.date.accessioned: 2023-08-31T12:39:59Z
dc.date.issued: 2020-01-01 (en_US)
dc.identifier.uri: https://qmro.qmul.ac.uk/xmlui/handle/123456789/90325
dc.description.abstract: Learning discriminative spatio-temporal representation is the key to solving video re-identification (re-id) challenges. Most existing methods focus on learning appearance features and/or selecting image frames, but ignore optimising the compatibility and interaction of appearance and motion attentive information. To address this limitation, we propose a novel model for learning a Spatio-Temporal Associative Representation (STAR). We design a local frame-level spatio-temporal association to learn discriminative attentive appearance and short-term motion features, and a global video-level spatio-temporal association to form a compact and discriminative holistic video representation. We further introduce a pyramid ranking regulariser to facilitate end-to-end model optimisation. Extensive experiments demonstrate the superiority of STAR against state-of-the-art methods on four video re-id benchmarks: MARS, DukeMTMC-VideoReID, iLIDS-VID and PRID-2011. (en_US)
dc.title: Spatio-temporal associative representation for video person re-identification (en_US)
dc.type: Conference Proceeding
pubs.notes: Not known (en_US)
pubs.publication-status: Published (en_US)
