Image Search with Text Feedback by Visiolinguistic Attention Learning

Chen, Y; Gong, S; Bazzani, L; IEEE Conference on Computer Vision and Pattern Recognition

dc.contributor.author	Chen, Y
dc.contributor.author	Gong, S
dc.contributor.author	Bazzani, L
dc.contributor.author	IEEE Conference on Computer Vision and Pattern Recognition
dc.date.accessioned	2020-11-20T10:32:43Z
dc.date.available	2020-01-01
dc.date.available	2020-11-20T10:32:43Z
dc.date.issued	2020-01-01
dc.identifier.issn	1063-6919
dc.identifier.uri	https://qmro.qmul.ac.uk/xmlui/handle/123456789/68545
dc.description.abstract	Image search with text feedback has promising impacts in various real-world applications, such as e-commerce and internet search. Given a reference image and text feedback from user, the goal is to retrieve images that not only resemble the input image, but also change certain aspects in accordance with the given text. This is a challenging task as it requires the synergistic understanding of both image and text. In this work, we tackle this task by a novel Visiolinguistic Attention Learning (VAL) framework. Specifically, we propose a composite transformer that can be seamlessly plugged in a CNN to selectively preserve and transform the visual features conditioned on language semantics. By inserting multiple composite transformers at varying depths, VAL is incentive to encapsulate the multi-granular visiolinguistic information, thus yielding an expressive representation for effective image search. We conduct comprehensive evaluation on three datasets: Fashion200k, Shoes and FashionIQ. Extensive experiments show our model exceeds existing approaches on all datasets, demonstrating consistent superiority in coping with various text feedbacks, including attribute-like and natural language descriptions.	en_US
dc.format.extent	2998 - 3008
dc.publisher	IEEE	en_US
dc.title	Image Search with Text Feedback by Visiolinguistic Attention Learning	en_US
dc.type	Conference Proceeding	en_US
dc.rights.holder	© 2020 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.
dc.identifier.doi	10.1109/CVPR42600.2020.00307
pubs.notes	Not known	en_US
pubs.publication-status	Published	en_US
dcterms.dateAccepted	2020-01-01
rioxxterms.funder	Default funder	en_US
rioxxterms.identifier.project	Default project	en_US

Files in this item

Name:: Gong Image Search with 2020 ...
Size:: 5.308Mb
Format:: application/
Description:: Accepted version

View/Open

This item appears in the following Collection(s)

Electronic Engineering and Computer Science [3424]

Show simple item record