dc.contributor.author | Dehban, A | en_US |
dc.contributor.author | Zhang, S | en_US |
dc.contributor.author | Cauli, N | en_US |
dc.contributor.author | Jamone, L | en_US |
dc.contributor.author | Santos-Victor, J | en_US |
dc.date.accessioned | 2022-03-11T13:31:02Z | |
dc.date.issued | 2022-02-17 | en_US |
dc.identifier.issn | 2379-8920 | en_US |
dc.identifier.uri | https://qmro.qmul.ac.uk/xmlui/handle/123456789/77278 | |
dc.description.abstract | In order to effectively handle multiple tasks that are not pre-defined, a robotic agent needs to automatically map its high-dimensional sensory inputs into useful features. As a solution, feature learning has empirically shown substantial improvements in obtaining representations that are generalizable to different tasks, compared to feature engineering approaches, but it requires a large amount of data and computational capacity. These challenges are specifically relevant in robotics due to the low signal-to-noise ratios inherent to robotic data, and to the cost typically associated with collecting this type of input. In this paper, we propose a deep probabilistic method based on Convolutional Variational Auto-Encoders (CVAEs) to learn visual features suitable for interaction and recognition tasks. We run our experiments on a self-supervised robotic sensorimotor dataset. Our data was acquired with the iCub humanoid and is based on a standard object collection, thus being readily extensible. We evaluated the learned features in terms of usability for 1) object recognition, 2) capturing the statistics of the effects, and 3) planning. In addition, where applicable, we compared the performance of the proposed architecture with other state-of-the-art models. These experiments demonstrate that our model is capable of capturing the functional statistics of action and perception (i.e. images), and that it performs better than existing baselines, without requiring millions of samples or any hand-engineered features. | en_US |
dc.publisher | Institute of Electrical and Electronics Engineers | en_US |
dc.relation.ispartof | IEEE Transactions on Cognitive and Developmental Systems | en_US |
dc.title | Learning Deep Features for Robotic Inference from Physical Interactions | en_US |
dc.type | Article | |
dc.rights.holder | © 2022 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works. | |
dc.identifier.doi | 10.1109/TCDS.2022.3152383 | en_US |
pubs.notes | Not known | en_US |
pubs.publication-status | Published | en_US |
rioxxterms.funder | Default funder | en_US |
rioxxterms.identifier.project | Default project | en_US |