Show simple item record

dc.contributor.authorWang, H
dc.contributor.authorZhu, C
dc.contributor.authorMa, Z
dc.contributor.authorOh, C
dc.date.accessioned2024-01-05T10:57:23Z
dc.date.available2024-01-05T10:57:23Z
dc.date.issued2022
dc.identifier.issn1520-6149
dc.identifier.urihttps://qmro.qmul.ac.uk/xmlui/handle/123456789/93422
dc.description.abstractWe present methods to estimate the physical properties of house-hold containers and their fillings manipulated by humans. We use a lightweight, pre-trained convolutional neural network with coordinate attention as a backbone model of the pipelines to accurately locate the object of interest and estimate the physical properties in the CORSMAL Containers Manipulation (CCM) dataset. We address the filling type classification with audio data and then combine this information from audio with video modalities to address the filling level classification. For the container capacity, dimension, and mass estimation, we present a data augmentation and consistency measurement to alleviate the over-fitting issue in the CCM dataset caused by the limited number of containers. We augment the training data using an object-of-interest-based re-scaling that increases the variety of physical values of the containers. We then perform the consistency measurement to choose a model with low prediction variance in the same containers under different scenes, which ensures the generalization ability of the model. Our method improves the generalization ability of the models to estimate the property of the containers that were not previously seen in the training.en_US
dc.format.extent9147 - 9151
dc.publisherIEEEen_US
dc.subjectMulti-modal perceptionen_US
dc.subjectgeneralizationen_US
dc.subjectdata augmentationen_US
dc.titleIMPROVING GENERALIZATION OF DEEP NETWORKS FOR ESTIMATING PHYSICAL PROPERTIES OF CONTAINERS AND FILLINGSen_US
dc.typeConference Proceedingen_US
dc.rights.holder© 2022 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.
dc.identifier.doi10.1109/ICASSP43922.2022.9747349
pubs.author-urlhttps://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=PARTNER_APP&SrcAuth=LinksAMR&KeyUT=WOS:000864187909092&DestLinkType=FullRecord&DestApp=ALL_WOS&UsrCustomerID=612ae0d773dcbdba3046f6df545e9f6aen_US
pubs.notesNot knownen_US
pubs.publication-statusPublisheden_US
rioxxterms.funderDefault funderen_US
rioxxterms.identifier.projectDefault projecten_US
qmul.funderJADE: Joint Academic Data science Endeavour - 2::Engineering and Physical Sciences Research Councilen_US


Files in this item

FilesSizeFormatView

There are no files associated with this item.

This item appears in the following Collection(s)

Show simple item record