Please use this identifier to cite or link to this item: https://hdl.handle.net/2440/137909
Citations
Scopus Web of Science® Altmetric
?
?
Type: Conference paper
Title: Generalised Zero-shot Learning with Multi-modal Embedding Spaces
Author: Felix, R.
Sasdelli, M.
Harwood, B.
Carneiro, G.
Citation: Proceedings of the Digital Image Computing: Techniques and Applications (DICTA 2020), 2020, pp.1-8
Publisher: IEEE
Publisher Place: online
Issue Date: 2020
ISBN: 9781728191089
Conference Name: Digital Image Computing: Techniques and Applications (DICTA) (29 Nov 2020 - 2 Dec 2020 : virtual online)
Statement of
Responsibility: 
Rafael Felix, Michele Sasdelli, Ben Harwood, Gustavo Carneiro
Abstract: Generalised zero-shot learning (GZSL) methods aim to classify previously seen and unseen visual classes by leveraging the semantic information of those classes. In the context of GZSL, semantic information is non-visual data such as a text description of the seen and unseen classes. Previous GZSL methods have explored transformations between visual and semantic spaces, as well as the learning of a latent joint visual and semantic space. In these methods, even though learning has explored a combination of spaces (i.e., visual, semantic or joint latent space), inference tended to focus on using just one of the spaces. By hypothesising that inference must explore all three spaces, we propose a new GZSL method based on a multimodal classification over visual, semantic and joint latent spaces. Another issue affecting current GZSL methods is the intrinsic bias toward the classification of seen classes – a problem that is usually mitigated by a domain classifier which modulates seen and unseen classification. Our proposed approach replaces the modulated classification by a computationally simpler multidomain classification based on averaging the multi-modal calibrated classifiers from the seen and unseen domains. Experiments on GZSL benchmarks show that our proposed GZSL approach achieves competitive results compared with the state-of-the-art.
Rights: ©2020 IEEE
DOI: 10.1109/DICTA51227.2020.9363405
Grant ID: http://purl.org/au-research/grants/arc/FT190100525
http://purl.org/au-research/grants/arc/CE140100016
Published version: https://doi.org/10.1109/DICTA51227.2020
Appears in Collections:Australian Institute for Machine Learning publications
Computer Science publications

Files in This Item:
There are no files associated with this item.


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.