Yinduo Wang
Learning discriminative domain-invariant prototypes for generalized zero shot learning
Wang, Yinduo; Zhang, Haofeng; Zhang, Zheng; Long, Yang; Shao, Ling
Abstract
Zero-shot learning (ZSL) aims to recognize objects of target classes by transferring knowledge from source classes through the semantic embeddings bridging. However, ZSL focuses the recognition only on unseen classes, which is unreasonable in realistic scenarios. A more reasonable way is to recognize new samples on combined domains, namely Generalized Zero Shot Learning (GZSL). Due to the fact that the source domain and target domain are disjoint and have unrelated classes potentially, ZSL and GZSL often suffer from the problem of projection domain shift. Besides, some semantic embeddings of prototypes are very similar, which makes the recognition less discriminative. To circumvent these issues, in this paper, we propose a novel method, called Learning Discriminative Domain-Invariant Prototypes (DDIP). In DDIP, both target and source domains are combined and projected into a hyper-spherical space, which is automatically learned by a regularized dictionary learning. In addition, an orthogonal constraint is employed to the latent hyper-spherical space to ensure all the class prototypes, including seen classes and unseen classes, to be orthogonal to each other to make them more discriminative. Extensive experiments on four popular benchmark and a large-scale datasets are conducted on both GZSL and standard ZSL settings, and the results show that our DDIP can outperform the state-of-the-art methods.
Citation
Wang, Y., Zhang, H., Zhang, Z., Long, Y., & Shao, L. (2020). Learning discriminative domain-invariant prototypes for generalized zero shot learning. Knowledge-Based Systems, 196, Article 105796. https://doi.org/10.1016/j.knosys.2020.105796
Journal Article Type | Article |
---|---|
Acceptance Date | Mar 19, 2020 |
Online Publication Date | Mar 24, 2020 |
Publication Date | May 21, 2020 |
Deposit Date | Mar 25, 2020 |
Publicly Available Date | Mar 24, 2021 |
Journal | Knowledge-Based Systems |
Print ISSN | 0950-7051 |
Electronic ISSN | 1872-7409 |
Publisher | Elsevier |
Peer Reviewed | Peer Reviewed |
Volume | 196 |
Article Number | 105796 |
DOI | https://doi.org/10.1016/j.knosys.2020.105796 |
Public URL | https://durham-repository.worktribe.com/output/1274504 |
Files
Accepted Journal Article
(2.7 Mb)
PDF
Publisher Licence URL
http://creativecommons.org/licenses/by-nc-nd/4.0/
Copyright Statement
© 2020 This manuscript version is made available under the CC-BY-NC-ND 4.0 license http://creativecommons.org/licenses/by-nc-nd/4.0/
You might also like
EfficientTDNN: Efficient Architecture Search for Speaker Recognition
(2022)
Journal Article
Kernelized distance learning for zero-shot recognition
(2021)
Journal Article
A plug-in attribute correction module for generalized zero-shot learning
(2020)
Journal Article
Semantic combined network for zero-shot scene parsing
(2019)
Journal Article
Downloadable Citations
About Durham Research Online (DRO)
Administrator e-mail: dro.admin@durham.ac.uk
This application uses the following open-source libraries:
SheetJS Community Edition
Apache License Version 2.0 (http://www.apache.org/licenses/)
PDF.js
Apache License Version 2.0 (http://www.apache.org/licenses/)
Font Awesome
SIL OFL 1.1 (http://scripts.sil.org/OFL)
MIT License (http://opensource.org/licenses/mit-license.html)
CC BY 3.0 ( http://creativecommons.org/licenses/by/3.0/)
Powered by Worktribe © 2025
Advanced Search