Please use this identifier to cite or link to this item:
https://hdl.handle.net/2440/109518
Citations | ||
Scopus | Web of Science® | Altmetric |
---|---|---|
?
|
?
|
Type: | Journal article |
Title: | Compact representation for large-scale unconstrained video analysis |
Author: | Wang, S. Pan, P. Long, G. Chen, W. Li, X. Sheng, Q.Z. |
Citation: | World Wide Web, 2016; 19(2):231-246 |
Publisher: | Springer Nature |
Issue Date: | 2016 |
ISSN: | 1573-1413 1573-1413 |
Statement of Responsibility: | Sen Wang, Pingbo Pan, Guodong Long, Weitong Chen, Xue Li, Quan Z. Sheng |
Abstract: | Recently, newly invented features (e.g. Fisher vector, VLAD) have achieved state-of-the-art performance in large-scale video analysis systems that aims to understand the contents in videos, such as concept recognition and event detection. However, these features are in high-dimensional representations, which remarkably increases computation costs and correspondingly deteriorates the performance of subsequent learning tasks. Notably, the situation becomes even worse when dealing with large-scale video data where the number of class labels are limited. To address this problem, we propose a novel algorithm to compactly represent huge amounts of unconstrained video data. Specifically, redundant feature dimensions are removed by using our proposed feature selection algorithm. Considering unlabeled videos that are easy to obtain on the web, we apply this feature selection algorithm in a semi-supervised framework coping with a shortage of class information. Different from most of the existing semi-supervised feature selection algorithms, our proposed algorithm does not rely on manifold approximation, i.e. graph Laplacian, which is quite expensive for a large number of data. Thus, it is possible to apply the proposed algorithm to a real large-scale video analysis system. Besides, due to the difficulty of solving the non-smooth objective function, we develop an efficient iterative approach to seeking the global optimum. Extensive experiments are conducted on several real-world video datasets, including KTH, CCV, and HMDB. The experimental results have demonstrated the effectiveness of the proposed algorithm. |
Keywords: | Compact representation; feature selection; large-scale video analysis; semi-supervised learning; concept recognition; event recognition |
Rights: | © Springer Science+Business Media New York 2015 |
DOI: | 10.1007/s11280-015-0354-0 |
Published version: | http://dx.doi.org/10.1007/s11280-015-0354-0 |
Appears in Collections: | Aurora harvest 8 Computer Science publications |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
RA_hdl_109518.pdf Restricted Access | Restricted Access | 785.89 kB | Adobe PDF | View/Open |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.