now publishers - Dimensionality reduction of visual features for efficient retrieval and classification

APSIPA Transactions on Signal and Information Processing > Vol 5 > Issue 1

Dimensionality reduction of visual features for efficient retrieval and classification

Industrial Technology Advances

Petros T. Boufounos, MERL – Mitsubishi Electric Research Laboratories, USA, Hassan Mansour, MERL – Mitsubishi Electric Research Laboratories, USA, Shantanu Rane, Palo Alto Research Center (PARC), USA, Anthony Vetro, MERL – Mitsubishi Electric Research Laboratories, USA, avetro@merl.com

Suggested Citation

Petros T. Boufounos, Hassan Mansour, Shantanu Rane and Anthony Vetro (2016), "Dimensionality reduction of visual features for efficient retrieval and classification", APSIPA Transactions on Signal and Information Processing: Vol. 5: No. 1, e14. http://dx.doi.org/10.1017/ATSIP.2016.14

Publication Date: 12 Jul 2016

Subjects

Keywords

Randomized embeddings, Nearest neighbors, Quantization, Low-rank matrix factorization, Visual search, Classification

Journal details

Open Access

This is published under the terms of the Creative Commons Attribution licence.

Downloaded: 4546 times

In this article:

Abstract

Visual retrieval and classification are of growing importance for a number of applications, including surveillance, automotive, as well as web and mobile search. To facilitate these processes, features are often computed from images to extract discriminative aspects of the scene, such as structure, texture or color information. Ideally, these features would be robust to changes in perspective, illumination, and other transformations. This paper examines two approaches that employ dimensionality reduction for fast and accurate matching of visual features while also being bandwidth-efficient, scalable, and parallelizable. We focus on two classes of techniques to illustrate the benefits of dimensionality reduction in the context of various industrial applications. The first method is referred to as quantized embeddings, which generates a distance-preserving feature vector with low rate. The second method is a low-rank matrix factorization applied to a sequence of visual features, which exploits the temporal redundancy among feature vectors associated with each frame in a video. Both methods discussed in this paper are also universal in that they do not require prior assumptions about the statistical properties of the signals in the database or the query. Furthermore, they enable the system designer to navigate a rate versus performance trade-off similar to the rate-distortion trade-off in conventional compression.

DOI:10.1017/ATSIP.2016.14

I. INTRODUCTION
II. QUANTIZED EMBEDDINGS OF FEATURE SPACES
III. LOW-RANK MATRIX FACTORIZATION OF VISUAL FEATURES
IV. RELATED WORK
V. CONCLUDING REMARKS

Dimensionality reduction of visual features for efficient retrieval and classification

Share

Journal details

Abstract