4.4 Article

Hierarchical Scheme for Vehicle Make and Model Recognition

Journal

TRANSPORTATION RESEARCH RECORD
Volume 2675, Issue 7, Pages 363-376

Publisher

SAGE PUBLICATIONS INC
DOI: 10.1177/03611981211019743

Keywords

-

Funding

  1. National Natural Science Foundation of China [61966035]
  2. National Science Foundation of China [U1803261]
  3. Xinjiang Uygur Autonomous Region Innovation Team [XJEDU2017T002]
  4. Autonomous Region Graduate Innovation Project [XJ2019G072]

Ask authors/readers for more resources

This paper presents a hierarchical scheme for vehicle make and model recognition, which includes a feature extraction framework, hierarchical loss function, and method of collecting and classifying images to improve accuracy and real-time performance. Experimental results demonstrate the method's superiority in recognition accuracy and frames per second for the Stanford Cars public dataset.
A vehicle make and model recognition (VMMR) system is a common requirement in the field of intelligent transportation systems (ITS). However, it is a challenging task because of the subtle differences between vehicle categories. In this paper, we propose a hierarchical scheme for VMMR. Specifically, the scheme consists of (1) a feature extraction framework called weighted mask hierarchical bilinear pooling (WMHBP) based on hierarchical bilinear pooling (HBP) which weakens the influence of invalid background regions by generating a weighted mask while extracting features from discriminative regions to form a more robust feature descriptor; (2) a hierarchical loss function that can learn the appearance differences between vehicle brands, and enhance vehicle recognition accuracy; (3) collection of vehicle images from the Internet and classification of images with hierarchical labels to augment data for solving the problem of insufficient data and low picture resolution and improving the model's generalization ability and robustness. We evaluate the proposed framework for accuracy and real-time performance and the experiment results indicate a recognition accuracy of 95.1% and an FPS (frames per second) of 107 for the framework for the Stanford Cars public dataset, which demonstrates the superiority of the method and its availability for ITS.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.4
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available