4.6 Article

Classification of Trash and Valuables with Machine Vision in Shared Cars

Journal

APPLIED SCIENCES-BASEL
Volume 12, Issue 11, Pages -

Publisher

MDPI
DOI: 10.3390/app12115695

Keywords

vision-based; shared cars; prediction model; classification; convolutional neural network; camera module

Funding

  1. Henry Ford Foundation Finland
  2. Academy of Finland [326346]
  3. Aalto University
  4. EIT Urban Mobility (Smart Trash Detection and Damage Prevention for Shared Mobility) [20007, 21010]

Ask authors/readers for more resources

This study focuses on implementing a vision-based architecture to monitor and detect trash or valuables in shared cars using a convolutional neural network. While achieving an accuracy of 91.43%, misclassifications were observed due to variations in ambient light levels and shadows. Improvements suggested for future research include expanding the training image dataset and enhancing the modularity of the camera unit.
This study focused on the possibility of implementing a vision-based architecture to monitor and detect the presence of trash or valuables in shared cars. The system was introduced to take pictures of the rear seating area of a four-door passenger car. Image capture was performed with a stationary wide-angled camera unit, and image classification was conducted with a prediction model in a remote server. For classification, a convolutional neural network (CNN) in the form of a fine-tuned VGG16 model was developed. The CNN yielded an accuracy of 91.43% on a batch of 140 test images. To determine the correlation among the predictions, a confusion matrix was used, and in addition, for each predicted image, the certainty of the distinct output classes was examined. The execution time of the system, from capturing an image to displaying the results, ranged from 5.7 to 17.2 s. Misclassifications from the prediction model were observed in the results primarily due to the variation in ambient light levels and shadows within the images, which resulted in the target items lacking contrast with their neighbouring background. Developments pertaining to the modularity of the camera unit and expanding the dataset of training images are suggested for potential future research.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available