Paper
24 July 2018 ProNet: an accurate and light-weight CNN model for retail products recognition
Wei Yi, Yaoran Sun, and Sailing He
Proceedings Volume 10827, Sixth International Conference on Optical and Photonic Engineering (icOPEN 2018); 1082715 (2018) https://doi.org/10.1117/12.2326939
Event: Sixth International Conference on Optical and Photonic Engineering (icOPEN 2018), 2018, Shanghai, China
Abstract
Retail product recognition systems today are mostly built on traditional two-stage computer vision pipelines: hand-crafted features are extracted first and then passed to a classification algorithm that distinguishes the products. Since deep learning methods have achieved state-of-the-art results on many tasks and offer a unified pipeline, applying deep models to product recognition is promising. In this paper, we build a new lightweight CNN architecture named ProNet for this task. The 27-layer ProNet combines the advantages of ResNet and MobileNet; depth-wise separable convolution and residual connections are the two main operations in its design. Depth-wise separable convolution is used to cut down the computation cost, while residual connections help the network learn better feature representations and converge to a better point during training. Compared with other commonly used CNN architectures, ProNet is relatively computationally efficient yet still performs well on several public datasets. We first evaluate the ProNet architecture on the ImageNet dataset, obtaining a top-1 average accuracy of 70.8%. We then transfer the model to the public ALOI dataset and to our own task-specific retail product dataset GroOpt, reaching an average accuracy of 98% on ALOI and 96% on GroOpt, both much higher than traditional SIFT-based methods. These results show that ProNet is an accurate model. To make ProNet more transferable across environments, we apply two strategies: (1) a white-balance augmentation algorithm that randomly changes the RGB ratio of every image, and (2) an additional linear classifier on the top feature maps to help distinguish very similar samples. Trained on the augmented set with this modified model, the improved version, ProNetV2, reaches an accuracy of 99% on both ALOI and GroOpt. We have also deployed ProNetV2 on a smartphone with 2 GB of RAM and tested it under different conditions, including varying illumination and backgrounds, achieving an average accuracy of 96% with a processing time of 0.1 s per image. These results demonstrate the effectiveness and usefulness of the proposed networks.
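The abstract names two building blocks (depth-wise separable convolution combined with residual connections) and a white-balance augmentation that randomly rescales the RGB channels. The sketch below is an illustrative reconstruction in PyTorch, not the authors' released code: the channel sizes, the names DepthwiseSeparableResBlock and random_white_balance, and the max_shift parameter are all assumptions, since the exact 27-layer configuration is not reproduced here.

# Illustrative sketch only; layer sizes are assumptions, not the paper's configuration.
import torch
import torch.nn as nn

class DepthwiseSeparableResBlock(nn.Module):
    """3x3 depthwise conv + 1x1 pointwise conv, with an identity skip connection."""
    def __init__(self, channels: int):
        super().__init__()
        # groups=channels makes the 3x3 convolution depthwise (one filter per channel)
        self.depthwise = nn.Conv2d(channels, channels, kernel_size=3,
                                   padding=1, groups=channels, bias=False)
        # 1x1 pointwise convolution mixes information across channels
        self.pointwise = nn.Conv2d(channels, channels, kernel_size=1, bias=False)
        self.bn1 = nn.BatchNorm2d(channels)
        self.bn2 = nn.BatchNorm2d(channels)
        self.relu = nn.ReLU(inplace=True)

    def forward(self, x):
        out = self.relu(self.bn1(self.depthwise(x)))
        out = self.bn2(self.pointwise(out))
        return self.relu(out + x)  # residual connection

def random_white_balance(img: torch.Tensor, max_shift: float = 0.2) -> torch.Tensor:
    """Randomly rescale the R, G, B channels of a (3, H, W) image in [0, 1];
    a simple form of the white-balance augmentation described in the abstract."""
    gains = 1.0 + (torch.rand(3) * 2 - 1) * max_shift  # per-channel gain in [1 - s, 1 + s]
    return (img * gains.view(3, 1, 1)).clamp(0.0, 1.0)

Factoring a k x k convolution into a per-channel (depthwise) convolution followed by a 1 x 1 pointwise convolution reduces the multiply-accumulate cost to roughly 1/C_out + 1/k^2 of a standard convolution, which is the source of the computational savings the abstract refers to.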
© (2018) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Wei Yi, Yaoran Sun, and Sailing He "ProNet: an accurate and light-weight CNN model for retail products recognition", Proc. SPIE 10827, Sixth International Conference on Optical and Photonic Engineering (icOPEN 2018), 1082715 (24 July 2018); https://doi.org/10.1117/12.2326939
KEYWORDS: Convolution, RGB color model, Image processing, Detection and tracking algorithms, Computer vision technology, Databases, Machine vision