Product recogniton is now widely used . The way of shopping by taking a photo has been adopted by many people. And the unmanned settlement platform has entered the major supermarkets, which is also supported by product recognition technology. The technology is about the process of "product detection + product identification". The product detection module is responsible for detecting potential product areas, and the product identification model is responsible for identifying the main body detected by the product detection module. The recognition module uses the retrieval method to get the similarity rank of product in database and the query image . This document mainly introduces the feature extraction part of product pictures.
See the pipline of feature learning for details.
The config file: ResNet50_vd_Aliproduct.yaml
The details are as follows.
RandomCrop
: 224x224RandomFlip
Normlize
: normlize images to 0~1
Using ResNet50_vd
as the backbone, whicle is pretrained on ImageNet.
A 512 dimensional embedding FC layer without batchnorm and activation is used.
At present, CELoss
is used. In order to obtain more robust features, other loss will be used for training in the future. Please look forward to it.
This scheme is tested on Aliproduct [1] dataset. This dataset is an open source dataset of Tianchi competition, which is the largest open source product data set at present. It has more than 50000 identification categories and about 2.5 million training pictures.
On this data, the single model Top1 Acc: 85.67%.
[1] Weakly Supervised Learning with Side Information for Noisy Labeled Images. ECCV, 2020.