A deep framework for automatic annotation with application to retail warehouses

Kanika Mahajan, Anima Majumder, Harika Nanduri, Swagat Kumar

Research output: Contribution to conferencePaper

Abstract

The paper presents a novel deep learning framework for automatic annotation and segmentation of densely cluttered objects in a warehouse application use-case as specified by the Amazon Robotics Challenge (ARC) 2017. This framework addresses two challenges of the competition: (1) reducing the amount of manual labour involved in generating a large number of annotated data that could be used for training a deep network and, (2) achieving good segmentation accuracy in a very limited amount of training time ( 30 minutes). These two problems are solved by proposing a deep architecture comprising of Residual Network and Feature Pyramidal based convolutional neural network that helps to retain primitive features along with higher level features obtained from each successive layer. In addition, a framework is proposed using this network to automatically generate a large annotated dataset having different degrees of clutters to carry out multi-class semantic segmentation after training with this machine generated dataset. The proposed framework is shown to provide better segmentation accuracy with lesser training time as compared to the existing state-of-the-art architectures such as PSPNet and Mask R-CNN. The overall working of the proposed architecture is explained by creating a new dataset from the objects specified by the ARC competition. An extensive experiment is also performed using the MIT-Princeton database [22]. Our TCS-ARCDataset [13]is made available online for the convenience of readers.

Original languageEnglish
Publication statusPublished - 1 Jan 2019
Event29th British Machine Vision Conference, BMVC 2018 - Newcastle, United Kingdom
Duration: 3 Sep 20186 Sep 2018

Conference

Conference29th British Machine Vision Conference, BMVC 2018
CountryUnited Kingdom
CityNewcastle
Period3/09/186/09/18

Keywords

  • computer vision
  • Deep learning
  • large dataset
  • Multilayer neural networks
  • Semantics
  • Warehouses

Fingerprint Dive into the research topics of 'A deep framework for automatic annotation with application to retail warehouses'. Together they form a unique fingerprint.

  • Profiles

    Cite this

    Mahajan, K., Majumder, A., Nanduri, H., & Kumar, S. (2019). A deep framework for automatic annotation with application to retail warehouses. Paper presented at 29th British Machine Vision Conference, BMVC 2018, Newcastle, United Kingdom.