TY - JOUR
T1 - Robot arm grasping using learning-based template matching and self-rotation learning network
AU - Le, Minh Tri
AU - Lien, Jenn Jier James
N1 - Funding Information:
This study was supported in part by the Ministry of Science and Technology (MOST) of Taiwan, R.O.C., under Grant No. MOST 110-2221-E-006-179. The additional support provided by Tongtai Machine & Tool Co., Ltd. (Taiwan) and Contrel Technology Co., Ltd. (Taiwan) is also gratefully acknowledged.
Publisher Copyright:
© 2022, The Author(s), under exclusive licence to Springer-Verlag London Ltd., part of Springer Nature.
PY - 2022/7
Y1 - 2022/7
N2 - Applying deep neural network models to robot-arm grasping tasks requires the laborious and time-consuming annotation of a large number of representative examples in the training process. Accordingly, this work proposes a two-stage grasping model, in which the first stage employs learning-based template matching (LTM) algorithm for estimating the object position, and a self-rotation learning (SRL) network is then proposed to estimate the rotation angle of the grasping objects in the second stage. The LTM algorithm measures similarity between the feature maps of the search and template images which are extracted by a pre-trained model, while the SRL network performs the automatic rotation and labelling of the input data for training purposes. Therefore, the proposed model does not consume an expensive human-annotation process. The experimental results show that the proposed model obtains 92.6% when testing on 2400 pairs of the template and target images. Moreover, in performing practical grasping tasks on a NVidia Jetson TX2 developer kit, the proposed model achieves a higher accuracy (88.5%) than other grasping approaches on a split of Cornell-grasp dataset.
AB - Applying deep neural network models to robot-arm grasping tasks requires the laborious and time-consuming annotation of a large number of representative examples in the training process. Accordingly, this work proposes a two-stage grasping model, in which the first stage employs learning-based template matching (LTM) algorithm for estimating the object position, and a self-rotation learning (SRL) network is then proposed to estimate the rotation angle of the grasping objects in the second stage. The LTM algorithm measures similarity between the feature maps of the search and template images which are extracted by a pre-trained model, while the SRL network performs the automatic rotation and labelling of the input data for training purposes. Therefore, the proposed model does not consume an expensive human-annotation process. The experimental results show that the proposed model obtains 92.6% when testing on 2400 pairs of the template and target images. Moreover, in performing practical grasping tasks on a NVidia Jetson TX2 developer kit, the proposed model achieves a higher accuracy (88.5%) than other grasping approaches on a split of Cornell-grasp dataset.
UR - http://www.scopus.com/inward/record.url?scp=85131404750&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85131404750&partnerID=8YFLogxK
U2 - 10.1007/s00170-022-09374-y
DO - 10.1007/s00170-022-09374-y
M3 - Article
AN - SCOPUS:85131404750
VL - 121
SP - 1915
EP - 1926
JO - International Journal of Advanced Manufacturing Technology
JF - International Journal of Advanced Manufacturing Technology
SN - 0268-3768
IS - 3-4
ER -