Improved quadruple sparse census transform and adaptive multi-shape aggregation algorithms for precise stereo matching

Chih Hsuan Huang, Jar Ferr Yang

Research output: Contribution to journalArticlepeer-review

5 Citations (Scopus)

Abstract

Stereo matching is an important computer vision tool for three-dimensional (3D) positioning, 3D visual representation, precision detection and recognition with 3D information. For 3D visual representation, for example, the depth image-based rendering (DIBR) needs high accuracy depth maps to synthesise multiple views for 3D naked-eyes displays. It is hard to predict high-precision disparity for smooth regions and occlusion areas by traditional census-based stereo matching methods. In this work, two new improved algorithms, called the improved quadruple sparse census transform and the adaptive multi-shape aggregation, are proposed to achieve a precise stereo matching system. Instead of a binary threshold, the improved quadruple auxiliary census function with an adaptive threshold and the patch mean can raise the discrimination performance for better stereo matching. The selected sparse census patch is proposed to reduce the computation of the pixel-matching cost. For effective aggregation of matching costs, multi-shape windows and a texture-aware algorithm is suggested to decide a suitable window. The proposed adaptive multi-shape aggregation method performs better and solves the discontinuous depth problem occurring near the object boundaries. Experimental results show that the proposed stereo matching system with the proposed quadruple sparse census transform and multi-shape cost aggregation achieves better raw depth maps than the existing algorithms for stereo matching.

Original languageEnglish
Pages (from-to)159-179
Number of pages21
JournalIET Computer Vision
Volume16
Issue number2
DOIs
Publication statusPublished - 2022 Mar

All Science Journal Classification (ASJC) codes

  • Software
  • Computer Vision and Pattern Recognition

Fingerprint

Dive into the research topics of 'Improved quadruple sparse census transform and adaptive multi-shape aggregation algorithms for precise stereo matching'. Together they form a unique fingerprint.

Cite this