Image-to-MIDI mapping based on dynamic fuzzy color segmentation for visually impaired people

Chieh Li Chen, Yan Fa Liao, Chung Li Tai

Research output: Contribution to journalArticlepeer-review

11 Citations (Scopus)

Abstract

In this paper, the RGB ratio is defined according to a reference color so that an image can be transformed from a conventional color space to the RGB ratio space. Different to traditional distance measurement, a road color model is determined by an ellipse area in the RGB ratio space enclosed by the estimated boundaries. The proposed dynamic fuzzy logic, where fuzzy membership functions are defined according to estimated boundaries, is introduced to implement clustering rules, such that each pixel will have its own fuzzy membership function corresponding to its intensity. A basic neural network is trained and used to achieve parameter optimization. Experimental results for road detection demonstrate the robustness of the proposed approach to variations in intensity. To provide obstacle information, especially for visually impaired people, Musical Instrument Digital Interface (MIDI) is introduced as the sound generator, and image-to-MIDI mapping algorithm is proposed. Experimental results show that the proposed method can adapt to various road types, and the resulting audio information successfully indicates the position and size of obstacles.

Original languageEnglish
Pages (from-to)549-560
Number of pages12
JournalPattern Recognition Letters
Volume32
Issue number4
DOIs
Publication statusPublished - 2011 Mar 1

All Science Journal Classification (ASJC) codes

  • Software
  • Signal Processing
  • Computer Vision and Pattern Recognition
  • Artificial Intelligence

Fingerprint

Dive into the research topics of 'Image-to-MIDI mapping based on dynamic fuzzy color segmentation for visually impaired people'. Together they form a unique fingerprint.

Cite this