Speech-annotated photo retrieval using syllable-transformed patterns

Chung Hsien Wu, Chien Lin Huang, Wei Chuan Lee, Yu Sheng Lai

Research output: Contribution to journal › Article › peer-review

3 Citations (Scopus)

Abstract

This study presents a novel indexing and retrieval scheme for digital photos with speech annotations, based on syllable-transformed image-like patterns. Speech recognition errors and out-of-vocabulary (OOV) problems generally result in incorrect indexing and degrade retrieval performance. To deal with recognition errors, the recognized n-best candidates are transformed into an image-like pattern using multidimensional scaling. A hybrid mechanism integrating syllables, characters, words, and image-like patterns is exploited for speech indexing and retrieval. Experiments show that the hybrid indexing method integrating the syllable-transformed image-like patterns can achieve better results than previous indexing methods.
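The abstract names multidimensional scaling as the step that turns recognition candidates into a pattern, but it does not give the dissimilarity measure or the pattern layout. As a rough illustration only, the following is a minimal sketch of classical (Torgerson) multidimensional scaling with NumPy. The function name `classical_mds` and the toy matrix `toy_dist` are hypothetical and are not taken from the paper.

```python
import numpy as np


def classical_mds(dist, n_components=2):
    """Embed items in n_components dimensions from a symmetric distance matrix."""
    dist = np.asarray(dist, dtype=float)
    n = dist.shape[0]
    # Double-center the squared distances to obtain a Gram (inner-product) matrix.
    centering = np.eye(n) - np.ones((n, n)) / n
    gram = -0.5 * centering @ (dist ** 2) @ centering
    # eigh returns eigenvalues in ascending order; keep the largest ones.
    eigvals, eigvecs = np.linalg.eigh(gram)
    order = np.argsort(eigvals)[::-1][:n_components]
    # Clip small negative eigenvalues caused by rounding or non-Euclidean input.
    scale = np.sqrt(np.clip(eigvals[order], 0.0, None))
    return eigvecs[:, order] * scale


# Hypothetical dissimilarities between four recognized candidates
# (assumption: chosen to be exactly Euclidean in two dimensions).
toy_dist = np.array([
    [0.0, 3.0, 4.0, 5.0],
    [3.0, 0.0, 5.0, 4.0],
    [4.0, 5.0, 0.0, 3.0],
    [5.0, 4.0, 3.0, 0.0],
])
coords = classical_mds(toy_dist)
```

Each row of `coords` is a two-dimensional position for one candidate. Because the toy distances are Euclidean, the pairwise distances between rows match `toy_dist` up to rounding. How such coordinates would be arranged into an image-like pattern is left open here, since the abstract does not describe it.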

Original language: English
Pages (from-to): 6-9
Number of pages: 4
Journal: IEEE Signal Processing Letters
Volume: 16
Issue number: 1
Publication status: Published - 2009

All Science Journal Classification (ASJC) codes

  • Signal Processing
  • Electrical and Electronic Engineering
  • Applied Mathematics
