Combined hand gesture - Speech model for human action recognition

Sheng Tzong Cheng, Chih Wei Hsu, Jian Pan Li

Research output: Contribution to journalArticle

3 Citations (Scopus)

Abstract

This study proposes a dynamic hand gesture detection technology to effectively detect dynamic hand gesture areas, and a hand gesture recognition technology to improve the dynamic hand gesture recognition rate. Meanwhile, the corresponding relationship between state sequences in hand gesture and speech models is considered by integrating speech recognition technology with a multimodal model, thus improving the accuracy of human behavior recognition. The experimental results proved that the proposed method can effectively improve human behavior recognition accuracy and the feasibility of system applications. Experimental results verified that the multimodal gesture-speech model provided superior accuracy when compared to the single modal versions.

Original languageEnglish
Pages (from-to)17098-17129
Number of pages32
JournalSensors (Switzerland)
Volume13
Issue number12
DOIs
Publication statusPublished - 2013 Dec 12

Fingerprint

Gestures
Gesture recognition
Hand
human behavior
Technology
Speech recognition
speech recognition

All Science Journal Classification (ASJC) codes

  • Analytical Chemistry
  • Biochemistry
  • Atomic and Molecular Physics, and Optics
  • Instrumentation
  • Electrical and Electronic Engineering

Cite this

Cheng, Sheng Tzong ; Hsu, Chih Wei ; Li, Jian Pan. / Combined hand gesture - Speech model for human action recognition. In: Sensors (Switzerland). 2013 ; Vol. 13, No. 12. pp. 17098-17129.
@article{89150ec02a7f4331b7b9bca5578ae152,
title = "Combined hand gesture - Speech model for human action recognition",
abstract = "This study proposes a dynamic hand gesture detection technology to effectively detect dynamic hand gesture areas, and a hand gesture recognition technology to improve the dynamic hand gesture recognition rate. Meanwhile, the corresponding relationship between state sequences in hand gesture and speech models is considered by integrating speech recognition technology with a multimodal model, thus improving the accuracy of human behavior recognition. The experimental results proved that the proposed method can effectively improve human behavior recognition accuracy and the feasibility of system applications. Experimental results verified that the multimodal gesture-speech model provided superior accuracy when compared to the single modal versions.",
author = "Cheng, {Sheng Tzong} and Hsu, {Chih Wei} and Li, {Jian Pan}",
year = "2013",
month = "12",
day = "12",
doi = "10.3390/s131217098",
language = "English",
volume = "13",
pages = "17098--17129",
journal = "Sensors",
issn = "1424-3210",
publisher = "Multidisciplinary Digital Publishing Institute (MDPI)",
number = "12",

}

Combined hand gesture - Speech model for human action recognition. / Cheng, Sheng Tzong; Hsu, Chih Wei; Li, Jian Pan.

In: Sensors (Switzerland), Vol. 13, No. 12, 12.12.2013, p. 17098-17129.

Research output: Contribution to journalArticle

TY - JOUR

T1 - Combined hand gesture - Speech model for human action recognition

AU - Cheng, Sheng Tzong

AU - Hsu, Chih Wei

AU - Li, Jian Pan

PY - 2013/12/12

Y1 - 2013/12/12

N2 - This study proposes a dynamic hand gesture detection technology to effectively detect dynamic hand gesture areas, and a hand gesture recognition technology to improve the dynamic hand gesture recognition rate. Meanwhile, the corresponding relationship between state sequences in hand gesture and speech models is considered by integrating speech recognition technology with a multimodal model, thus improving the accuracy of human behavior recognition. The experimental results proved that the proposed method can effectively improve human behavior recognition accuracy and the feasibility of system applications. Experimental results verified that the multimodal gesture-speech model provided superior accuracy when compared to the single modal versions.

AB - This study proposes a dynamic hand gesture detection technology to effectively detect dynamic hand gesture areas, and a hand gesture recognition technology to improve the dynamic hand gesture recognition rate. Meanwhile, the corresponding relationship between state sequences in hand gesture and speech models is considered by integrating speech recognition technology with a multimodal model, thus improving the accuracy of human behavior recognition. The experimental results proved that the proposed method can effectively improve human behavior recognition accuracy and the feasibility of system applications. Experimental results verified that the multimodal gesture-speech model provided superior accuracy when compared to the single modal versions.

UR - http://www.scopus.com/inward/record.url?scp=84890186698&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84890186698&partnerID=8YFLogxK

U2 - 10.3390/s131217098

DO - 10.3390/s131217098

M3 - Article

C2 - 24351628

AN - SCOPUS:84890186698

VL - 13

SP - 17098

EP - 17129

JO - Sensors

JF - Sensors

SN - 1424-3210

IS - 12

ER -