Speaker Identification Using Discriminative Features and Sparse Representation

Yu Hao Chin, Jia Ching Wang, Chien Lin Huang, Kuang Yao Wang, Chung Hsien Wu

研究成果: Article同行評審

16 引文 斯高帕斯(Scopus)

摘要

Speaker identification is an important topic with relevance to various disciplines. This paper proposes a novel speaker identification system, which consists of two major components-feature extraction and sparse representation classifier (SRC). Although SRC has been utilized for many classification purposes, few studies have provided insight into the link between the commonly used speaker identification feature, i-vector, and SRC. To combine i-vector and SRC sufficiently, we use probabilistic principal component analysis and Bartlett test to extract high-quality i-vector to construct a discriminative dictionary in SRC, supporting effective speaker identification. Besides improving dictionary from the i-vector aspect, we also utilize dictionary learning to further enhance the content of the dictionary. Two learning methods are proposed-robust principal component analysis dictionary and SVD-dictionary. Furthermore, we propose constructing a noise dictionary and combine it with the original dictionary to absorb and suppress noise when implementing the sparse coding. Various coding methods are utilized and analyzed. A comparison to the methods for speaker identification reveals that the proposed method outperforms the baselines and confirms its feasibility.

原文English
文章編號7872470
頁(從 - 到)1979-1987
頁數9
期刊IEEE Transactions on Information Forensics and Security
12
發行號8
DOIs
出版狀態Published - 2017 8月

All Science Journal Classification (ASJC) codes

  • 安全、風險、可靠性和品質
  • 電腦網路與通信

指紋

深入研究「Speaker Identification Using Discriminative Features and Sparse Representation」主題。共同形成了獨特的指紋。

引用此