TY - JOUR
T1 - Speaker Identification Using Discriminative Features and Sparse Representation
AU - Chin, Yu Hao
AU - Wang, Jia Ching
AU - Huang, Chien Lin
AU - Wang, Kuang Yao
AU - Wu, Chung Hsien
N1 - Publisher Copyright:
© 2017 IEEE.
PY - 2017/8
Y1 - 2017/8
N2 - Speaker identification is an important topic with relevance to various disciplines. This paper proposes a novel speaker identification system, which consists of two major components-feature extraction and sparse representation classifier (SRC). Although SRC has been utilized for many classification purposes, few studies have provided insight into the link between the commonly used speaker identification feature, i-vector, and SRC. To combine i-vector and SRC sufficiently, we use probabilistic principal component analysis and Bartlett test to extract high-quality i-vector to construct a discriminative dictionary in SRC, supporting effective speaker identification. Besides improving dictionary from the i-vector aspect, we also utilize dictionary learning to further enhance the content of the dictionary. Two learning methods are proposed-robust principal component analysis dictionary and SVD-dictionary. Furthermore, we propose constructing a noise dictionary and combine it with the original dictionary to absorb and suppress noise when implementing the sparse coding. Various coding methods are utilized and analyzed. A comparison to the methods for speaker identification reveals that the proposed method outperforms the baselines and confirms its feasibility.
AB - Speaker identification is an important topic with relevance to various disciplines. This paper proposes a novel speaker identification system, which consists of two major components-feature extraction and sparse representation classifier (SRC). Although SRC has been utilized for many classification purposes, few studies have provided insight into the link between the commonly used speaker identification feature, i-vector, and SRC. To combine i-vector and SRC sufficiently, we use probabilistic principal component analysis and Bartlett test to extract high-quality i-vector to construct a discriminative dictionary in SRC, supporting effective speaker identification. Besides improving dictionary from the i-vector aspect, we also utilize dictionary learning to further enhance the content of the dictionary. Two learning methods are proposed-robust principal component analysis dictionary and SVD-dictionary. Furthermore, we propose constructing a noise dictionary and combine it with the original dictionary to absorb and suppress noise when implementing the sparse coding. Various coding methods are utilized and analyzed. A comparison to the methods for speaker identification reveals that the proposed method outperforms the baselines and confirms its feasibility.
UR - http://www.scopus.com/inward/record.url?scp=85020267724&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85020267724&partnerID=8YFLogxK
U2 - 10.1109/TIFS.2017.2678458
DO - 10.1109/TIFS.2017.2678458
M3 - Article
AN - SCOPUS:85020267724
SN - 1556-6013
VL - 12
SP - 1979
EP - 1987
JO - IEEE Transactions on Information Forensics and Security
JF - IEEE Transactions on Information Forensics and Security
IS - 8
M1 - 7872470
ER -