TY - GEN
T1 - Regression-based clustering for hierarchical pitch conversion
AU - Lee, Chung Han
AU - Hsia, Chi Chun
AU - Wu, Chung Hsien
AU - Lin, Mai Chun
PY - 2009
Y1 - 2009
N2 - This study presents a hierarchical pitch conversion method using regression-based clustering for conversion function modeling. The pitch contour of a speech utterance is first extracted and decomposed into sentence-, wordand sub-syllable-level features in a top-down mechanism. The pair-wise source and target pitch feature vectors at each level are then clustered to generate the pitch conversion function. Regression-based clustering, which clusters the feature vectors to achieve a minimum conversion error between the predicted and the real feature vectors is proposed for conversion function generation. A classification and regression tree (CART), incorporating linguistic, phonetic and source prosodic features, is adopted to select the most suitable function for pitch conversion. Several objective and subjective evaluations were conducted and the comparison results to the GMMbased methods for pitch conversion confirm the performance of the proposed regression-based clustering approach.
AB - This study presents a hierarchical pitch conversion method using regression-based clustering for conversion function modeling. The pitch contour of a speech utterance is first extracted and decomposed into sentence-, wordand sub-syllable-level features in a top-down mechanism. The pair-wise source and target pitch feature vectors at each level are then clustered to generate the pitch conversion function. Regression-based clustering, which clusters the feature vectors to achieve a minimum conversion error between the predicted and the real feature vectors is proposed for conversion function generation. A classification and regression tree (CART), incorporating linguistic, phonetic and source prosodic features, is adopted to select the most suitable function for pitch conversion. Several objective and subjective evaluations were conducted and the comparison results to the GMMbased methods for pitch conversion confirm the performance of the proposed regression-based clustering approach.
UR - http://www.scopus.com/inward/record.url?scp=70349223929&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=70349223929&partnerID=8YFLogxK
U2 - 10.1109/ICASSP.2009.4960403
DO - 10.1109/ICASSP.2009.4960403
M3 - Conference contribution
AN - SCOPUS:70349223929
SN - 9781424423545
T3 - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
SP - 3593
EP - 3596
BT - 2009 IEEE International Conference on Acoustics, Speech, and Signal Processing - Proceedings, ICASSP 2009
T2 - 2009 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2009
Y2 - 19 April 2009 through 24 April 2009
ER -