Thông tin chung

  English

  Đề tài NC khoa học
  Bài báo, báo cáo khoa học
  Hướng dẫn Sau đại học
  Sách và giáo trình
  Các học phần và môn giảng dạy
  Giải thưởng khoa học, Phát minh, sáng chế
  Khen thưởng
  Thông tin khác

  Tài liệu tham khảo

  Hiệu chỉnh

 
Số người truy cập: 107,053,656

 F0 parameterization of glottalized tones for HMM-based Vietnamese TTS
Tác giả hoặc Nhóm tác giả: Duy Khanh Ninh, Yoichi Yamashita
Nơi đăng: The 16th Annual International Conference of ISCA (Interspeech), Dresden, Germany; Số: 2015;Từ->đến trang: 2202–2206;Năm: 2015
Lĩnh vực: Công nghệ thông tin; Loại: Báo cáo; Thể loại: Quốc tế
TÓM TẮT
A conventional HMM-based TTS system for Hanoi Vietnamese often suffers from the hoarse quality due
to the incomplete F0 parameterization of glottalized tones. As estimating F0 in glottalization is rather
problematic for usual F0 extractors, we propose a pitch marking algorithm where the pitch marks are
propagated from regular regions of speech signal to glottalized one, from which the complete F0
contour of a glottalized tone is derived. The proposed F0 parameterization scheme was confirmed to
significantly reduce the hoarseness whilst improve the tone naturalness of synthetic speech by both
objective and listening tests. The pitch marking algorithm works as a refinement step based on the
results of an F0 extractor. Therefore, the proposed scheme can be combined with any F0 extractor.
ABSTRACT
A conventional HMM-based TTS system for Hanoi Vietnamese often suffers from the hoarse quality due
to the incomplete F0 parameterization of glottalized tones. As estimating F0 in glottalization is rather
problematic for usual F0 extractors, we propose a pitch marking algorithm where the pitch marks are
propagated from regular regions of speech signal to glottalized one, from which the complete F0
contour of a glottalized tone is derived. The proposed F0 parameterization scheme was confirmed to
significantly reduce the hoarseness whilst improve the tone naturalness of synthetic speech by both
objective and listening tests. The pitch marking algorithm works as a refinement step based on the
results of an F0 extractor. Therefore, the proposed scheme can be combined with any F0 extractor.
© Đại học Đà Nẵng
 
 
Địa chỉ: 41 Lê Duẩn Thành phố Đà Nẵng
Điện thoại: (84) 0236 3822 041 ; Email: dhdn@ac.udn.vn