Science curiculum vitae personally - University of Da Nang




	Thông tin chung

	English

	Đề tài NC khoa học
	Bài báo, báo cáo khoa học
	Hướng dẫn Sau đại học
	Sách và giáo trình
	Các học phần và môn giảng dạy
	Giải thưởng khoa học, Phát minh, sáng chế
	Khen thưởng
	Thông tin khác

	Tài liệu tham khảo

	Hiệu chỉnh


Số người truy cập: 109,413,626

Introduction of duration models and dynamic features in MGE training for HSMM-based speech synthesis

Tác giả hoặc Nhóm tác giả: Duy Khanh Ninh, Kook Cho and Yoichi Yamashita

Nơi đăng: Proceeding of Spring Meeting of Acoustical Society of Japan; Số: 3-2012;Từ->đến trang: 431-434;Năm: 2012

Lĩnh vực: Công nghệ thông tin; Loại: Báo cáo; Thể loại: Quốc tế

TÓM TẮT

ABSTRACT

Minimum generation error (MGE) training has been proposed for HMM-based TTS to minimize the error between training and generated data. However, two potential problems in current MGE schemes exist. We apply conventional MGE scheme to HSMM- based TTS and propose two solutions. First, we augment the generation error function with the errors of dynamic features to improve the dynamics of generated parameter sequence. Second, we propose a new MGE scheme, where state duration models are used to determine the state sequence for MGE training, to resolve an inconsistency of conventional one when speech is to be synthesized from specified duration of speech segments. Objective evaluations confirm the robustness of our proposed methods.

Địa chỉ: 41 Lê Duẩn Thành phố Đà Nẵng

Điện thoại: (84) 0236 3822 041 ; Email: dhdn@ac.udn.vn