Automatic Vowel Sequence Reproduction for a Talking Robot Based on PARCOR Coefficient Template Matching |
Tác giả hoặc Nhóm tác giả: Vo Nhu Thanh, Hideyuki Sawada |
Nơi đăng: Journal IEIE Transactions on Smart Processing and Computing; Số: 5;Từ->đến trang: 215-221;Năm: 2016 |
Lĩnh vực: Khoa học công nghệ; Loại: Bài báo khoa học; Thể loại: Quốc tế |
TÓM TẮT |
This paper describes an automatic vowel sequence reproduction system for a talking
robot built to reproduce the human voice based on the working behavior of the human articulatory
system. A sound analysis system is developed to record a sentence spoken by a human (mainly
vowel sequences in the Japanese language) and to then analyze that sentence to give the correct
command packet so the talking robot can repeat it. An algorithm based on a short-time energy
method is developed to separate and count sound phonemes. A matching template using partial
correlation coefficients (PARCOR) is applied to detect a voice in the talking robot’s database
similar to the spoken voice. Combining the sound separation and counting the result with the
detection of vowels in human speech, the talking robot can reproduce a vowel sequence similar to
the one spoken by the human. Two tests to verify the working behavior of the robot are performed.
The results of the tests indicate that the robot can repeat a sequence of vowels spoken by a human
with an average success rate of more than 60%. |
ABSTRACT |
This paper describes an automatic vowel sequence reproduction system for a talking
robot built to reproduce the human voice based on the working behavior of the human articulatory
system. A sound analysis system is developed to record a sentence spoken by a human (mainly
vowel sequences in the Japanese language) and to then analyze that sentence to give the correct
command packet so the talking robot can repeat it. An algorithm based on a short-time energy
method is developed to separate and count sound phonemes. A matching template using partial
correlation coefficients (PARCOR) is applied to detect a voice in the talking robot’s database
similar to the spoken voice. Combining the sound separation and counting the result with the
detection of vowels in human speech, the talking robot can reproduce a vowel sequence similar to
the one spoken by the human. Two tests to verify the working behavior of the robot are performed.
The results of the tests indicate that the robot can repeat a sequence of vowels spoken by a human
with an average success rate of more than 60%. |
[ 2016\2016m07d025_13_6_34IEEKSPC_2016_5_3_20160718163528_1.pdf ] |