Intergrated Synthesis of Mandarin, Min-Nan, and Hakka Speech

An Initial System for Integrated Synthesis of Mandarin, Min-nan, and Hakka Speech

國語、閩南語、客家語之整合式語音合成系統

Hung-Yan Gu, e-mail: guhy@mail.ntust.edu.tw

感謝國科會計畫支援, 2006/07

A. Examples of Synthetic Mandarin Speech    B. C. D. E. F.

(國語合成語音例子)

* Fixed synthesis unit, i.e., each Mandarin syllable is uttered and recorded only once.

   (使用固定的合成單元，即409種國語音節中的每一種，都只錄音、儲存一次)
   Mandarin has 409 different syllables when tones not distinguished.
   The 409 syllables are uttered by a female.

Pitch-Contour Model (基週軌跡模型)

Min-Nan model adapted to Mandarin

閩南語訓練之模型調適成國語模型

text file

文字檔

Synthetic speech

合成語音

Synthetic speech

合成語音

Original Mandarin-trained model

原始國語語料訓練之模型

text file

文字檔

Synthetic speech V

合成語音

Synthetic speech V

合成語音

Direct concatenation of recorded syllables (原始錄音音節直接串接): zia_ren_de_shou.wav

Time-warping method (時間軸校正方法)
Synthesis Method: TIPW (a variant of PSOLA), 合成方法: TIPW

Linear (線性)

text file

Synthetic speech

Synthetic speech

Piece-wise linear (片段線性)

text file

Synthetic speech V

Synthetic speech V

Direct concatenation of recorded syllables (原始錄音音節直接串接): dou_ziang_dien.wav

B. Examples of Synthetic Min-Nan Speech

(閩南語合成語音例子)
* Fixed synthesis unit.
   Syllables are uttered by a female.

(1)

text file

Synthetic speech

(2)

text file

(a)Synthetic speech,   (b)Synthetic speech

(3)

text file

(a)Synthetic speech,   (b)Synthetic speech

C. Examples of Synthetic Hakka (sea-land accent) Speech
(海陸腔客家語合成語音例子)
* Fixed synthesis unit.
   Syllables are uttered by a male.

(1) Using adapted Min-Nan pitch model

(使用調適過的閩南語基週軌跡模型)

text file

Synthetic speech

(2) Using adapted Min-Nan pitch model

(使用調適過的閩南語基週軌跡模型)

text file

Synthetic speech

D. Examples of Synthetic Hakka (four-country accent) Speech
(四縣腔客家語合成語音例子)
* Fixed synthesis unit.
Using the same syllables as in C.

(1) Using adapted Min-Nan pitch model

(使用調適過的閩南語基週軌跡模型)

text file

Synthetic speech

(2) Using adapted Min-Nan pitch model

(使用調適過的閩南語基週軌跡模型)

text file

Synthetic speech

E. 技術簡介

信號波形合成
signal waveform synthesis

(a) TIPW
(b) piece-wise linear time warping

variant of PSOLA

基週軌跡模型
pitch-contour model

(a)HMM & ANN
(b)adapt Min-nan trained model to Hakka

振幅和音長
amplitude & duration

rule-based

Synthesis flowchart (合成流程圖):

F. 線上測試

=> http://guhy.csie.ntust.edu.tw/hmtts/speak.html
.