An Initial System for Integrated Synthesis of Mandarin, Min-nan, and Hakka Speech

國語、閩南語、客家語之整合式語音合成系統

 

Hung-Yan Gu, e-mail: guhy@mail.ntust.edu.tw

 

感謝國科會計畫支援,  2006/07

 

 

 

 

 

 

 

 A. Examples of Synthetic Mandarin Speech    B.   C.   D.   E.   F.

(國語合成語音例子)


* Fixed synthesis unit, i.e., each Mandarin syllable is uttered and recorded only once.

   (使用固定的合成單元,即409種國語音節中的每一種,都只 錄音、儲存一次)
   Mandarin has 409 different syllables when tones not distinguished.
   The 409 syllables are uttered by a female.

Pitch-Contour Model (軌跡模型)

Min-Nan model adapted to Mandarin

閩南語訓練之模型調適成國語模型

text file

文字檔

Synthetic speech

合成語音

Synthetic speech

合成語音

Original Mandarin-trained model

原始國語語料訓練之模型

text file

文字檔

Synthetic speech V

合成語音

Synthetic speech V

合成語音

Direct concatenation of recorded syllables (原始錄音音節直接串接): zia_ren_de_shou.wav

Time-warping method (間軸校正方法)
Synthesis Method: TIPW (a variant of PSOLA), 合 成方法: TIPW

Linear (線性)

text file

Synthetic speech

Synthetic speech

Piece-wise linear (片段線 性)

text file

Synthetic speech V

Synthetic speech V

Direct concatenation of recorded syllables (原始錄音音節直接串 接): dou_ziang_dien.wav

 
 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

B. Examples of Synthetic Min-Nan Speech

(閩南語合成語音例子)
* Fixed synthesis unit.
   Syllables are uttered by a female.

(1)

text file

  Synthetic speech

(2)

text file

(a)Synthetic speech,   (b)Synthetic speech

(3)

text file

 (a)Synthetic speech,   (b)Synthetic speech

 

 

 

C. Examples of Synthetic Hakka (sea-land accent) Speech
(海陸腔客家語合成語音例子)
* Fixed synthesis unit.
   Syllables are uttered by a male.

(1) Using adapted Min-Nan pitch model

(使用調適過的閩南語基軌跡模型)

text file

Synthetic speech

(2) Using adapted Min-Nan pitch model

(使用調適過的閩南語基軌跡模型)

text file

Synthetic speech

 

 

 
 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

D. Examples of Synthetic Hakka (four-country accent) Speech
(四縣腔客家語合成語音例子)
* Fixed synthesis unit.
   Using the same syllables as in C.

(1) Using adapted Min-Nan pitch model

(使用調適過的閩南語基軌跡模型)

text file

Synthetic speech

(2) Using adapted Min-Nan pitch model

(使用調適過的閩南語基軌跡模型)

text file

Synthetic speech

 

 

 

 E. 技術簡介

信號波形 合成
signal waveform synthesis

(a) TIPW
(b) piece-wise linear time warping

variant of PSOLA

軌跡 模型
pitch-contour model

(a)HMM & ANN
(b)adapt Min-nan trained model to Hakka

 

振幅和
amplitude & duration

rule-based

 

 

Synthesis flowchart (合成流程圖):

 

synthesis flowchart

 

 

F. 線上測試

    => http://guhy.csie.ntust.edu.tw/hmtts/speak.html
.