蔡哲彰
出生地
高雄市
學歷
台北市立大安高工
國立勤益科技大學電子工程系
國立台灣科技大學資訊工程所
興趣
電玩、閱讀、聽音樂
- 指導教授:古鴻炎老師
- 中文題目:國語合成歌聲流暢度改進之研究
- 英文題目:Fluency Improving for Mandarin Singing Voice Synthesis
- 中文摘要:本論文的目標是使用少量的合成單元來合成出流暢的歌聲。在相鄰音節之間,我們提出以反射係數為基礎的頻譜內插方法,來平順連接相鄰音節的共振峰軌跡;在音節內流暢度的改進方面,我們參考以前學長用於語音合成之頻演路徑之概念,修改出適用於歌聲合成的頻演模型。我們也修改了HNM合成程式,以配合上述兩項流暢性改進方法;此外,以更多的語料來訓練ANN抖音參數模型,希望藉以提升合成歌聲的自然度。經由主觀的自然度聽測實驗,所得的評分顯示,使用頻演模型及共振峰軌跡連接處理,的確可以增進歌聲信號的流暢性。
- 英文摘要:In this thesis, the goal is to synthesize fluent singing voice by using a small amount of synthesis units. Between adjacent syllables, we propose a reflection-coefficient based spectrum interpolation method to let the formant traces be smoothly connected. To improve the intra-syllable fluency level of a synthetic syllable, we make use of the concept of spectrum progression proposed for speech synthesis to construct a spectrum progression model suitable for singing voice synthesis. Since the two fluency promoting methods must be realized with signal synthesis, we modify and correct the HNM synthesis program developed by others. In addition, we use a larger corpus to train the ANN vibrato parameter models in order to increase the naturalness level of the synthetic singing voice. According to the results of the listening tests, the score obtained by using spectrum progression model and formant trace connecting processing is indeed higher than those obtained without such processing.
- 研究成果